This podcast episode covers a range of topics in natural language processing and machine learning, including word2vec, reinforcement learning, data filtering, multimodal models, visual reasoning, tree search algorithms, and more. It also examines recent breakthroughs such as Toolformer and Voyager, as well as ongoing research and advances around large language models (LLMs).
Takeaways
• Key factors in word2vec's success include semi-supervised objectives, fast and weakly synchronized computation, focused use of compute, and treating language as a sequence of dense vectors (see the skip-gram sketch after this list).
• Direct preference optimization (DPO) is a promising approach to preference-based reinforcement learning for language models that is computationally cheaper and easier to implement than PPO-based RLHF (a loss sketch follows this list).
• Repeating data during training is a simple solution to the challenge of training large language models with limited data, achieving performance similar to training on more data while using less compute.
• LLaVA, a visual instruction tuning approach, enables multimodal models to reason about the visual world and respond in natural language.
• Combining language models with search algorithms, as demonstrated by Tree of Thoughts, unlocks more deliberate and powerful reasoning capabilities (a breadth-first search sketch follows this list).
• LLMs are not robust across different tasks or graph structures and struggle with efficient planning.
• S4, a state space model inspired by signal processing, provides numerical stability and unifies the convolutional (CNN) and recurrent (RNN) views of sequence processing, making it well suited to long sequences (a sketch follows this list).
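The word2vec takeaway is easiest to see in the objective itself. Below is a minimal NumPy sketch of skip-gram with negative sampling, the training objective behind word2vec. The vocabulary size, embedding dimension, learning rate, and uniform negative sampling are simplifying assumptions for brevity; the original implementation samples negatives from a smoothed unigram distribution and uses asynchronous updates.

```python
# Minimal sketch of word2vec's skip-gram objective with negative sampling.
# Hyperparameters and the uniform negative-sampling distribution are
# illustrative assumptions, not the original word2vec settings.
import numpy as np

rng = np.random.default_rng(0)
vocab_size, dim = 10_000, 100

W_in = rng.normal(0, 0.1, (vocab_size, dim))   # "input" (center-word) vectors
W_out = rng.normal(0, 0.1, (vocab_size, dim))  # "output" (context-word) vectors

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sgns_step(center, context, num_neg=5, lr=0.025):
    """One SGD step on the skip-gram negative-sampling loss for a
    (center, context) pair drawn from a sliding window over the corpus."""
    v = W_in[center]
    # One positive target plus num_neg randomly sampled negative targets.
    targets = np.concatenate(([context], rng.integers(0, vocab_size, num_neg)))
    labels = np.zeros(num_neg + 1)
    labels[0] = 1.0
    u = W_out[targets]                        # (num_neg + 1, dim)
    scores = sigmoid(u @ v)                   # predicted P(pair is real)
    grad = scores - labels                    # logistic-loss gradient per target
    W_in[center] -= lr * (grad @ u)           # update center vector
    W_out[targets] -= lr * np.outer(grad, v)  # update target vectors
    # e.g. sgns_step(center=12, context=47) for one window pair
```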
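The "cheaper and easier" claim about DPO comes down to its loss being a plain classification objective over preference pairs: no reward model rollouts and no PPO loop. Below is a small PyTorch sketch of that loss, following the DPO paper's formulation; the function name, argument names, and β default are illustrative, and computing the per-sequence log-probabilities is left to the surrounding training loop.

```python
# Sketch of the DPO loss. Inputs are summed log-probabilities of the chosen
# and rejected responses under the policy and the frozen reference model.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Each argument is a tensor of shape (batch,) holding sequence log-probs."""
    # Implicit reward of each response: beta * log(pi_theta / pi_ref).
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize the margin between chosen and rejected with a logistic loss,
    # turning preference learning into binary classification -- no RL loop.
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```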
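To make the "language models plus search" point concrete, here is a hedged sketch of the general pattern behind Tree of Thoughts: a breadth-first (beam) search over partial solutions, where the model both proposes candidate next steps and scores them. `propose_thoughts` and `score_thought` are hypothetical stand-ins for your own LLM calls, not functions from the paper's codebase.

```python
# Breadth-first search over LLM-generated "thoughts" (partial solutions).
from typing import Callable, List

def tree_of_thoughts_bfs(
    problem: str,
    propose_thoughts: Callable[[str, str], List[str]],  # (problem, partial) -> candidate steps
    score_thought: Callable[[str, str], float],          # (problem, partial) -> value estimate
    beam_width: int = 5,
    max_depth: int = 3,
) -> str:
    frontier = [""]  # start from an empty partial solution
    for _ in range(max_depth):
        candidates = []
        for partial in frontier:
            for step in propose_thoughts(problem, partial):
                candidates.append(partial + step)
        # Keep only the most promising partial solutions (the "beam").
        candidates.sort(key=lambda c: score_thought(problem, c), reverse=True)
        frontier = candidates[:beam_width]
    return frontier[0] if frontier else ""
```

In the paper this pattern is instantiated with task-specific prompts for proposing and evaluating thoughts; the search strategy (BFS, DFS, depth, beam width) is chosen per task.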
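The claim that S4 unifies CNNs and RNNs follows from the linear state-space recurrence at its core, which can be stepped through time like an RNN or unrolled into a convolution kernel. The sketch below shows the two views agreeing on a toy example; the random A, B, C matrices stand in for S4's structured, HiPPO-initialized parameterization, which this sketch does not implement.

```python
# Toy linear state-space model: the same map computed two ways.
import numpy as np

rng = np.random.default_rng(0)
state_dim, seq_len = 4, 16
A = rng.normal(0, 0.3, (state_dim, state_dim))
B = rng.normal(0, 0.3, (state_dim, 1))
C = rng.normal(0, 0.3, (1, state_dim))
u = rng.normal(0, 1.0, seq_len)            # scalar input sequence

def ssm_recurrent(u):
    """RNN view: step the hidden state x_k = A x_{k-1} + B u_k, y_k = C x_k."""
    x = np.zeros((state_dim, 1))
    ys = []
    for u_k in u:
        x = A @ x + B * u_k
        ys.append((C @ x).item())
    return np.array(ys)

def ssm_convolutional(u):
    """CNN view: precompute the kernel K_k = C A^k B and convolve with u."""
    K = np.array([(C @ np.linalg.matrix_power(A, k) @ B).item()
                  for k in range(len(u))])
    return np.array([np.dot(K[:k + 1][::-1], u[:k + 1]) for k in range(len(u))])

assert np.allclose(ssm_recurrent(u), ssm_convolutional(u))
```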