Lessons from a Year of Building with LLMs

This podcast episode explores the challenges and opportunities of evaluating and building with large language models (LLMs). On evaluation, the speakers discuss the need for guidance and tooling, the power of LLM evals in AI product development, and the role of evals and data in AI engineering. On data, they cover the significance of data literacy, the data literacy gap between software engineers and data scientists, the potential of LLMs in data analysis, and the rise of data-centric AI application developers. On practice, they address expectations for AI engineering roles, the factors that can derail a fine-tuning project, RAG systems, the fine-tuning process, building synthetic worlds, trace analysis in LLMOps, trace viewer visualization, and Weave for experiment tracking. They also reflect on the importance of intelligent software, their own collaboration, building end-to-end systems with LLMs, the future of robotics and AI, and the role of iteration in building complex systems.

Outline

Vanishing Gradients

Introduction to the speakers and their interest in LLMs

Evaluating Large Language Models (LLMs) and the Challenges of Writing Evals

The Importance of LLM Evals in AI Product Development

The Misconception of AI Engineer Skills and the Importance of Evals and Data

The Impact of Evals and Fine-tuning Pipeline on AI Development

The Importance of Data Literacy in AI Engineering

The Role of AI and LLMs in Data Analysis and Misconceptions

LLMs and the Future of Intelligent Software

Collaborative Effort and Valuable Insights: A Journey of Distilling Cutting-Edge Knowledge

The Rise of Data-Centric AI Application Development

The Importance of Fine-tuning and Data Analysis in AI Engineering Roles

The Challenges and Simplicity of Fine-tuning and the Importance of Evaluation Frameworks

The Role of RAG Systems in Providing Relevant Content and Deeper Problem-Solving

Fine-Tuning a Model with Limited Curated Data and LLMs for the Honeycomb Query Language

Building Synthetic Worlds and Critical Process Considerations in LLMs

Trace Analysis and the Flexibility of Weave Query Language in LLMOps

Hex Notebooks and LangSmith: Building Your Own Tools in the Data Warehouse

Importance of Weave for Experiment Tracking and Establishing Trust in AI Applications

Importance of Building Systems and Using LLMs for Successful Project Implementation

The Gap Between Software Engineers and Data Scientists in Data Literacy

Exciting Future Possibilities in Robotics and AI

The Importance of Iteration in Building Complex Systems

00:03 Introduction to the speakers and their interest in LLMs

07:39 Evaluating Large Language Models (LLMs) and the Challenges of Writing Evals

14:41 The Importance of LLM Evals in AI Product Development

22:32 The Misconception of AI Engineer Skills and the Importance of Evals and Data

32:09 The Impact of Evals and Fine-tuning Pipeline on AI Development

41:03 The Importance of Data Literacy in AI Engineering

48:38 The Role of AI and LLMs in Data Analysis and Misconceptions

55:29 LLMs and the Future of Intelligent Software

1:04:08 Collaborative Effort and Valuable Insights: A Journey of Distilling Cutting-Edge Knowledge

1:12:27 The Rise of Data-Centric AI Application Development

1:20:07 The Importance of Fine-tuning and Data Analysis in AI Engineering Roles

1:29:29 The Challenges and Simplicity of Fine-tuning and the Importance of Evaluation Frameworks

1:39:01 The Role of RAG Systems in Providing Relevant Content and Deeper Problem-Solving

1:47:03 Fine-Tuning a Model with Limited Curated Data and LLMs for the Honeycomb Query Language

1:52:53 Building Synthetic Worlds and Critical Process Considerations in LLMs

2:01:40 Trace Analysis and the Flexibility of Weave Query Language in LLMOps

2:09:06 Hex Notebooks and LangSmith: Building Your Own Tools in the Data Warehouse

2:16:22 Importance of Weave for Experiment Tracking and Establishing Trust in AI Applications

2:24:56 Importance of Building Systems and Using LLMs for Successful Project Implementation

2:31:50 The Gap Between Software Engineers and Data Scientists in Data Literacy

2:38:39 Exciting Future Possibilities in Robotics and AI

2:46:11 The Importance of Iteration in Building Complex Systems