YouTube19 Dec 2023
17m

Snowflake BUILD Keynote: Build Confidence In LLM Applications, From Prototype To Production

Podcast cover

Snowflake Developers

This episode explores the challenges and strategies involved in building robust, production-ready applications using large language models (LLMs). Against the backdrop of LLMs' limitations in accessing and processing real-time or private data, the speaker introduces the concept of "context-aware reasoning applications," which integrate LLMs with external data sources and computational tools. More significantly, the discussion details four key methods for incorporating external context into LLMs: instruction prompting, few-shot prompting, retrieval augmented generation (RAG), and fine-tuning. The speaker then outlines various cognitive architectures for LLM applications, ranging from simple code-driven outputs to more sophisticated agent-based systems that allow LLMs to dynamically determine their own action sequences. For instance, the "plan, execute, validate" loop is presented as an example of a cyclical architecture. Finally, the episode highlights the importance of building developer confidence through tracing, monitoring, testing, and collaboration, showcasing tools like LangChain, Streamlit, and LangSmith to facilitate this process. What this means for developers is a more streamlined approach to building reliable and scalable LLM applications, moving beyond prototypes and into full-fledged production environments.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise