Ashish Sharma
Building Blocks of RAG: From Understanding to Implementation
#1about 2 minutes
Tech stack for building a RAG application
The core technologies used for the RAG implementation include Python, Groq for LLM inference, LangChain as a framework, FAISS for the vector database, and Streamlit for the UI.
#2about 1 minute
Understanding the fundamentals of large language models
Large language models are deep learning models pre-trained on vast data, using a transformer architecture with an encoder and decoder to understand and generate human-like text.
#3about 3 minutes
The rapid evolution and adoption of LLMs
The journey of LLMs has accelerated from the 2022 ChatGPT launch to widespread experimentation in 2023 and enterprise production adoption in 2024.
#4about 2 minutes
Key challenges of LLMs like hallucination
Standard LLMs face significant challenges including hallucination, unverifiable sources, and knowledge cutoffs that limit their reliability for enterprise use.
#5about 1 minute
How RAG solves LLM limitations
Retrieval-Augmented Generation addresses LLM weaknesses by retrieving relevant, up-to-date information from external data sources to provide accurate and verifiable responses.
#6about 4 minutes
The data ingestion and processing pipeline
The first stage of RAG involves loading documents, splitting them into manageable chunks, converting those chunks into numerical embeddings, and storing them in a vector database.
#7about 2 minutes
The retrieval and generation process
The second stage of RAG handles user queries by retrieving relevant chunks from the vector store, constructing a detailed prompt with that context, and sending it to the LLM for generation.
#8about 4 minutes
Visualizing the end-to-end RAG architecture
A complete RAG system processes a user's query by creating an embedding, finding similar document chunks in the vector DB, and feeding both the query and context to an LLM to generate a grounded response.
#9about 5 minutes
Demo of a RAG-powered document chatbot
A live demonstration shows a Streamlit application that allows users to upload a PDF and ask questions, receiving answers grounded in the document's content.
#10about 2 minutes
Summary and deploying RAG solutions
A recap of the RAG process is provided, along with considerations for deploying these solutions in enterprise environments using managed cloud services or open-source models.
Related jobs
Jobs that call for the skills explored in this talk.
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
TypeScript
React
+3
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
09:10 MIN
How AI is changing the freelance developer experience
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
02:49 MIN
Using AI to overcome challenges in systems programming
AI in the Open and in Browsers - Tarek Ziadé
03:15 MIN
The future of recruiting beyond talent acquisition
What 2025 Taught Us: A Year-End Special with Hung Lee
04:06 MIN
Using AI to enable human connection in recruiting
Retention Over Attraction: A New Employer Branding Mindset
05:03 MIN
Building and iterating on an LLM-powered product
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
07:39 MIN
Prompt injection as an unsolved AI security problem
AI in the Open and in Browsers - Tarek Ziadé
06:28 MIN
Using AI agents to modernize legacy COBOL systems
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
Featured Partners
Related Videos
Build RAG from Scratch
Phil Nash
Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation
Carl Lapierre
Large Language Models ❤️ Knowledge Graphs
Michael Hunger
Creating Industry ready solutions with LLM Models
Vijay Krishan Gupta & Gauravdeep Singh Lotey
Martin O'Hanlon - Make LLMs make sense with GraphRAG
Martin O'Hanlon
Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB
Dieter Flick
Develop AI-powered Applications with OpenAI Embeddings and Azure Search
Rainer Stropek
Make it simple, using generative AI to accelerate learning
Duan Lightfoot
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

The Rolewe
Charing Cross, United Kingdom
API
Python
Machine Learning


Pathway
Paris, France
Remote
€72-75K
GIT
Python
Unit Testing
+2


Amazon.com Inc.
Senior
R
API
Unix
Perl
Ruby
+7

Deloitte
Leipzig, Germany
Azure
DevOps
Python
Docker
PyTorch
+6

Adarga
Charing Cross, United Kingdom
Intermediate
Python
ASP.NET
PyTorch
PostgreSQL
Kubernetes
+2

Riverty GmbH
Berlin, Germany
Remote
Java
Python
TypeScript

Riverty GmbH
Verl, Germany
Remote
Java
Python
TypeScript