Phil Nash
Build RAG from Scratch
#1about 3 minutes
Why large language models need retrieval augmented generation
Large language models have knowledge cutoffs and lack access to private data, a problem solved by providing relevant context at query time using RAG.
#2about 1 minute
How similarity search and vector embeddings power RAG
RAG relies on similarity search, not keyword search, which captures meaning by converting text into numerical representations called vector embeddings.
#3about 6 minutes
Building a simple bag-of-words vectorizer from scratch
A basic vector embedding can be created by tokenizing text, building a vocabulary of unique words, and representing each document as a vector of word counts.
#4about 8 minutes
Comparing document vectors using cosine similarity
Cosine similarity measures the angle between two vectors to determine their semantic closeness by focusing on direction (meaning) rather than magnitude.
#5about 3 minutes
Understanding the limitations of a bag-of-words model
The simple bag-of-words model is sensitive to vocabulary, slow to scale, and fails to capture nuanced semantic meaning like word order or synonyms.
#6about 4 minutes
Using professional embedding models and vector databases
Production RAG systems use sophisticated embedding models and specialized vector databases for efficient, accurate, and scalable similarity search.
#7about 2 minutes
Exploring advanced RAG techniques and other applications
Beyond basic similarity search, techniques like ColBERT and knowledge graphs can improve retrieval accuracy, and vector search can power features like related content recommendations.
Related jobs
Jobs that call for the skills explored in this talk.
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
TypeScript
React
+3
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
02:49 MIN
Using AI to overcome challenges in systems programming
AI in the Open and in Browsers - Tarek Ziadé
04:05 MIN
How AI code generators have become more reliable
AI in the Open and in Browsers - Tarek Ziadé
05:26 MIN
Using AI prompts to rebuild a classic 8-bit game
WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more
14:06 MIN
Exploring the role and ethics of AI in gaming
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
06:28 MIN
Using AI agents to modernize legacy COBOL systems
Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3
04:28 MIN
Building an open source community around AI models
AI in the Open and in Browsers - Tarek Ziadé
07:39 MIN
Prompt injection as an unsolved AI security problem
AI in the Open and in Browsers - Tarek Ziadé
Featured Partners
Related Videos
Building Blocks of RAG: From Understanding to Implementation
Ashish Sharma
Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation
Carl Lapierre
Large Language Models ❤️ Knowledge Graphs
Michael Hunger
Make it simple, using generative AI to accelerate learning
Duan Lightfoot
Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB
Dieter Flick
Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j
Martin O'Hanlon - Make LLMs make sense with GraphRAG
Martin O'Hanlon
Using LLMs in your Product
Daniel Töws
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

The Rolewe
Charing Cross, United Kingdom
API
Python
Machine Learning

Riverty GmbH
Berlin, Germany
Remote
Java
Python
TypeScript

Riverty GmbH
Verl, Germany
Remote
Java
Python
TypeScript





Robert Ragge GmbH
Senior
API
Python
Terraform
Kubernetes
A/B testing
+3