Phil Nash

Aug 20, 2024 • World Congress 2024

Build RAG from Scratch

You don't need complex tools to start with RAG. This session builds a surprisingly effective system from scratch using basic vectorization and cosine similarity.

#1about 3 minutes

Why large language models need retrieval augmented generation

Large language models have knowledge cutoffs and lack access to private data, a problem solved by providing relevant context at query time using RAG.

#2about 1 minute

How similarity search and vector embeddings power RAG

RAG relies on similarity search, not keyword search, which captures meaning by converting text into numerical representations called vector embeddings.

#3about 6 minutes

Building a simple bag-of-words vectorizer from scratch

A basic vector embedding can be created by tokenizing text, building a vocabulary of unique words, and representing each document as a vector of word counts.

#4about 8 minutes

Comparing document vectors using cosine similarity

Cosine similarity measures the angle between two vectors to determine their semantic closeness by focusing on direction (meaning) rather than magnitude.

#5about 3 minutes

Understanding the limitations of a bag-of-words model

The simple bag-of-words model is sensitive to vocabulary, slow to scale, and fails to capture nuanced semantic meaning like word order or synonyms.

#6about 4 minutes

Using professional embedding models and vector databases

Production RAG systems use sophisticated embedding models and specialized vector databases for efficient, accurate, and scalable similarity search.

#7about 2 minutes

Exploring advanced RAG techniques and other applications

Beyond basic similarity search, techniques like ColBERT and knowledge graphs can improve retrieval accuracy, and vector search can power features like related content recommendations.

Picnic Technologies B.V.
Amsterdam, Netherlands

Intermediate

Senior

Python

Structured Query Language (SQL)

+1

ROSEN Technology and Research Center GmbH
Osnabrück, Germany

Senior

TypeScript

React

+3

Wilken GmbH
Ulm, Germany

Senior

Kubernetes

AI Frameworks

+3

Increasing the value of talk recordings post-event

04:57 MIN

Increasing the value of talk recordings post-event

Cat Herding with Lions and Tigers - Christian Heilmann

Using AI to overcome challenges in systems programming

02:49 MIN

Using AI to overcome challenges in systems programming

AI in the Open and in Browsers - Tarek Ziadé

How AI code generators have become more reliable

04:05 MIN

How AI code generators have become more reliable

AI in the Open and in Browsers - Tarek Ziadé

Using AI prompts to rebuild a classic 8-bit game

05:26 MIN

Using AI prompts to rebuild a classic 8-bit game

WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more

Exploring the role and ethics of AI in gaming

14:06 MIN

Exploring the role and ethics of AI in gaming

Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3

Using AI agents to modernize legacy COBOL systems

06:28 MIN

Using AI agents to modernize legacy COBOL systems

Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3

Building an open source community around AI models

04:28 MIN

Building an open source community around AI models

AI in the Open and in Browsers - Tarek Ziadé

Prompt injection as an unsolved AI security problem

07:39 MIN

Prompt injection as an unsolved AI security problem

AI in the Open and in Browsers - Tarek Ziadé

Featured Partners

Building Blocks of RAG: From Understanding to Implementation

Building Blocks of RAG: From Understanding to Implementation

Ashish Sharma

about a year ago • WeAreDevelopers LIVE

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre - Exploring Advanced Patterns in Retrieval-Augmented Generation

Carl Lapierre

about a year ago • World Congress 2024

Large Language Models ❤️ Knowledge Graphs

Large Language Models ❤️ Knowledge Graphs

Michael Hunger

about a year ago • World Congress 2024

Make it simple, using generative AI to accelerate learning

Make it simple, using generative AI to accelerate learning

Duan Lightfoot

about a year ago • World Congress 2024

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Building Real-Time AI/ML Agents with Distributed Data using Apache Cassandra and Astra DB

Dieter Flick

about 2 years ago • World Congress 2023

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

Graphs and RAGs Everywhere... But What Are They? - Andreas Kollegger - Neo4j

about 9 months ago

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon

about 9 months ago • WeAreDevelopers LIVE

Using LLMs in your Product

Using LLMs in your Product

Daniel Töws

about a year ago • World Congress 2024

Related Articles

View all articles

DC

Daniel Cranney

Developers vs Scammers, Bad Design, AI is Pointless, AJAX is 20 and more - The Best of LIVE 2025 - Part 1

Every Wednesday, we’re joined by guests from around the world to discuss all the going on in the tech industry, and now that the year is wrapping up, we thought we’d take some time to look back on some of our favourites conversations with these thoug...

Developers vs Scammers, Bad Design, AI is Pointless, AJAX is 20 and more - The Best of LIVE 2025 - Part 1

DC

Daniel Cranney

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

This week, we’re continuing our look-back on some of the best moments from the Weekly Developer Show from 2025. Here’s what some of our fantastic guests had to say… Sebastian Gingter cracked open the idea of “slopsquatting” and explained why we shou...

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

DC

Daniel Cranney

Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3

In this, the third and final part of our series looking back on the best bits from the Weekly Developer Show, we dig into some more classic moments from our guests for you to enjoy. Raphael De Lio reminds us that contributing to open source - and sh...

Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3

CH

Chris Heilmann

AI overspill Dec 2026: AI in a JAM, Blocking AI browsers, learning programming languages

Welcome to the AI overspill for December 2025. Here are some of the links that didn’t make it to the WeAreDevelopers Dev Digest but are still interesting to check out. Jonah Glover of Hightouch tried to use Claude to re-create the recreate the 1996 ...

AI overspill Dec 2026: AI in a JAM, Blocking AI browsers, learning programming languages

From learning to earning

Jobs that call for the skills explored in this talk.

AI Systems and MLOps Engineer for Earth Observation

Forschungszentrum Jülich GmbH
Jülich, Germany

Intermediate

Senior

Linux

Docker

AI Frameworks

Machine Learning

AI Software Engineer | Python | RAG | Retrieval Augmented Generation | DAG | Dagster | London, UK

The Rolewe
Charing Cross, United Kingdom

API

Python

Machine Learning

Software Engineer - RAG, Knowledge Graphs & Agentic Systems

Riverty GmbH
Berlin, Germany

Remote

Java

Python

TypeScript

Software Engineer - RAG, Knowledge Graphs & Agentic Systems

Riverty GmbH
Verl, Germany

Remote

Java

Python

TypeScript

Software Engineer - KI & Retrieval (RAG/Azure)

Jurafuchs
Berlin, Germany

Remote

API

Azure

Python

Node.js

+4

AI Engineer (GraphRAG, RAG )

Tecdata
Municipality of Madrid, Spain

Azure

Neo4j

Amazon Web Services (AWS)

Software Engineer - KI & Retrieval (RAG/Azure)

Jurafuchs
Berlin, Germany

Remote

API

Azure

Python

Node.js

+4

Conversational AI Engineer

Coöperatieve Rabobank U.A.
Utrecht, Netherlands

€4K

API

Azure

Senior AI Engineer - LLMs & Agentic Systems (all genders)

Robert Ragge GmbH

Senior

API

Python

Terraform

Kubernetes

A/B testing

+3