Iulia Feroli

Harry Potter and the Elastic Semantic Search

How can you find text about bravery with a negative sentiment? Learn to build a semantic search engine using Elasticsearch and the world of Harry Potter.

#1 about 3 minutes

The evolution of NLP from early models to modern LLMs

Tracing the rapid advancement of natural language processing from early models like Word2Vec to the powerful generative AI we see today.

#2 about 5 minutes

How vector embeddings represent language as numbers

Vector embeddings turn words and sentences into numerical arrays, allowing computers to understand semantic relationships through mathematical operations.
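To make "mathematical operations" on words concrete, here is a minimal sketch of the classic king/queen analogy. The three-dimensional vectors are invented by hand for illustration (axes: royalty, male, female); a real embedding model learns hundreds of dimensions and would only land near the target word, not exactly on it.

```python
# Hand-made 3-dimensional "embeddings"; the axes are [royalty, male, female].
# These values are invented for illustration, not produced by any model.
toy_embeddings = {
    "king": [1, 1, 0],
    "queen": [1, 0, 1],
    "man": [0, 1, 0],
    "woman": [0, 0, 1],
}


def subtract(a, b):
    """Element-wise a - b."""
    return [x - y for x, y in zip(a, b)]


def add(a, b):
    """Element-wise a + b."""
    return [x + y for x, y in zip(a, b)]


# "king" - "man" + "woman"
result = add(
    subtract(toy_embeddings["king"], toy_embeddings["man"]),
    toy_embeddings["woman"],
)

# Look up which word, if any, has exactly this vector.
matches = [word for word, vec in toy_embeddings.items() if vec == result]
print(matches)
```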

#3 about 7 minutes

Using vector similarity and LLMs for semantic operations

The distance between vectors in an embedding space represents semantic similarity, enabling operations like finding related concepts or answering questions.
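As a rough sketch of how vector distance stands in for semantic similarity, the snippet below ranks a few passages by cosine similarity to a query vector. The passages and their two-dimensional vectors are made up for the example; in practice both would come from an embedding model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Invented passages with invented 2-d vectors; axes are [bravery, studying].
passages = {
    "Harry faces the dragon": [0.9, 0.1],
    "Hermione studies in the library": [0.1, 0.9],
    "Neville stands up to his friends": [0.8, 0.3],
}

# Pretend this is the embedding of the query "bravery".
query_vector = [1.0, 0.0]

# Most similar passage first.
ranked = sorted(
    passages,
    key=lambda text: cosine_similarity(query_vector, passages[text]),
    reverse=True,
)
for text in ranked:
    print(text)
```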

#4 about 4 minutes

Using Elasticsearch as a vector database for search

Elasticsearch serves as a vector database to store document embeddings and integrates with models from sources like Hugging Face for inference.
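Storing embeddings in Elasticsearch typically means declaring a `dense_vector` field in the index mapping. The sketch below only builds that mapping as a plain dictionary. The index name, field names, and the 384 dimensions are assumptions chosen for illustration and are not taken from the talk; the dimension count must match whichever embedding model is used.

```python
# Hypothetical names and sizes, chosen only for this example.
INDEX_NAME = "harry_potter_passages"
EMBEDDING_DIMS = 384  # assumption: must equal the embedding model's output size

mappings = {
    "properties": {
        "text": {"type": "text"},      # full-text searchable passage
        "book": {"type": "keyword"},   # exact-match filter field
        "text_embedding": {
            "type": "dense_vector",    # vector field type in Elasticsearch 8.x
            "dims": EMBEDDING_DIMS,
            "index": True,             # make the field searchable with k-NN
            "similarity": "cosine",
        },
    }
}

# With a running cluster and the official 8.x Python client, this mapping
# could be applied with something like:
#   Elasticsearch(...).indices.create(index=INDEX_NAME, mappings=mappings)
print(mappings["properties"]["text_embedding"])
```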

#5 about 7 minutes

Demonstrating advanced keyword search with the Python client

The Elasticsearch Python client enables complex, multi-field queries with boolean logic to filter data based on precise criteria before adding semantic layers.
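A multi-field boolean query of this kind might look like the sketch below, which builds the query body as a dictionary. The field names (`text`, `book`, `character`) and the sample values are hypothetical; the talk's actual index layout is not given here.

```python
def build_keyword_query(phrase, book, excluded_character):
    """Build a bool query: match a phrase, restrict to one book,
    and exclude passages tagged with a given character.

    Field names are hypothetical placeholders.
    """
    return {
        "bool": {
            # Scored full-text match on the passage text.
            "must": [{"match": {"text": phrase}}],
            # Unscored exact-match restriction.
            "filter": [{"term": {"book": book}}],
            # Anything matching here is removed from the results.
            "must_not": [{"term": {"character": excluded_character}}],
        }
    }


query = build_keyword_query("bravery", "Book 1", "Voldemort")

# With a running cluster and the official 8.x Python client, the query
# could be sent with something like:
#   Elasticsearch(...).search(index="harry_potter_passages", query=query)
print(query)
```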

#6 about 4 minutes

Enriching data with sentiment analysis pipelines

An inference pipeline can automatically apply a sentiment analysis model to all documents, adding a new field to enable filtering by positive or negative tone.
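One way such a pipeline could be defined is with an ingest pipeline containing an `inference` processor, sketched below as a plain list of processor definitions. The model ID, pipeline ID, and field names are hypothetical; the model would first need to be deployed in the cluster, and the labels it emits depend on the model chosen.

```python
# Hypothetical identifiers, chosen only for this example.
SENTIMENT_MODEL_ID = "my-sentiment-model"
PIPELINE_ID = "sentiment-pipeline"

processors = [
    {
        "inference": {
            "model_id": SENTIMENT_MODEL_ID,
            # The model's prediction is written under this new field.
            "target_field": "sentiment",
            # Map the document's "text" field to the input name that
            # NLP models in Elasticsearch expect by default.
            "field_map": {"text": "text_field"},
        }
    }
]

# With a running cluster and the official 8.x Python client, the pipeline
# could be registered with something like:
#   Elasticsearch(...).ingest.put_pipeline(id=PIPELINE_ID, processors=processors)
# Documents indexed or reindexed through that pipeline would then carry
# the extra "sentiment" field.
print(processors[0]["inference"]["target_field"])
```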

#7 about 4 minutes

Implementing semantic search with embedding models

By converting all text into vectors using an embedding model, you can perform a k-NN search to find the most semantically relevant results for a query.
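A k-NN request along these lines can be sketched as follows. The snippet builds the `knn` section of a search request and lets Elasticsearch embed the query text with a deployed model via `query_vector_builder`. The model ID and field name are assumptions for illustration, and `num_candidates` is set to ten times `k` purely as a starting point.

```python
# Hypothetical identifiers, chosen only for this example.
EMBEDDING_MODEL_ID = "my-embedding-model"
VECTOR_FIELD = "text_embedding"


def build_knn_search(query_text, k=5):
    """Build the knn section of a search request for a text query."""
    return {
        "field": VECTOR_FIELD,
        "k": k,                    # number of nearest neighbours to return
        "num_candidates": k * 10,  # candidates examined per shard
        "query_vector_builder": {
            "text_embedding": {
                "model_id": EMBEDDING_MODEL_ID,
                "model_text": query_text,
            }
        },
    }


knn = build_knn_search("acts of bravery")

# With a running cluster and the official 8.x Python client, this could
# be sent with something like:
#   Elasticsearch(...).search(index="harry_potter_passages", knn=knn)
print(knn)
```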

#8 about 5 minutes

Refining results with hybrid search techniques

Hybrid search combines semantic vector search with traditional keyword filters and exclusions, so results are both semantically relevant and restricted to precise criteria.
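The opening question, text about bravery with a negative sentiment, could be expressed as a k-NN search with a filter attached. The sketch below is one possible shape, not the speaker's code. It assumes a vector field named `text_embedding`, a sentiment label stored as a keyword under `sentiment.predicted_value`, and a model that emits the label `NEGATIVE`; all of these are hypothetical.

```python
# Hypothetical identifiers, chosen only for this example.
EMBEDDING_MODEL_ID = "my-embedding-model"


def build_hybrid_search(query_text, sentiment_label, excluded_phrase, k=5):
    """Build a knn section that is restricted by keyword-style filters."""
    return {
        "field": "text_embedding",
        "k": k,
        "num_candidates": k * 10,
        "query_vector_builder": {
            "text_embedding": {
                "model_id": EMBEDDING_MODEL_ID,
                "model_text": query_text,
            }
        },
        # Only documents passing this bool query are eligible neighbours.
        "filter": {
            "bool": {
                # Assumes the label field is mapped as keyword.
                "filter": [
                    {"term": {"sentiment.predicted_value": sentiment_label}}
                ],
                "must_not": [{"match": {"text": excluded_phrase}}],
            }
        },
    }


knn = build_hybrid_search("bravery", "NEGATIVE", "Quidditch")

# With a running cluster and the official 8.x Python client, this could
# be sent with something like:
#   Elasticsearch(...).search(index="harry_potter_passages", knn=knn)
print(knn["filter"])
```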

#9 about 19 minutes

Audience Q&A on models and implementation

The speaker answers audience questions about ensuring relevance, handling out-of-vocabulary terms, updating data sources, and debugging model outputs.
