Iulia Feroli

Feb 23, 2024 • WeAreDevelopers LIVE

Harry Potter and the Elastic Semantic Search

How can you find text about bravery with a negative sentiment? Learn to build a semantic search engine using Elasticsearch and the world of Harry Potter.

#1about 3 minutes

The evolution of NLP from early models to modern LLMs

Tracing the rapid advancement of natural language processing from early models like Word2Vec to the powerful generative AI we see today.

#2about 5 minutes

How vector embeddings represent language as numbers

Vector embeddings turn words and sentences into numerical arrays, allowing computers to understand semantic relationships through mathematical operations.

#3about 7 minutes

Using vector similarity and LLMs for semantic operations

The distance between vectors in an embedding space represents semantic similarity, enabling operations like finding related concepts or answering questions.

#4about 4 minutes

Using Elasticsearch as a vector database for search

Elasticsearch serves as a vector database to store document embeddings and integrates with models from sources like Hugging Face for inference.

#5about 7 minutes

Demonstrating advanced keyword search with the Python client

The Elasticsearch Python client enables complex, multi-field queries with boolean logic to filter data based on precise criteria before adding semantic layers.

#6about 4 minutes

Enriching data with sentiment analysis pipelines

An inference pipeline can automatically apply a sentiment analysis model to all documents, adding a new field to enable filtering by positive or negative tone.

#7about 4 minutes

Implementing semantic search with embedding models

By converting all text into vectors using an embedding model, you can perform a k-NN search to find the most semantically relevant results for a query.

#8about 5 minutes

Refining results with hybrid search techniques

Hybrid search combines the power of semantic vector search with traditional keyword filters and exclusions to create highly relevant and precise results.

#9about 19 minutes

Audience Q&A on models and implementation

The speaker answers audience questions about ensuring relevance, handling out-of-vocabulary terms, updating data sources, and debugging model outputs.

ZEISS Group
Oberkochen, Germany

Intermediate

Python

Azure

Bonial International GmbH
Berlin, Germany

Senior

Python

Java

envelio
Köln, Germany

Remote

Senior

Python

JavaScript

+1

Demystifying AI by exploring human language processing

07:57 MIN

Demystifying AI by exploring human language processing

WeAreDevelopers LIVE – PHP Is Alive and Kicking and More

Using LLMs for advanced codebase search and understanding

03:01 MIN

Using LLMs for advanced codebase search and understanding

The Alpha‑Developer of Tomorrow: Building the Future of the Software Development Lifecycle

Understanding the role of embeddings and vector databases

02:02 MIN

Understanding the role of embeddings and vector databases

Best practices: Building Enterprise Applications that leverage GenAI

How modern NLP uses Transformer models for search

04:52 MIN

How modern NLP uses Transformer models for search

Hybrid AI: Next Generation Natural Language Processing

Using LLMs to discover datasets and manage metadata

03:02 MIN

Using LLMs to discover datasets and manage metadata

How E.On productionizes its AI model & Implementation of Secure Generative AI.

Using large language models as a learning tool

03:42 MIN

Using large language models as a learning tool

Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen

Moving beyond hype with real-world generative AI

01:06 MIN

Moving beyond hype with real-world generative AI

Semantic AI: Why Embeddings Might Matter More Than LLMs

Exploring practical NLP applications at Slido

08:57 MIN

Exploring practical NLP applications at Slido

Serverless deployment of (large) NLP models

Featured Partners

A beginner’s guide to modern natural language processing

A beginner’s guide to modern natural language processing

Jodie Burchell

about 2 years ago • WeAreDevelopers LIVE

WeAreDevelopers LIVE - Vector Similarity Search Patterns for Efficiency and more

WeAreDevelopers LIVE - Vector Similarity Search Patterns for Efficiency and more

Chris Heilmann, Daniel Cranney, Raphael De Lio & Developer Advocate at Redis

about 6 months ago • WeAreDevelopers LIVE

Develop AI-powered Applications with OpenAI Embeddings and Azure Search

Develop AI-powered Applications with OpenAI Embeddings and Azure Search

Rainer Stropek

about 2 years ago • WeAreDevelopers LIVE

Semantic AI: Why Embeddings Might Matter More Than LLMs

Semantic AI: Why Embeddings Might Matter More Than LLMs

Christian Weyer

about 6 months ago • World Congress 2025

Vision for Websites: Training Your Frontend to See

Vision for Websites: Training Your Frontend to See

Daniel Madalitso Phiri

about 2 years ago • WeAreDevelopers LIVE

Enter the Brave New World of GenAI with Vector Search

Enter the Brave New World of GenAI with Vector Search

Mary Grygleski

about 2 years ago • WeAreDevelopers LIVE

Creating Industry ready solutions with LLM Models

Creating Industry ready solutions with LLM Models

Vijay Krishan Gupta & Gauravdeep Singh Lotey

about 2 years ago • WeAreDevelopers LIVE

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon - Make LLMs make sense with GraphRAG

Martin O'Hanlon

about 11 months ago • WeAreDevelopers LIVE

Related Articles

View all articles

DC

Daniel Cranney

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

This week, we’re continuing our look-back on some of the best moments from the Weekly Developer Show from 2025. Here’s what some of our fantastic guests had to say… Sebastian Gingter cracked open the idea of “slopsquatting” and explained why we shou...

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

CH

Chris Heilmann

SEO in an AI world - Google vs. ChatGPT and survival tips for content creators

In the ever-evolving world of technology, the landscape of search engines and AI tools is shifting at an unprecedented pace. This transformational journey is being shaped by the rising influence of AI-powered tools like ChatGPT, which are increasingl...

SEO in an AI world - Google vs. ChatGPT and survival tips for content creators

CH

Chris Heilmann

With AIs wide open - WeAreDevelopers at All Things Open 2025

Last week our VP of Developer Relations, Chris Heilmann, flew to Raleigh, North Carolina to present at All Things Open . An excellent event he had spoken at a few times in the past and this being the “Lucky 13” edition, he didn’t hesitate to come and...

With AIs wide open - WeAreDevelopers at All Things Open 2025

EM

Eli McGarvie

13 AI Tools You Have to Try

First, it was NFTs, then it was Web3, and now it’s generative AI… it’s probably time to stop collecting pictures of monkeys and kitties. Chatbots and generative AI are the next big thing. This time we’ve jumped on a trend that has real-world applicat...

13 AI Tools You Have to Try

From learning to earning

Jobs that call for the skills explored in this talk.

AI/ Machine Learning Engineer (NLP / LLM)

Ai-powered
Peterborough, United Kingdom

Remote

Senior

Machine Learning

Natural Language Processing

Elasticsearch - Principal Software Engineer II - Search Internals, Lucene

Referral Board
Charing Cross, United Kingdom

Java

Solr

Elasticsearch

Continuous Integration

Software Engineer - Large Language Models

Fastino Labs
Guildford, United Kingdom

Remote

£65-100K

Senior

Docker

PyTorch

TensorFlow

+3

Senior AI Platform Backend Engineer (LLM)

IT Partner España

Remote

API

NLTK

Azure

Scrum

+13

AI Engineer Bootcamp Instructor (ML, DL, MLOps & LLM Systems) - Onsite&Remote

WeCloudData

Remote

Python

Machine Learning

Continuous Integration

AI Engineer - Laravel & Python

Follo
Ghent, Belgium

Remote

€36-48K

API

Python

Laravel

+2

Software Engineer | Language Products | Full-Stack

DeepL
Charing Cross, United Kingdom

Remote

API

React

Python

.NET Core

Software Engineer | Language Products | Full-Stack

DeepL
Charing Cross, United Kingdom

Remote

API

React

Python

TypeScript

+1

Full Stack Developer (AI Infrastructure

Nebius
Amsterdam, Netherlands

API

GIT

JSON

REST

Azure

+3