Sergio Perez & Harshita Seth

Aug 20, 2025 • World Congress 2025

Adding knowledge to open-source LLMs

When is retrieval-augmented generation not enough? Learn the multi-stage process for deeply embedding new knowledge into an open-source LLM.

#1about 4 minutes

Understanding the LLM training pipeline and knowledge gaps

LLMs are trained through pre-training and alignment, but require new knowledge to stay current, adapt to specific domains, and acquire new skills.

#2about 5 minutes

Adding domain knowledge with continued pre-training

Continued pre-training adapts a foundation model to a specific domain by training it further on specialized, unlabeled data using self-supervised learning.

#3about 6 minutes

Developing skills and reasoning with supervised fine-tuning

Supervised fine-tuning uses instruction-based datasets to teach models specific tasks, chat capabilities, and complex reasoning through techniques like chain of thought.

#4about 8 minutes

Aligning models with human preferences using reinforcement learning

Preference alignment refines model behavior using reinforcement learning, evolving from complex RLHF with reward models to simpler methods like DPO.

#5about 2 minutes

Using frameworks like NeMo RL to simplify model alignment

Frameworks like the open-source NeMo RL abstract away the complexity of implementing advanced alignment algorithms like reinforcement learning.

Picnic Technologies B.V.
Amsterdam, Netherlands

Intermediate

Senior

Python

Structured Query Language (SQL)

+1

Wilken GmbH
Ulm, Germany

Senior

Kubernetes

AI Frameworks

+3

ROSEN Technology and Research Center GmbH
Osnabrück, Germany

Senior

TypeScript

React

+3

The hardware requirements for running LLMs locally

03:55 MIN

The hardware requirements for running LLMs locally

AI in the Open and in Browsers - Tarek Ziadé

The evolving role of the machine learning engineer

02:20 MIN

The evolving role of the machine learning engineer

AI in the Open and in Browsers - Tarek Ziadé

Why specialized models outperform generalist LLMs

05:09 MIN

Why specialized models outperform generalist LLMs

AI in the Open and in Browsers - Tarek Ziadé

Unlocking LLM potential with creative prompting techniques

04:59 MIN

Unlocking LLM potential with creative prompting techniques

WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more

Building an open source community around AI models

04:28 MIN

Building an open source community around AI models

AI in the Open and in Browsers - Tarek Ziadé

Building and iterating on an LLM-powered product

05:03 MIN

Building and iterating on an LLM-powered product

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

Prompt injection as an unsolved AI security problem

07:39 MIN

Prompt injection as an unsolved AI security problem

AI in the Open and in Browsers - Tarek Ziadé

Exploring the role and ethics of AI in gaming

14:06 MIN

Exploring the role and ethics of AI in gaming

Devs vs. Marketers, COBOL and Copilot, Make Live Coding Easy and more - The Best of LIVE 2025 - Part 3

Featured Partners

Inside the Mind of an LLM

Inside the Mind of an LLM

Emanuele Fabbiani

about 4 months ago • World Congress 2025

Unlocking the Power of AI: Accessible Language Model Tuning for All

Unlocking the Power of AI: Accessible Language Model Tuning for All

Cedric Clyburn & Legare Kerrison

about a year ago • World Congress 2024

LLMOps-driven fine-tuning, evaluation, and inference with NVIDIA NIM & NeMo Microservices

LLMOps-driven fine-tuning, evaluation, and inference with NVIDIA NIM & NeMo Microservices

Anshul Jindal

about 4 months ago • World Congress 2025

Self-Hosted LLMs: From Zero to Inference

Self-Hosted LLMs: From Zero to Inference

Roberto Carratalá & Cedric Clyburn

about 4 months ago • World Congress 2025

Exploring LLMs across clouds

Exploring LLMs across clouds

Tomislav Tipurić

about 4 months ago • World Congress 2025

Give Your LLMs a Left Brain

Give Your LLMs a Left Brain

Stephen Chin

about a year ago • World Congress 2024

Large Language Models ❤️ Knowledge Graphs

Large Language Models ❤️ Knowledge Graphs

Michael Hunger

about a year ago • World Congress 2024

Three years of putting LLMs into Software - Lessons learned

Three years of putting LLMs into Software - Lessons learned

Simon A.T. Jiménez

about 4 months ago • World Congress 2025

Related Articles

View all articles

LM

Luis Minvielle

What Are Large Language Models?

Developers and writers can finally agree on one thing: Large Language Models, the subset of AIs that drive ChatGPT and its competitors, are stunning tech creations. Developers enjoying the likes of GitHub Copilot know the feeling: this new kind of te...

What Are Large Language Models?

BB

Benedikt Bischof

MLops – Deploying, Maintaining And Evolving Machine Learning Models in Production

Welcome to this issue of the WeAreDevelopers Live Talk series. This article recaps an interesting talk by Bas Geerdink who gave advice on MLOps.‍About the speaker:‍Bas is a programmer, scientist, and IT manager. At ING, he is responsible for the Fast...

MLops – Deploying, Maintaining And Evolving Machine Learning Models in Production

KD

Krissy Davis

The Best Large Language Models on The Market

Large language models are sophisticated programs that enable machines to comprehend and generate human-like text. They have been the foundation of natural language processing for almost a decade. Although generative AI has only recently gained popula...

The Best Large Language Models on The Market

DC

Daniel Cranney

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

This week, we’re continuing our look-back on some of the best moments from the Weekly Developer Show from 2025. Here’s what some of our fantastic guests had to say… Sebastian Gingter cracked open the idea of “slopsquatting” and explained why we shou...

Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2

From learning to earning

Jobs that call for the skills explored in this talk.

AI Systems and MLOps Engineer for Earth Observation

Forschungszentrum Jülich GmbH
Jülich, Germany

Intermediate

Senior

Linux

Docker

AI Frameworks

Machine Learning

AI/ML Engineer Specializing in Large Language Models (Llms)

Xablu
Hengelo, Netherlands

Intermediate

.NET

Python

PyTorch

Blockchain

TensorFlow

+3

Machine Learning Engineer for Language Technologies

Barcelona Supercomputing Center
Barcelona, Spain

Intermediate

Python

PyTorch

Machine Learning

Hybrid Deep Learning Engineer for LLMs & AI

European Tech Recruit
Barcelona, Spain

Intermediate

Machine Learning Engineer - (LLM/Large Language Models) - Zaragoza

European Tech Recruit
Municipality of Zaragoza, Spain

Junior

Python

Docker

PyTorch

Computer Vision

Machine Learning

+1

Senior Principal Machine Learning Engineer, Foundational Models

Merantix
Amer, Spain

Senior

Azure

Spark

Python

PyTorch

Kubernetes

+2

Senior AI/ML Engineer - Generative AI & LLM Solutions

TMC
Utrecht, Netherlands

Senior

API

Azure

Python

Docker

FastAPI

+1

Senior Generative AI Engineer - MLOps & Cloud

Barone, Budge & Dominick (Pty) Ltd
Amsterdam, Netherlands

Senior

Python

Machine Learning

Senior AI/ML Engineer - NLP / LLM / GenAI

cinemo GmbH
Karlsruhe, Germany

Senior

C++

Linux

Python

PyTorch

Machine Learning

+2