Sergio Perez & Harshita Seth
Adding knowledge to open-source LLMs
#1 about 4 minutes
Understanding the LLM training pipeline and knowledge gaps
LLMs are trained through pre-training and alignment, but require new knowledge to stay current, adapt to specific domains, and acquire new skills.
#2 about 5 minutes
Adding domain knowledge with continued pre-training
Continued pre-training adapts a foundation model to a specific domain by training it further on specialized, unlabeled data using self-supervised learning.
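As a rough illustration of the self-supervised objective, the sketch below turns a sequence of unlabeled token IDs into next-token training pairs. The targets are just the inputs shifted by one position, which is why no human labels are required. The function name `next_token_pairs` and the `context_len` parameter are hypothetical, chosen for this example. A real continued pre-training run would add a tokenizer, batching, and a model on top of this.

```python
def next_token_pairs(token_ids, context_len):
    """Build (inputs, targets) pairs for next-token prediction.

    Hypothetical helper for illustration: each target window is the
    input window shifted one token to the right.
    """
    pairs = []
    for start in range(len(token_ids) - context_len):
        inputs = token_ids[start:start + context_len]
        targets = token_ids[start + 1:start + context_len + 1]
        pairs.append((inputs, targets))
    return pairs


# Toy "domain corpus" already mapped to integer token IDs (made-up values).
corpus_ids = [1, 2, 3, 4, 5]
pairs = next_token_pairs(corpus_ids, context_len=3)
```

Here `pairs` holds two windows: `[1, 2, 3]` predicting `[2, 3, 4]`, and `[2, 3, 4]` predicting `[3, 4, 5]`.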
#3 about 6 minutes
Developing skills and reasoning with supervised fine-tuning
Supervised fine-tuning uses instruction-based datasets to teach models specific tasks, chat capabilities, and complex reasoning through techniques like chain of thought.
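One common way to prepare an instruction example for supervised fine-tuning can be sketched as follows: the instruction and the response (which may contain chain-of-thought steps) are joined into one sequence, and a loss mask marks only the response tokens as training targets. The prompt template, the `build_sft_example` helper, and the whitespace tokenizer are all assumptions made for this example. Real chat models define their own templates and tokenizers.

```python
def build_sft_example(instruction, response, tokenize):
    """Return (tokens, loss_mask) for one instruction/response pair.

    Hypothetical helper: the template below is illustrative only.
    A mask value of 1 means the token contributes to the training loss.
    """
    prompt = f"### Instruction:\n{instruction}\n### Response:\n"
    prompt_tokens = tokenize(prompt)
    response_tokens = tokenize(response)
    tokens = prompt_tokens + response_tokens
    # Train only on the response; the prompt is context, not a target.
    loss_mask = [0] * len(prompt_tokens) + [1] * len(response_tokens)
    return tokens, loss_mask


# Whitespace splitting stands in for a real tokenizer (assumption).
tokens, loss_mask = build_sft_example(
    instruction="What is 2+2?",
    response="Step 1: 2+2=4. Answer: 4",
    tokenize=str.split,
)
```

Masking the prompt is a design choice rather than a requirement. Some training setups compute the loss over the full sequence instead.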
#4 about 8 minutes
Aligning models with human preferences using reinforcement learning
Preference alignment refines model behavior using reinforcement learning, evolving from complex RLHF (reinforcement learning from human feedback) pipelines with separate reward models to simpler methods like DPO (direct preference optimization).
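To show why DPO is simpler than a reward-model pipeline, here is a minimal sketch of the DPO loss for a single preference pair. It needs only the log-probabilities that the policy and a frozen reference model assign to the chosen and rejected responses. The function name `dpo_loss`, the argument names, and the default `beta=0.1` are assumptions for this example, not values taken from the talk.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair of responses.

    Inputs are summed log-probabilities of each full response.
    beta scales how strongly the policy is pushed away from the reference.
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Made-up log-probabilities: the policy prefers the chosen response
# more than the reference does, so the loss drops below log(2).
loss_improved = dpo_loss(-4.0, -6.0, -5.0, -5.0)
loss_neutral = dpo_loss(-5.0, -5.0, -5.0, -5.0)
```

When the policy matches the reference the margin is zero and the loss equals log(2). The loss falls as the policy raises the chosen response relative to the rejected one.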
#5 about 2 minutes
Using frameworks like NeMo RL to simplify model alignment
Open-source frameworks such as NeMo RL abstract away the complexity of implementing advanced, reinforcement-learning-based alignment algorithms.