Sam Witteveen
Google Gemini: Open Source and Deep Thinking Models - Sam Witteveen
#1about 3 minutes
Navigating the current AI hype and research secrecy
The term "AI" is often used for marketing, while the competitive landscape has made research labs more secretive about their work.
#2about 3 minutes
Understanding Google's open weights Gemma models
Gemma models are "open weights," not fully open source, meaning developers can use the model weights for commercial projects but don't have the training data or code.
#3about 2 minutes
The training process of large language models
LLMs are trained in stages, starting with pre-training to predict the next token, followed by post-training and reinforcement learning to align with human instructions.
#4about 2 minutes
Comparing proprietary Gemini and open Gemma models
Gemini models are Google's proprietary, cloud-based offerings, while Gemma models are open-weight versions that can be run on-premise or on-device.
#5about 3 minutes
How AI models are becoming smaller and more efficient
Models are made smaller and more powerful through training on vast amounts of data and using techniques like distillation to transfer knowledge from larger models.
#6about 7 minutes
The rise of multilingual and multimodal AI
Modern models like Gemma 3 are trained on over 140 languages, and multimodal models like Gemini can process text, audio, video, and images simultaneously.
#7about 4 minutes
Improving text in AI-generated images and videos
Recent models are finally getting better at rendering accurate text in images, while video generation models can create complex, realistic scenes from prompts.
#8about 3 minutes
AI tools for advanced content creation and editing
AI-powered tools can now edit video by manipulating the transcript and even insert new words in a cloned voice, simplifying the post-production process.
#9about 4 minutes
Using large language models as a learning tool
LLMs serve as powerful educational tools by explaining complex concepts at different levels, acting as a personal tutor for learning new technical skills.
#10about 6 minutes
How deep reasoning models 'think' before answering
Reasoning models improve accuracy by generating an internal monologue to break down a problem and explore possibilities before providing a final answer.
#11about 10 minutes
The evolution of AI-powered coding assistants
AI coding tools are evolving into agents that can write code, run tests, and fix errors, shifting the developer's role toward architectural and conversational guidance.
#12about 2 minutes
How to get started with Google's Gemma models
Developers can start experimenting with Gemma models through Google's AI Studio, Hugging Face, or by running them locally with tools like Ollama and LM Studio.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
01:31 MIN
Understanding the fundamentals of Chrome AI
Generate AI in the Browser with Chrome AI - Raymond Camden
03:37 MIN
Understanding Google Gemini models and capabilities
Exploring Google Gemini and Generative AI
05:49 MIN
Understanding the roles of Gemini and Gemma models
Google Gemma and Open Source AI Models - Clement Farabet
30:13 MIN
Using open source Gemma for local AI processing
What’s New with Google Gemini?
18:41 MIN
Building agents with Google Gemini and open source tools
Beyond Chatbots: How to build Agentic AI systems
31:13 MIN
Running on-device AI in the browser with Gemini Nano
Exploring Google Gemini and Generative AI
02:50 MIN
Understanding the hackathon's open tech stack and rules
Coffee With Developers Michael Koitz
00:12 MIN
The history and merger of Google's AI teams
What’s New with Google Gemini?
Featured Partners
Related Videos
What’s New with Google Gemini?
Logan Kilpatrick
Google Gemma and Open Source AI Models - Clement Farabet
Exploring Google Gemini and Generative AI
Developer Productivity Using AI Tools and Services - Ryan J Salva
Ryan J Salva
Exploring the Future of Web AI with Google
Thomas Steiner
How to Avoid LLM Pitfalls - Mete Atamel and Guillaume Laforge
Meta Atamel & Guillaume Laforge
Coffee with Developers - Maria Apazoglou
Maria Apazoglou
Generate AI in the Browser with Chrome AI - Raymond Camden
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

AI Systems and MLOps Engineer for Earth Observation
Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning

Research Engineer, Project Astra, GenAi, Google DeepMindNew
Google
Charing Cross, United Kingdom
Python

Gemini Adoption Solution Engineer, Higher Education (Multiple Languages)
Google Netherlands B.V.
Amsterdam, Netherlands
Senior
API
XML
HTML
Machine Learning

Google Cloud AI Architect
Devoteam
Valdilecha, Spain
Keras
Python
TensorFlow
Google BigQuery
Machine Learning
+1

Gemini Adoption Solution Engineer, Higher Education
Google Netherlands B.V.
Amsterdam, Netherlands
Senior
API
XML
HTML
Machine Learning



