Sandra Ahlgrimm & Kevin Lewis

Bringing AI Model Testing and Prompt Management to Your Codebase with GitHub Models

Is your AI development just 'vibes-based'? Learn how to run automated prompt evaluations as a blocking check on every pull request.

Bringing AI Model Testing and Prompt Management to Your Codebase with GitHub Models
#1about 3 minutes

The challenge of testing non-deterministic AI features

Traditional development relies on rigorous testing, but AI features are often implemented based on intuition without a structured evaluation process.

#2about 5 minutes

Managing prompts as code with GitHub Models

GitHub Models integrates AI development into your repository by defining prompts, models, and parameters in a version-controlled YAML file.

#3about 6 minutes

Using evaluators to compare AI model variants

The platform allows you to run multiple prompt and model variations against a test dataset to compare outputs on metrics like latency, coherence, and similarity.

#4about 5 minutes

Consuming prompt files in your application code

Use the GitHub Models inference API or the Azure AI Inference SDK to load your version-controlled prompt files and integrate AI calls directly into your application.

#5about 2 minutes

Local development and testing with the CLI

The GitHub CLI extension allows you to run prompts and execute model evaluations directly from your terminal for rapid, local iteration before committing changes.

#6about 4 minutes

Automating repository tasks with AI-powered actions

Use GitHub Actions to automate common repository tasks like generating changelogs from pull requests, triaging bug reports, or creating weekly issue summaries.

#7about 1 minute

Implementing CI/CD for AI prompt changes

Integrate prompt evaluations into your CI/CD pipeline using GitHub Actions to automatically run tests and block pull requests that degrade model performance.

#8about 2 minutes

Adopting GitHub Models in existing projects

You can quickly convert existing prompt files to the GitHub Models format to gain access to powerful evaluation, comparison, and automation capabilities.

Related jobs
Jobs that call for the skills explored in this talk.

Featured Partners

Related Articles

View all articles
CH
Chris Heilmann
With AIs wide open - WeAreDevelopers at All Things Open 2025
Last week our VP of Developer Relations, Chris Heilmann, flew to Raleigh, North Carolina to present at All Things Open . An excellent event he had spoken at a few times in the past and this being the “Lucky 13” edition, he didn’t hesitate to come and...
With AIs wide open - WeAreDevelopers at All Things Open 2025
CH
Chris Heilmann
Exploring AI: Opportunities and Risks for Developers
In today's rapidly evolving tech landscape, the integration of Artificial Intelligence (AI) in development presents both exciting opportunities and notable risks. This dynamic was the focus of a recent panel discussion featuring industry experts Kent...
Exploring AI: Opportunities and Risks for Developers

From learning to earning

Jobs that call for the skills explored in this talk.

AI Prompt Engineer

AI Prompt Engineer

SonarSource
Bochum, Germany

Remote
API
Python
Data analysis
Machine Learning
+2