Markus Harrer
Data Science on Software Data
#1about 4 minutes
The challenge of justifying legacy system improvements
Technical debt in legacy systems is difficult to communicate to management because its impact is less visible than new features or bugs.
#2about 4 minutes
The promise and failure of universal software quality metrics
Early software analytics aimed to create universal quality dashboards but failed because metrics and models are not transferable between unique projects.
#3about 5 minutes
Adopting analytics approaches for project-specific questions
Instead of reusing non-transferable results, teams can adapt the methodologies and tools from software analytics to answer their own unique, high-impact questions.
#4about 5 minutes
Using data science as a foundation for software analytics
Reproducible data science provides the necessary methodologies and tools for open and automated analysis, leveraging skills developers already possess.
#5about 6 minutes
Exploring software data types and practical analysis use cases
Analyzing static, runtime, chronological, and community data can reveal code ownership gaps, performance bottlenecks, and opportunities for modularization.
#6about 13 minutes
Analyzing code coverage with Python, pandas, and Jupyter
A live coding demo shows how to use Python, pandas, and Jupyter notebooks to analyze production code coverage data and visualize unused code packages.
#7about 3 minutes
An introduction to graph analytics for software systems
Graph analytics with tools like jQAssistant and Neo4j helps visualize and query interconnected software data like class dependencies and method calls.
#8about 1 minute
Key principles for effective software data analysis
Successful software data analysis requires focusing on solving specific problems, working openly, automating processes, and deriving actionable next steps.
#9about 8 minutes
Q&A on production code analysis and performance bottlenecks
The speaker answers questions about analyzing production codebases, sharing examples of identifying performance bottlenecks and justifying technology choices with data.
Related jobs
Jobs that call for the skills explored in this talk.
WALTER GROUP
Wiener Neudorf, Austria
Intermediate
Senior
Python
Data Vizualization
+1
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Matching moments
01:54 MIN
The growing importance of data and technology in HR
From Data Keeper to Culture Shaper: The Evolution of HR Across Growth Stages
02:49 MIN
Using AI to overcome challenges in systems programming
AI in the Open and in Browsers - Tarek Ziadé
03:34 MIN
The business case for sustainable high performance
Sustainable High Performance: Build It or Pay the Price
03:58 MIN
Making accessibility tooling actionable and encouraging
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
03:07 MIN
Final advice for developers adapting to AI
WeAreDevelopers LIVE – AI, Freelancing, Keeping Up with Tech and More
03:16 MIN
Improving the developer feedback loop with specialized tools
Developer Time Is Valuable - Use the Right Tools - Kilian Valkhof
04:09 MIN
How Python became the dominant language for AI
AI in the Open and in Browsers - Tarek Ziadé
02:33 MIN
Why you might not need JavaScript for everything
WeAreDevelopers LIVE – You Don’t Need JavaScript, Modern CSS and More
Featured Partners
Related Videos
Getting to Know Your Legacy (System) with AI-Driven Software Archeology
Markus Harrer
Modern Data Architectures need Software Engineering
Matthias Niehoff
Grappling With Clunky Old Software? Start by Understanding What’s Inside!
Luc Perard
New AI-Centric SDLC: Rethinking Software Development with Knowledge Graphs
Gregor Schumacher, Sujay Joshy & Marcel Gocke
From Monolith Tinkering to Modern Software Development
Lars Gentsch
The Road to One Billion Developers
Thomas Dohmke & Demetris Cheatham
The Clean as You Code Imperative
Olivier Gaudin
How we will build the software of tomorrow
Thomas Dohmke
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Smart Future Campus GmbH
Jena, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Datamics Gmbh
€52K
API
Python
Microservices
Continuous Integration

Smart Future Campus GmbH
Offenbach am Main, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Smart Future Campus GmbH
Kaiserslautern, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Smart Future Campus GmbH
Rostock, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Smart Future Campus GmbH
Ludwigshafen am Rhein, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Smart Future Campus GmbH
Goslar, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Smart Future Campus GmbH
Hamburg, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1

Smart Future Campus GmbH
Plauen, Germany
ETL
JSON
Azure
NoSQL
Scrum
+1