Soroosh Khodami
Why and when should we consider Stream Processing frameworks in our solutions
#1about 2 minutes
Differentiating stream processing from event processing
Stream processing focuses on transforming continuous data streams, whereas event processing is about making decisions and triggering actions based on individual messages.
#2about 2 minutes
Handling out-of-order data with event time
Stream processing frameworks can reorder messages based on when the event actually occurred (event time) rather than when it was received (processing time).
#3about 2 minutes
Understanding message delivery guarantees
Frameworks provide mechanisms for exactly-once processing, which prevents duplicate message processing and is critical for financial systems.
#4about 3 minutes
Building data pipelines with sources and operators
Data pipelines are constructed by chaining operators that read from a source, apply transformations like filtering or joining, and write to a sink.
#5about 5 minutes
Using windowing to process continuous data streams
Windowing groups unbounded data into finite chunks for processing, with types like tumbling, sliding, and session windows serving different analytical needs.
#6about 1 minute
Joining data from multiple real-time streams
You can combine data from multiple streams using familiar concepts like inner joins and cross joins to create enriched data outputs.
#7about 2 minutes
Implementing complex logic with stateful processing
Stateful processing allows operators to store and retrieve data in memory, enabling complex logic like tracking user behavior or detecting fraud patterns over time.
#8about 1 minute
Overview of popular stream processing frameworks
Key frameworks for stream processing include Apache Flink, Apache Beam, Spark Streaming, and Kafka Streams, with cloud platforms offering managed services.
#9about 4 minutes
Comparing Spring Boot vs Apache Beam performance
A practical benchmark shows that while Apache Beam offers higher throughput, a standard Spring Boot and Redis setup can be sufficient and more cost-effective for many use cases.
#10about 3 minutes
Weighing the benefits and significant drawbacks
While powerful, stream processing frameworks are complex to learn, difficult to maintain and debug, and have a steep learning curve for development teams.
#11about 1 minute
Real-world use cases for stream processing
Stream processing is heavily used in industries like gaming for anti-cheat systems, IoT for real-time traffic data, and finance for fraud detection.
#12about 2 minutes
Learning resources and communicating with stakeholders
Before adopting these complex frameworks, it is crucial to manage stakeholder expectations about the high cost and difficulty of implementing and changing data pipelines.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
05:20 MIN
A traditional approach to streaming with Kafka and Debezium
Python-Based Data Streaming Pipelines Within Minutes
09:43 MIN
Exploring the operational complexity of Kafka and Flink
Python-Based Data Streaming Pipelines Within Minutes
02:05 MIN
Understanding the challenges of adopting real-time data streaming
Python-Based Data Streaming Pipelines Within Minutes
21:18 MIN
Why modern applications adopt event streaming
Event Messaging and Streaming with Apache Pulsar
27:46 MIN
Key takeaways for modern data processing
Convert batch code into streaming with Python
10:34 MIN
Decoupling microservices with event streams
From event streaming to event sourcing 101
05:39 MIN
Key use cases for Python streaming frameworks
Convert batch code into streaming with Python
34:35 MIN
Achieving massive throughput with sharded architectures
The Rise of Reactive Microservices
Featured Partners
Related Videos
Python-Based Data Streaming Pipelines Within Minutes
Bobur Umurzokov
Convert batch code into streaming with Python
Bobur Umurzokov
Kafka Streams Microservices
Denis Washington & Olli Salonen
Event Messaging and Streaming with Apache Pulsar
Mary Grygleski
In-Memory Computing - The Big Picture
Markus Kett
Building the platform for providing ML predictions based on real-time player activity
Artem Volk & Fabian Zillgens
Practical Change Data Streaming Use Cases With Debezium And Quarkus
Alex Soto
From event streaming to event sourcing 101
Gerard Klijs
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.
![Senior Software Engineer [TypeScript] (Prisma Postgres)](https://wearedevelopers.imgix.net/company/283ba9dbbab3649de02b9b49e6284fd9/cover/oKWz2s90Z218LE8pFthP.png?w=400&ar=3.55&fit=crop&crop=entropy&auto=compress,format)
Senior Software Engineer [TypeScript] (Prisma Postgres)
Prisma
Remote
Senior
Node.js
TypeScript
PostgreSQL

Data Engineer - Focus Apache Flink
ADMIRAL Technologies
Gumpoldskirchen, Austria
Remote
€55K
ETL
Java
Linux
+5

Technology Architect - Apache Kafka, Confluent - Germany
Infosys Limited
Frankfurt am Main, Germany
API
Java
Azure
Kafka
Python
+5

Data Streaming Engineer (Java y Kafka) (España)
Page Personnel
Municipality of Boadilla del Monte, Spain
Remote
€45-50K
Intermediate
Java
Kafka
Python
+1

Senior Software Engineer
Ververica | Original creators of Apache Flink®
Municipality of Madrid, Spain
API
Java

System Engineer, Messaging and Streaming Team
Amazon.com, Inc
Reading, United Kingdom
£63K
DNS
Bash
Perl
Ruby
+4

Data Engineer Python Spark SQL - Fintech
Client Server
Newcastle upon Tyne, United Kingdom
Remote
€90-120K
Hive
Azure
Spark
+3

{"@context":"https://schema.org/","@type":"JobPosting","title":"Fullstack Developer
Beam Inc.
Bristol, United Kingdom
Remote
Hive
NoSQL
Spark
Figma
+7

DevOps Engineer (Kafka - AWS)
hiberus
Municipality of Santander, Spain
Remote
Senior
Scrum
Kafka
DevOps
Ansible
+6