SlideShare a Scribd company logo
Why Serverless Flink Matters -
Blazing Fast Stream Processing
Made Scalable
1
Mayank Juneja, Confluent
Jean-Sébastien Brunner, Confluent
Agenda
2
1. Stream Processing
a. Overview and Challenges
2. Why Flink?
a. Why is it so popular?
b. Challenges of self managed Flink
3. Serverless Flink + Demo
Real-time
Data
A Sale
A Shipment
A Trade
Rich Front-End
Customer Experiences
A Customer
Experience
Real-Time Backend
Operations
Stream processing is computing over unbounded
streams of data
Stream
Processor
Stream processing use cases
4
Data Exploration Data Pipelines Real-time Apps
Engineers and Analysts both
need to be able to simply read
and understand the event
streams stored in Kafka
● Metadata discovery
● Throughput analysis
● Data sampling
● Interactive query
Data pipelines are used to
enrich, curate, and transform
events streams, creating new
derived event streams
● Filtering
● Joins
● Projections
● Aggregations
● Flattening
● Enrichment
Whole ecosystems of apps
feed on event streams
automating action in real-time
● Threat detection
● Quality of Service
● Fraud detection
● Intelligent routing
● Alerting
Challenges with Stream Processing
Ordering and
Timing
State
Management
Fault
Tolerance
Scalability
How do you
handle
out-of-order and
late events?
How do you scale
for unexpected
large throughput?
Do you need
exactly-once
semantics?
How do you
manage state in
a distributed
environment?
Why Flink?
What’s great about Apache Flink?
Scalability Language Flexibility Unified Processing
Flink is capable of supporting
stream processing workloads
at hyper scale, as evidenced by
its broad adoption by leading
digital native companies
Flink supports Java, Python, &
SQL without making major
tradeoffs in functionality,
enabling developers to work in
their language of choice
Flink supports stream
processing and batch
processing through one
technology, rather than
needing separate tools
Flink is a top 5 Apache project and is leveraged as the stream processing engine for >25% of Kafka users
Stream Processing with Flink
Ordering
and Timing
State
Management
Fault
Tolerance
Scalability &
Performance
● Event time
processing
● Watermarks
● Elastic scale out
● Network Traffic
Optimization
● Backpressure
Handling
Challenges
Flink
Features
● Local and
in-memory
states for all
computations
● Exactly once
semantics
● Distributed
snapshots /
checkpoints
So is Flink the perfect stream
processor?
Self Managed Flink comes with its own challenges
Configuration
and Setup
Monitoring Cost
Management
Security
- Resource allocation: Provisioning
resources (CPU, memory, storage)
for each Flink job can be a
complicated task
- Dependency management
- Connectors, databases
- Configuration
- Standalone vs k8s vs YARN,
Application mode vs Session
mode
Challenge #1 - Configuration and Setup
Challenge #2 - Monitoring and Maintenance
- Metrics:
- Filtering down to the most
relevant metrics for your
application can be
overwhelming
- Version Upgrades
- Upgrading Flink versions esp
when ensuring backward
compatibility is a pain
- Disaster recovery
- Needs regular backups,
checkpointing, savepoints
Flink downloads, Mar 2023
Challenge #3: Total Cost of Ownership
- Hardware costs: Significant
investments required for
managing hardware costs - can
be underutilized
- Expertise: Hiring of skilled
professionals who can set up,
manage and maintain Flink
- Opportunity Cost: Less time
spent on developing core
product or service
Challenge #4 - Security
- RBAC: Flink lacks built-in
capabilities for granular role based
access control
- Encryption: Data encryption at
rest for Flink state backends does
not come out of the box
- Multi-tenancy: Insufficient
capabilities to support multi
tenancy within the same cluster
Flink Serverless
1
6
Powerful SQL Streaming Operators
Time windows Pattern Matching Streaming Joins
● Time-based windows
● Event-density windows
● Event-based windows: every single
event can trigger a new window
● Complex Event
Processing
● Stream-to-stream joins
● Temporal joins
● Lookup joins
● Versioned joins
etc.
Solution - Serverless Flink
- Evergreen runtime: once you submit a job it can run 24.7 and you don't
need to take care of any upgrade (security patch, Flink, etc.), it just runs.
- Elastic autoscaling of the compute pools:
- Elastic scale up of the pool, with a user-defined maximum
- Elastic scale down of the pool, with scale-to-zero when nothing runs
- Usage based billing
- Separation of compute (Flink) and storage (Kafka)
- Scale independently to get best best cost and best performance
- Optimization of communications for even increased
cost/performance
Autoscale and monitoring at the job level
● Per task dynamic scaling
○ Rescale based on backpressure and utilization of the vertices, not only
based on CPU or infrastructure-level metrics
○ Take into account the throughput from the source
● Job level metrics and monitoring
●
Flink and Kafka closer together
High
bandwidth
Low
bandwidth
Flink and Kafka are closer
together, allowing to reduce:
● Latency
● Network cost
With Fetch-from-follower the
optimization can be done at the
Availability Zone level.
Confluent Cloud
High
bandwidth
Flink Serverless ++
Apache Flink in Confluent Cloud
2
1
Serverless Flink SQL
Rich Experience
Complete and Secure
● ANSI-SQL with powerful streaming operators
● Rich CLI Experience
● SQL Editor with "workspaces"
● Integration with Schema Registry and
Governance
● Support for user-authentication and Service
account
+
+
Support various use-cases and Personas
Developers Data Analysts Data Engineers
Languages Java & SQL ANSI SQL SQL & Python
Tools
Use Cases Streaming Apps Data Exploration Data Pipelines
IDE & SQL CLI Notebooks
UI / BI / JDBC
Thanks!
Q&A
Feedback/comments/questions: flink-preview@confluent.io

More Related Content

Similar to Why Serverless Flink Matters - Blazing Fast Stream Processing Made Scalable (20)

PDF
Apache Flink 101 - the rise of stream processing and beyond
Bowen Li
 
PPTX
Kostas Tzoumas_Stephan Ewen - Keynote -The maturing data streaming ecosystem ...
Flink Forward
 
PDF
A look at Flink 1.2
Stefan Richter
 
PDF
Stefan Richter - A look at Flink 1.2 and beyond @ Berlin Meetup
Ververica
 
PPTX
Data Stream Processing with Apache Flink
Fabian Hueske
 
PDF
Apache flink
pranay kumar
 
PPTX
The Past, Present, and Future of Apache Flink®
Aljoscha Krettek
 
PPTX
Apache Flink Meetup Munich (November 2015): Flink Overview, Architecture, Int...
Robert Metzger
 
PDF
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Apache Flink Taiwan User Group
 
PPTX
QCon London - Stream Processing with Apache Flink
Robert Metzger
 
PDF
Running Flink in Production: The good, The bad and The in Between - Lakshmi ...
Flink Forward
 
PDF
Apache Flink - a Gentle Start
Liangjun Jiang
 
PPTX
Robust stream processing with Apache Flink
Aljoscha Krettek
 
PPTX
GOTO Night Amsterdam - Stream processing with Apache Flink
Robert Metzger
 
PPTX
Flink Forward Berlin 2018: Aljoscha Krettek & Till Rohrmann - Keynote: "A Yea...
Flink Forward
 
PPTX
The Past, Present, and Future of Apache Flink
Aljoscha Krettek
 
PPTX
Architecture of Flink's Streaming Runtime @ ApacheCon EU 2015
Robert Metzger
 
PPTX
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica
 
PDF
Flink at netflix paypal speaker series
Monal Daxini
 
PDF
Towards Apache Flink 2.0 - Unified Data Processing and Beyond, Bowen Li
Bowen Li
 
Apache Flink 101 - the rise of stream processing and beyond
Bowen Li
 
Kostas Tzoumas_Stephan Ewen - Keynote -The maturing data streaming ecosystem ...
Flink Forward
 
A look at Flink 1.2
Stefan Richter
 
Stefan Richter - A look at Flink 1.2 and beyond @ Berlin Meetup
Ververica
 
Data Stream Processing with Apache Flink
Fabian Hueske
 
Apache flink
pranay kumar
 
The Past, Present, and Future of Apache Flink®
Aljoscha Krettek
 
Apache Flink Meetup Munich (November 2015): Flink Overview, Architecture, Int...
Robert Metzger
 
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Apache Flink Taiwan User Group
 
QCon London - Stream Processing with Apache Flink
Robert Metzger
 
Running Flink in Production: The good, The bad and The in Between - Lakshmi ...
Flink Forward
 
Apache Flink - a Gentle Start
Liangjun Jiang
 
Robust stream processing with Apache Flink
Aljoscha Krettek
 
GOTO Night Amsterdam - Stream processing with Apache Flink
Robert Metzger
 
Flink Forward Berlin 2018: Aljoscha Krettek & Till Rohrmann - Keynote: "A Yea...
Flink Forward
 
The Past, Present, and Future of Apache Flink
Aljoscha Krettek
 
Architecture of Flink's Streaming Runtime @ ApacheCon EU 2015
Robert Metzger
 
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica
 
Flink at netflix paypal speaker series
Monal Daxini
 
Towards Apache Flink 2.0 - Unified Data Processing and Beyond, Bowen Li
Bowen Li
 

More from HostedbyConfluent (20)

PDF
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
 
PDF
Renaming a Kafka Topic | Kafka Summit London
HostedbyConfluent
 
PDF
Evolution of NRT Data Ingestion Pipeline at Trendyol
HostedbyConfluent
 
PDF
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
HostedbyConfluent
 
PDF
Exactly-once Stream Processing with Arroyo and Kafka
HostedbyConfluent
 
PDF
Fish Plays Pokemon | Kafka Summit London
HostedbyConfluent
 
PDF
Tiered Storage 101 | Kafla Summit London
HostedbyConfluent
 
PDF
Building a Self-Service Stream Processing Portal: How And Why
HostedbyConfluent
 
PDF
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
HostedbyConfluent
 
PDF
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
HostedbyConfluent
 
PDF
Navigating Private Network Connectivity Options for Kafka Clusters
HostedbyConfluent
 
PDF
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
HostedbyConfluent
 
PDF
Explaining How Real-Time GenAI Works in a Noisy Pub
HostedbyConfluent
 
PDF
TL;DR Kafka Metrics | Kafka Summit London
HostedbyConfluent
 
PDF
A Window Into Your Kafka Streams Tasks | KSL
HostedbyConfluent
 
PDF
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
HostedbyConfluent
 
PDF
Data Contracts Management: Schema Registry and Beyond
HostedbyConfluent
 
PDF
Code-First Approach: Crafting Efficient Flink Apps
HostedbyConfluent
 
PDF
Debezium vs. the World: An Overview of the CDC Ecosystem
HostedbyConfluent
 
PDF
Beyond Tiered Storage: Serverless Kafka with No Local Disks
HostedbyConfluent
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
 
Renaming a Kafka Topic | Kafka Summit London
HostedbyConfluent
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
HostedbyConfluent
 
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
HostedbyConfluent
 
Exactly-once Stream Processing with Arroyo and Kafka
HostedbyConfluent
 
Fish Plays Pokemon | Kafka Summit London
HostedbyConfluent
 
Tiered Storage 101 | Kafla Summit London
HostedbyConfluent
 
Building a Self-Service Stream Processing Portal: How And Why
HostedbyConfluent
 
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
HostedbyConfluent
 
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
HostedbyConfluent
 
Navigating Private Network Connectivity Options for Kafka Clusters
HostedbyConfluent
 
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
HostedbyConfluent
 
Explaining How Real-Time GenAI Works in a Noisy Pub
HostedbyConfluent
 
TL;DR Kafka Metrics | Kafka Summit London
HostedbyConfluent
 
A Window Into Your Kafka Streams Tasks | KSL
HostedbyConfluent
 
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
HostedbyConfluent
 
Data Contracts Management: Schema Registry and Beyond
HostedbyConfluent
 
Code-First Approach: Crafting Efficient Flink Apps
HostedbyConfluent
 
Debezium vs. the World: An Overview of the CDC Ecosystem
HostedbyConfluent
 
Beyond Tiered Storage: Serverless Kafka with No Local Disks
HostedbyConfluent
 
Ad

Recently uploaded (20)

PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PPTX
Designing Production-Ready AI Agents
Kunal Rai
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
Designing Production-Ready AI Agents
Kunal Rai
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Ad

Why Serverless Flink Matters - Blazing Fast Stream Processing Made Scalable

  • 1. Why Serverless Flink Matters - Blazing Fast Stream Processing Made Scalable 1 Mayank Juneja, Confluent Jean-Sébastien Brunner, Confluent
  • 2. Agenda 2 1. Stream Processing a. Overview and Challenges 2. Why Flink? a. Why is it so popular? b. Challenges of self managed Flink 3. Serverless Flink + Demo
  • 3. Real-time Data A Sale A Shipment A Trade Rich Front-End Customer Experiences A Customer Experience Real-Time Backend Operations Stream processing is computing over unbounded streams of data Stream Processor
  • 4. Stream processing use cases 4 Data Exploration Data Pipelines Real-time Apps Engineers and Analysts both need to be able to simply read and understand the event streams stored in Kafka ● Metadata discovery ● Throughput analysis ● Data sampling ● Interactive query Data pipelines are used to enrich, curate, and transform events streams, creating new derived event streams ● Filtering ● Joins ● Projections ● Aggregations ● Flattening ● Enrichment Whole ecosystems of apps feed on event streams automating action in real-time ● Threat detection ● Quality of Service ● Fraud detection ● Intelligent routing ● Alerting
  • 5. Challenges with Stream Processing Ordering and Timing State Management Fault Tolerance Scalability How do you handle out-of-order and late events? How do you scale for unexpected large throughput? Do you need exactly-once semantics? How do you manage state in a distributed environment?
  • 7. What’s great about Apache Flink? Scalability Language Flexibility Unified Processing Flink is capable of supporting stream processing workloads at hyper scale, as evidenced by its broad adoption by leading digital native companies Flink supports Java, Python, & SQL without making major tradeoffs in functionality, enabling developers to work in their language of choice Flink supports stream processing and batch processing through one technology, rather than needing separate tools Flink is a top 5 Apache project and is leveraged as the stream processing engine for >25% of Kafka users
  • 8. Stream Processing with Flink Ordering and Timing State Management Fault Tolerance Scalability & Performance ● Event time processing ● Watermarks ● Elastic scale out ● Network Traffic Optimization ● Backpressure Handling Challenges Flink Features ● Local and in-memory states for all computations ● Exactly once semantics ● Distributed snapshots / checkpoints
  • 9. So is Flink the perfect stream processor?
  • 10. Self Managed Flink comes with its own challenges Configuration and Setup Monitoring Cost Management Security
  • 11. - Resource allocation: Provisioning resources (CPU, memory, storage) for each Flink job can be a complicated task - Dependency management - Connectors, databases - Configuration - Standalone vs k8s vs YARN, Application mode vs Session mode Challenge #1 - Configuration and Setup
  • 12. Challenge #2 - Monitoring and Maintenance - Metrics: - Filtering down to the most relevant metrics for your application can be overwhelming - Version Upgrades - Upgrading Flink versions esp when ensuring backward compatibility is a pain - Disaster recovery - Needs regular backups, checkpointing, savepoints Flink downloads, Mar 2023
  • 13. Challenge #3: Total Cost of Ownership - Hardware costs: Significant investments required for managing hardware costs - can be underutilized - Expertise: Hiring of skilled professionals who can set up, manage and maintain Flink - Opportunity Cost: Less time spent on developing core product or service
  • 14. Challenge #4 - Security - RBAC: Flink lacks built-in capabilities for granular role based access control - Encryption: Data encryption at rest for Flink state backends does not come out of the box - Multi-tenancy: Insufficient capabilities to support multi tenancy within the same cluster
  • 16. 1 6 Powerful SQL Streaming Operators Time windows Pattern Matching Streaming Joins ● Time-based windows ● Event-density windows ● Event-based windows: every single event can trigger a new window ● Complex Event Processing ● Stream-to-stream joins ● Temporal joins ● Lookup joins ● Versioned joins etc.
  • 17. Solution - Serverless Flink - Evergreen runtime: once you submit a job it can run 24.7 and you don't need to take care of any upgrade (security patch, Flink, etc.), it just runs. - Elastic autoscaling of the compute pools: - Elastic scale up of the pool, with a user-defined maximum - Elastic scale down of the pool, with scale-to-zero when nothing runs - Usage based billing - Separation of compute (Flink) and storage (Kafka) - Scale independently to get best best cost and best performance - Optimization of communications for even increased cost/performance
  • 18. Autoscale and monitoring at the job level ● Per task dynamic scaling ○ Rescale based on backpressure and utilization of the vertices, not only based on CPU or infrastructure-level metrics ○ Take into account the throughput from the source ● Job level metrics and monitoring ●
  • 19. Flink and Kafka closer together High bandwidth Low bandwidth Flink and Kafka are closer together, allowing to reduce: ● Latency ● Network cost With Fetch-from-follower the optimization can be done at the Availability Zone level. Confluent Cloud High bandwidth
  • 21. Apache Flink in Confluent Cloud 2 1 Serverless Flink SQL Rich Experience Complete and Secure ● ANSI-SQL with powerful streaming operators ● Rich CLI Experience ● SQL Editor with "workspaces" ● Integration with Schema Registry and Governance ● Support for user-authentication and Service account + +
  • 22. Support various use-cases and Personas Developers Data Analysts Data Engineers Languages Java & SQL ANSI SQL SQL & Python Tools Use Cases Streaming Apps Data Exploration Data Pipelines IDE & SQL CLI Notebooks UI / BI / JDBC