SlideShare a Scribd company logo
Resilient Real-time Data Streaming
across the Edge and Hybrid Cloud
Use Cases, Architectures, and Examples for Data in Motion powered by Apache Kafka
Kai Waehner
Field CTO
kai.waehner@confluent.io
linkedin.com/in/kaiwaehner
confluent.io
kai-waehner.de
@KaiWaehner
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
AWS Cloud Outage hit Disney World Visitors…
https://www.cnet.com/tech/services-and-software/disney-parks-were-already-facing-heat-from-fans-then-an-aws-outage-came-along/
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Why one data center or cloud region is not good enough?
Latency / Cost
Disaster
Recovery
Privacy / Compliance
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Disaster Recovery – RPO and RTO
RPO = Recovery Point Objective
RTO = Recovery Time Objective
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
ZERO RPO requires
synchronous replication
ZERO RTO requires
seamless failover
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Real-time Data in Motion beats Slow Data.
Transportation
Predictive
maintenance
Driver-rider match
ETA updates
Banking
Instant payments
Fraud detection
Mobile applications /
customer experience
Retail
Real-time inventory
Real-time POS
reporting
Personalization
Entertainment
Real-time
recommendations
Personalized
news feed
In-car purchases
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Apache Kafka is the Platform for Data in Motion
MES
ERP
Sensors
Mobile
Customer 360
Real-time
Alerting System
Data
warehouse
Producers
Consumers
Streams and storage of real time events
Stream
processing
apps
Connectors
Connectors
Stream
processing
apps
Supplier
Alert
Forecast
Inventory Customer
Order
11
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Apache Kafka =
A Resilient, Distributed System
Broker 1
Topic1
partition1
Broker 2 Broker 3 Broker 4
Topic1
partition1
Topic1
partition1
Leader Follower
Topic1
partition2
Topic1
partition2
Topic1
partition2
Topic1
partition3
Topic1
partition4
Topic1
partition3
Topic1
partition3
Topic1
partition4
Topic1
partition4
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Resilient Data Streaming across Edge and Hybrid Cloud
Streaming Replication between Kafka Clusters
Bridge to Databases, Data Lakes, Apps, APIs, SaaS
Aggregate Small Footprint
Edge Deployments with
Replication (Aggregation)
Simplify Disaster Recovery
Operations with
Multi-Region Clusters
for RPO=0 and RTO~0
Stream Data Globally with
Replication and Cluster Linking
16
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Shipping Industry
Marine, Oil Transport, Vessel Fleet, Shipping Line, Drones
Real-time Operations, Logistics, Predictive Maintenance, Security
Customer Data
Crew, Cargo
Vessel Data
Fuel Consumption, Speed,
Planned Maintenance
Automatic Identification System (AIS)
Unique Identification,
Position, Course, Weather, Draft
Drone Data
Deliveries,
Survey/Inspection
of Assets such as Oil Rigs,
Pipelines, Offshore Turbines
Edge Analytics
Bidirectional Edge to Cloud Integration
Data Ingestion
Stream
Processing
Data
Integration
Logistics
Track&Trace
Routing
Monitoring
Alerting
Command&Control
Batch Analytics
Reporting
Machine Learning
Backend Systems
Oracle, SAP,
OSIsoft PI, etc.
X = Event Streaming
X = Other Technologies
Bi-Directional Hybrid Cloud
Replication
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
BMW Group
Mission-critical workloads across the edge and cloud
• Why Kafka? Decoupling. Transparency. Innovation.
• Why Confluent? Stability is key in manufacturing
• Decoupling between logistics and production systems
• Cloud-first event streaming on Azure Cloud with serverless Confluent Cloud
• Use case
• Logistics and supply chain in global plants
• Right stock in place (physically and in ERP systems like SAP)
• Just in time, just in sequence
• Lot of critical applications
19
Jay Kreps, Confluent CEO
Felix Böhm, BMW Plant Digitalization and Cloud Transformation
Keynote at Kafka Summit Eurpoe 2021:
https://www.youtube.com/watch?v=3cG2ud7TRs4
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Condition Monitoring and Predictive Maintenance
Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB)
8 9
1
0
1
1
1
2
1
3
1
4
1
5
1
6
3 4 5 6 7
2
1
Time
Sensor Events
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Condition Monitoring and Predictive Maintenance
Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB)
8 9
1
0
1
1
1
2
1
3
1
4
1
5
3 4 5 6 7
2
1
Time
Sensor Events
1
6
Condition Monitoring
(Temperature Spikes)
Stateless Filter Above-Threshold Events
Streams
builder
.stream(”temperature-sensor")
.filter((key, sensor-data) ->
sensor-data.temperature > 100)
.to(”temperature-spikes");
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Condition Monitoring and Predictive Maintenance
Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB)
8 9
1
0
1
1
1
2
1
3
1
4
1
5
1
6
3 4 5 6 7
2
1
Time
Sensor Events
Predictive Maintenance
(Continuous Anomaly Detection)
Stateful Correlation of Events
CREATE TABLE anomaly_detection AS
SELECT temperature_spike_id, COUNT(*) AS total_spikes,
AVG(temperature) AS avg_temperature
FROM sensor-data
WINDOW TUMBLING (SIZE 1 HOUR)
GROUP BY temperature_spike_id
EMIT CHANGES;
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Condition Monitoring and Predictive Maintenance
Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB)
8 9
1
0
1
1
1
2
1
3
1
4
1
5
1
6
3 4 5 6 7
2
1
Time
Sensor Events
Predictive Maintenance
(Continuous Anomaly Detection)
Real-time Machine Learning
CREATE STREAM anomaly_detection AS
SELECT sensor_id, detect_anomaly(sensor_values)
FROM machine;
TensorFlow model embedded in User Defined
Function (UDF)
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Disaster Recovery @ JPMorgan
https://www.confluent.io/kafka-summit-san-francisco-2019/secure-kafka-at-scale-in-true-multi-tenant-environment
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Multi-Region Kafka Cluster in Financial Services
Zero downtime + zero data loss (RPO=0 and RTO~0) + automated disaster recovery
Large Bank
Transaction
Log
Transaction
Log
Location Location
synchronous
asynchronous
● ‘Payment’ transactions enter
from us-east and us-west with
fully synchronous replication
● ‘Log’ and ‘Location’ information
in the same cluster use async -
optimized for latency
● Automated disaster recovery
(zero downtime, zero data loss)
Result: Clearing time from ‘deposit’ to
‘available’ goes from 5 days to 5 seconds
(including security checks)
(Only available in Confluent Platform)
Hundreds of miles distance between the data centers
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Migration with Cluster Linking
27
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Robinhood
Mission: “Democratize finance for all”
Kafka for mission-critical and analytics use cases
Microservices using various technologies
28
https://www.confluent.io/events/kafka-summit-americas-2021/taming-a-massive-fleet-of-python-based-kafka-apps-at-robinhood/
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Thought Machine – Core Banking
• Cloud-native core banking software
• Transactional workloads (24/7, zero data loss)
• Flexible product engine powered by smart contracts (not blockchain)
29
https://www.confluent.io/events/kafka-summit-apac-2021/scaling-a-core-banking-engine-using-apache-kafka/
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
“Transactions” in Apache Kafka
30
Exactly-Once Semantics (EOS)
available since Kafka 0.11 (June 2017):
https://cwiki.apache.org/confluence/display/KAFKA/KIP-98+-+Exactly+Once+Delivery+and+Transactional+Messaging
https://www.confluent.io/kafka-summit-london18/dont-repeat-yourself-introducing-exactly-once-semantics-in-apache-kafka/
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Royal Caribbean - Offline Edge for Swimming Retail Stores
https://www.confluent.io/kafka-summit-lon19/seamless-guest-experience-with-kafka-streams/
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
CRM
3rd party
payment
provider
Context-specific
real-time upsell
Customer data
Payment processing and
fraud detection as a service
Manager
Get report
API
Customer Customer
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Hybrid Retail Architecture
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Point of Sale
(POS) Loyalty
System
Local Inventory
Management
Payment Discount
Customer
data
Train
schedule
Payment
data
Loyalty
information
Streams of real time events
Global Inventory
Management
Event Streaming at the Edge
in the Smart Retail Store
Item Availability
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Omnichannel Retail
Time
P
C3 C2
C1
Sales Talk on site in
Car Dealership
Right now
Location-based
Customer Action
Customer 360
(Website, Mobile App, On Site in Store, In-Car)
Car Configurator
10 and 8 days ago
Context-specific
Marketing Campaign
90 and 60 days ago
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Agenda
1) Resilient enterprise architectures
2) Real-time data streaming with the Apache Kafka ecosystem
3) Cloud-first and serverless Industrial IoT in automotive
4) Multi-region infrastructure for core banking
5) Hybrid cloud for customer experiences in retail
6) Disconnected edge for safety and security in the public sector
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Data Processing
at the Edge
Time
P
C1
C2
C3
Know-your-customer
Loyalty app, predictive behavior, …
Estimated
time of arrival
Connect to the
gaming server
for kids
Play games, earn rewards, communicate with
other kids in the train, …
Always on (even “offline”)
Replayability
Cost-efficiency
Low latency
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Devon Energy
Oil & Gas Industry
Improve drilling and well completion operations
Edge stream processing/analytics + closed-loop control ready
Vendor agnostic (pumping, wireline, coil, offset wells, drilling
operations, producing wells)
Replication to the cloud in real-time at scale
Cloud agnostic (AWS, GCP, Azure)
Source: Energy in Data - Powered by AAPG, SEG & SPE: energyindata.org
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Smart Soldiers at the Edge
39
Sensor A Sensor B Sensor X
MQTT
Confluent Platform (Single Broker)
Single Kafka broker deployed on a small
computer and leveraging Cluster Linking to
publish sensor data to the Command Post.
Command Post running Confluent
Platform aggregating information from
soliders and other sensor data
Weather
Personnel
Logistics
Targets Sensor data published to
Command Post when
connected to network
Enhanced
Situational Awareness
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Why people choose Confluent
for building resilient architectures?
kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud
Car Engine Car Self-driving Car
Confluent completes Apache Kafka. Cloud-native. Everywhere.
Kai Waehner
Field CTO
Confluent
kai.waehner@confluent.io
@KaiWaehner
confluent.io
kai-waehner.de
linkedin.com/in/kaiwaehner
Questions? Feedback?
Let’s connect!

More Related Content

What's hot (20)

PPTX
An Introduction to Confluent Cloud: Apache Kafka as a Service
confluent
 
PPTX
Kafka at Peak Performance
Todd Palino
 
PPTX
Apache Kafka
emreakis
 
PDF
What is Apache Kafka and What is an Event Streaming Platform?
confluent
 
PPTX
Introduction to Apache Kafka
Jeff Holoman
 
PDF
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
Kai Wähner
 
PDF
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Kai Wähner
 
PDF
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Kai Wähner
 
PPTX
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
PDF
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 
PPTX
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 
PDF
Disaster Recovery for Multi-Region Apache Kafka Ecosystems at Uber
confluent
 
PDF
Storage Capacity Management on Multi-tenant Kafka Cluster with Nurettin Omeroglu
HostedbyConfluent
 
PDF
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Kai Wähner
 
PDF
Benefits of Stream Processing and Apache Kafka Use Cases
confluent
 
PDF
When NOT to use Apache Kafka?
Kai Wähner
 
PPTX
APACHE KAFKA / Kafka Connect / Kafka Streams
Ketan Gote
 
PDF
Apache Kafka in the Transportation and Logistics
Kai Wähner
 
PDF
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
Databricks
 
PDF
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Kai Wähner
 
An Introduction to Confluent Cloud: Apache Kafka as a Service
confluent
 
Kafka at Peak Performance
Todd Palino
 
Apache Kafka
emreakis
 
What is Apache Kafka and What is an Event Streaming Platform?
confluent
 
Introduction to Apache Kafka
Jeff Holoman
 
The Rise Of Event Streaming – Why Apache Kafka Changes Everything
Kai Wähner
 
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Kai Wähner
 
Apache Kafka for Real-time Supply Chain in the Food and Retail Industry
Kai Wähner
 
Apache Spark Architecture | Apache Spark Architecture Explained | Apache Spar...
Simplilearn
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 
The Top 5 Apache Kafka Use Cases and Architectures in 2022
Kai Wähner
 
Disaster Recovery for Multi-Region Apache Kafka Ecosystems at Uber
confluent
 
Storage Capacity Management on Multi-tenant Kafka Cluster with Nurettin Omeroglu
HostedbyConfluent
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Kai Wähner
 
Benefits of Stream Processing and Apache Kafka Use Cases
confluent
 
When NOT to use Apache Kafka?
Kai Wähner
 
APACHE KAFKA / Kafka Connect / Kafka Streams
Ketan Gote
 
Apache Kafka in the Transportation and Logistics
Kai Wähner
 
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
Databricks
 
Apache Kafka vs. Cloud-native iPaaS Integration Platform Middleware
Kai Wähner
 

Similar to Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apache Kafka (20)

PDF
Mit Streaming die Brücken zum Erfolg bauen
confluent
 
PDF
Apache Kafka for Smart Grid, Utilities and Energy Production
Kai Wähner
 
PPTX
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Precisely
 
PDF
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
confluent
 
PDF
Apache Kafka® and Analytics in a Connected IoT World
confluent
 
PDF
Apache Kafka Landscape for Automotive and Manufacturing
Kai Wähner
 
PPTX
IoT and Event Streaming at Scale with Apache Kafka
confluent
 
PDF
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner
 
PDF
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
Kai Wähner
 
PDF
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Kai Wähner
 
PDF
Set Your Data In Motion - CTO Roundtable
confluent
 
PDF
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Kai Wähner
 
PPTX
Unlock value with Confluent and AWS.pptx
Ahmed791434
 
PDF
Kafka and Machine Learning in Banking and Insurance Industry
Kai Wähner
 
PDF
The Top 5 Event Streaming Use Cases & Architectures in 2021
confluent
 
PDF
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Kai Wähner
 
PDF
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Kai Wähner
 
PPTX
Supply Chain Optimization with Apache Kafka
Kai Wähner
 
PDF
Apache Kafka as Event Streaming Platform for Microservice Architectures
Kai Wähner
 
PDF
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Kai Wähner
 
Mit Streaming die Brücken zum Erfolg bauen
confluent
 
Apache Kafka for Smart Grid, Utilities and Energy Production
Kai Wähner
 
Keine Angst vorm Dinosaurier: Mainframe-Integration und -Offloading mit Confl...
Precisely
 
Fast Data – Fast Cars: Wie Apache Kafka die Datenwelt revolutioniert
confluent
 
Apache Kafka® and Analytics in a Connected IoT World
confluent
 
Apache Kafka Landscape for Automotive and Manufacturing
Kai Wähner
 
IoT and Event Streaming at Scale with Apache Kafka
confluent
 
IoT Architectures for Apache Kafka and Event Streaming - Industry 4.0, Digita...
Kai Wähner
 
Enabling Smarter Cities and Connected Vehicles with an Event Streaming Platfo...
Kai Wähner
 
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Kai Wähner
 
Set Your Data In Motion - CTO Roundtable
confluent
 
Apache Kafka for Predictive Maintenance in Industrial IoT / Industry 4.0
Kai Wähner
 
Unlock value with Confluent and AWS.pptx
Ahmed791434
 
Kafka and Machine Learning in Banking and Insurance Industry
Kai Wähner
 
The Top 5 Event Streaming Use Cases & Architectures in 2021
confluent
 
Top 5 Event Streaming Use Cases for 2021 with Apache Kafka
Kai Wähner
 
Apache Kafka for Automotive Industry, Mobility Services & Smart City
Kai Wähner
 
Supply Chain Optimization with Apache Kafka
Kai Wähner
 
Apache Kafka as Event Streaming Platform for Microservice Architectures
Kai Wähner
 
Apache Kafka in the Automotive Industry (Connected Vehicles, Manufacturing 4....
Kai Wähner
 
Ad

More from Kai Wähner (20)

PDF
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Kai Wähner
 
PDF
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kai Wähner
 
PDF
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 
PDF
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Kai Wähner
 
PDF
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Kai Wähner
 
PDF
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Kai Wähner
 
PDF
Apache Kafka in the Healthcare Industry
Kai Wähner
 
PDF
Apache Kafka in the Healthcare Industry
Kai Wähner
 
PDF
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
PDF
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
PDF
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Kai Wähner
 
PDF
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Kai Wähner
 
PDF
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Kai Wähner
 
PDF
IBM Cloud Pak for Integration with Confluent Platform powered by Apache Kafka
Kai Wähner
 
PDF
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Kai Wähner
 
PDF
Apache Kafka in the Insurance Industry
Kai Wähner
 
PDF
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Kai Wähner
 
PDF
Connected Vehicles and V2X with Apache Kafka
Kai Wähner
 
PDF
Can Apache Kafka Replace a Database?
Kai Wähner
 
PDF
Apache Kafka in the Airline, Aviation and Travel Industry
Kai Wähner
 
Apache Kafka as Data Hub for Crypto, NFT, Metaverse (Beyond the Buzz!)
Kai Wähner
 
Kafka for Live Commerce to Transform the Retail and Shopping Metaverse
Kai Wähner
 
The Heart of the Data Mesh Beats in Real-Time with Apache Kafka
Kai Wähner
 
Data Warehouse vs. Data Lake vs. Data Streaming – Friends, Enemies, Frenemies?
Kai Wähner
 
Serverless Kafka and Spark in a Multi-Cloud Lakehouse Architecture
Kai Wähner
 
Data Streaming with Apache Kafka in the Defence and Cybersecurity Industry
Kai Wähner
 
Apache Kafka in the Healthcare Industry
Kai Wähner
 
Apache Kafka in the Healthcare Industry
Kai Wähner
 
Kafka for Real-Time Replication between Edge and Hybrid Cloud
Kai Wähner
 
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 
Apache Kafka in the Public Sector (Government, National Security, Citizen Ser...
Kai Wähner
 
Telco 4.0 - Payment and FinServ Integration for Data in Motion with 5G and Ap...
Kai Wähner
 
Apache Kafka for Cybersecurity and SIEM / SOAR Modernization
Kai Wähner
 
IBM Cloud Pak for Integration with Confluent Platform powered by Apache Kafka
Kai Wähner
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies?
Kai Wähner
 
Apache Kafka in the Insurance Industry
Kai Wähner
 
Apache Kafka and MQTT - Overview, Comparison, Use Cases, Architectures
Kai Wähner
 
Connected Vehicles and V2X with Apache Kafka
Kai Wähner
 
Can Apache Kafka Replace a Database?
Kai Wähner
 
Apache Kafka in the Airline, Aviation and Travel Industry
Kai Wähner
 
Ad

Recently uploaded (20)

PDF
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
PPT
MergeSortfbsjbjsfk sdfik k
RafishaikIT02044
 
PPTX
MiniTool Power Data Recovery Full Crack Latest 2025
muhammadgurbazkhan
 
PPTX
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
PPTX
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
PPTX
A Complete Guide to Salesforce SMS Integrations Build Scalable Messaging With...
360 SMS APP
 
PPTX
The Role of a PHP Development Company in Modern Web Development
SEO Company for School in Delhi NCR
 
PPTX
Tally software_Introduction_Presentation
AditiBansal54083
 
PPTX
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
PDF
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
PPTX
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
PDF
Executive Business Intelligence Dashboards
vandeslie24
 
PPTX
Equipment Management Software BIS Safety UK.pptx
BIS Safety Software
 
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
PDF
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pdf
Varsha Nayak
 
PPTX
Revolutionizing Code Modernization with AI
KrzysztofKkol1
 
PPTX
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
PDF
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
PDF
Digger Solo: Semantic search and maps for your local files
seanpedersen96
 
PPTX
Java Native Memory Leaks: The Hidden Villain Behind JVM Performance Issues
Tier1 app
 
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
MergeSortfbsjbjsfk sdfik k
RafishaikIT02044
 
MiniTool Power Data Recovery Full Crack Latest 2025
muhammadgurbazkhan
 
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
A Complete Guide to Salesforce SMS Integrations Build Scalable Messaging With...
360 SMS APP
 
The Role of a PHP Development Company in Modern Web Development
SEO Company for School in Delhi NCR
 
Tally software_Introduction_Presentation
AditiBansal54083
 
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
Executive Business Intelligence Dashboards
vandeslie24
 
Equipment Management Software BIS Safety UK.pptx
BIS Safety Software
 
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pdf
Varsha Nayak
 
Revolutionizing Code Modernization with AI
KrzysztofKkol1
 
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
Digger Solo: Semantic search and maps for your local files
seanpedersen96
 
Java Native Memory Leaks: The Hidden Villain Behind JVM Performance Issues
Tier1 app
 

Resilient Real-time Data Streaming across the Edge and Hybrid Cloud with Apache Kafka

  • 1. Resilient Real-time Data Streaming across the Edge and Hybrid Cloud Use Cases, Architectures, and Examples for Data in Motion powered by Apache Kafka Kai Waehner Field CTO [email protected] linkedin.com/in/kaiwaehner confluent.io kai-waehner.de @KaiWaehner
  • 2. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 3. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 4. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud AWS Cloud Outage hit Disney World Visitors… https://www.cnet.com/tech/services-and-software/disney-parks-were-already-facing-heat-from-fans-then-an-aws-outage-came-along/
  • 5. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Why one data center or cloud region is not good enough? Latency / Cost Disaster Recovery Privacy / Compliance
  • 6. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Disaster Recovery – RPO and RTO RPO = Recovery Point Objective RTO = Recovery Time Objective
  • 7. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud ZERO RPO requires synchronous replication ZERO RTO requires seamless failover
  • 8. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 9. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Real-time Data in Motion beats Slow Data. Transportation Predictive maintenance Driver-rider match ETA updates Banking Instant payments Fraud detection Mobile applications / customer experience Retail Real-time inventory Real-time POS reporting Personalization Entertainment Real-time recommendations Personalized news feed In-car purchases
  • 10. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Apache Kafka is the Platform for Data in Motion MES ERP Sensors Mobile Customer 360 Real-time Alerting System Data warehouse Producers Consumers Streams and storage of real time events Stream processing apps Connectors Connectors Stream processing apps Supplier Alert Forecast Inventory Customer Order 11
  • 11. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Apache Kafka = A Resilient, Distributed System Broker 1 Topic1 partition1 Broker 2 Broker 3 Broker 4 Topic1 partition1 Topic1 partition1 Leader Follower Topic1 partition2 Topic1 partition2 Topic1 partition2 Topic1 partition3 Topic1 partition4 Topic1 partition3 Topic1 partition3 Topic1 partition4 Topic1 partition4
  • 12. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Resilient Data Streaming across Edge and Hybrid Cloud Streaming Replication between Kafka Clusters Bridge to Databases, Data Lakes, Apps, APIs, SaaS Aggregate Small Footprint Edge Deployments with Replication (Aggregation) Simplify Disaster Recovery Operations with Multi-Region Clusters for RPO=0 and RTO~0 Stream Data Globally with Replication and Cluster Linking 16
  • 13. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Shipping Industry Marine, Oil Transport, Vessel Fleet, Shipping Line, Drones Real-time Operations, Logistics, Predictive Maintenance, Security Customer Data Crew, Cargo Vessel Data Fuel Consumption, Speed, Planned Maintenance Automatic Identification System (AIS) Unique Identification, Position, Course, Weather, Draft Drone Data Deliveries, Survey/Inspection of Assets such as Oil Rigs, Pipelines, Offshore Turbines Edge Analytics Bidirectional Edge to Cloud Integration Data Ingestion Stream Processing Data Integration Logistics Track&Trace Routing Monitoring Alerting Command&Control Batch Analytics Reporting Machine Learning Backend Systems Oracle, SAP, OSIsoft PI, etc. X = Event Streaming X = Other Technologies Bi-Directional Hybrid Cloud Replication
  • 14. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 15. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud BMW Group Mission-critical workloads across the edge and cloud • Why Kafka? Decoupling. Transparency. Innovation. • Why Confluent? Stability is key in manufacturing • Decoupling between logistics and production systems • Cloud-first event streaming on Azure Cloud with serverless Confluent Cloud • Use case • Logistics and supply chain in global plants • Right stock in place (physically and in ERP systems like SAP) • Just in time, just in sequence • Lot of critical applications 19 Jay Kreps, Confluent CEO Felix Böhm, BMW Plant Digitalization and Cloud Transformation Keynote at Kafka Summit Eurpoe 2021: https://www.youtube.com/watch?v=3cG2ud7TRs4
  • 16. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Condition Monitoring and Predictive Maintenance Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB) 8 9 1 0 1 1 1 2 1 3 1 4 1 5 1 6 3 4 5 6 7 2 1 Time Sensor Events
  • 17. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Condition Monitoring and Predictive Maintenance Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB) 8 9 1 0 1 1 1 2 1 3 1 4 1 5 3 4 5 6 7 2 1 Time Sensor Events 1 6 Condition Monitoring (Temperature Spikes) Stateless Filter Above-Threshold Events Streams builder .stream(”temperature-sensor") .filter((key, sensor-data) -> sensor-data.temperature > 100) .to(”temperature-spikes");
  • 18. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Condition Monitoring and Predictive Maintenance Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB) 8 9 1 0 1 1 1 2 1 3 1 4 1 5 1 6 3 4 5 6 7 2 1 Time Sensor Events Predictive Maintenance (Continuous Anomaly Detection) Stateful Correlation of Events CREATE TABLE anomaly_detection AS SELECT temperature_spike_id, COUNT(*) AS total_spikes, AVG(temperature) AS avg_temperature FROM sensor-data WINDOW TUMBLING (SIZE 1 HOUR) GROUP BY temperature_spike_id EMIT CHANGES;
  • 19. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Condition Monitoring and Predictive Maintenance Stateless and stateful stream processing for real-time data correlation with Kafka-native tools (Kafka Streams / ksqlDB) 8 9 1 0 1 1 1 2 1 3 1 4 1 5 1 6 3 4 5 6 7 2 1 Time Sensor Events Predictive Maintenance (Continuous Anomaly Detection) Real-time Machine Learning CREATE STREAM anomaly_detection AS SELECT sensor_id, detect_anomaly(sensor_values) FROM machine; TensorFlow model embedded in User Defined Function (UDF)
  • 20. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 21. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Disaster Recovery @ JPMorgan https://www.confluent.io/kafka-summit-san-francisco-2019/secure-kafka-at-scale-in-true-multi-tenant-environment
  • 22. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Multi-Region Kafka Cluster in Financial Services Zero downtime + zero data loss (RPO=0 and RTO~0) + automated disaster recovery Large Bank Transaction Log Transaction Log Location Location synchronous asynchronous ● ‘Payment’ transactions enter from us-east and us-west with fully synchronous replication ● ‘Log’ and ‘Location’ information in the same cluster use async - optimized for latency ● Automated disaster recovery (zero downtime, zero data loss) Result: Clearing time from ‘deposit’ to ‘available’ goes from 5 days to 5 seconds (including security checks) (Only available in Confluent Platform) Hundreds of miles distance between the data centers
  • 23. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Migration with Cluster Linking 27
  • 24. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Robinhood Mission: “Democratize finance for all” Kafka for mission-critical and analytics use cases Microservices using various technologies 28 https://www.confluent.io/events/kafka-summit-americas-2021/taming-a-massive-fleet-of-python-based-kafka-apps-at-robinhood/
  • 25. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Thought Machine – Core Banking • Cloud-native core banking software • Transactional workloads (24/7, zero data loss) • Flexible product engine powered by smart contracts (not blockchain) 29 https://www.confluent.io/events/kafka-summit-apac-2021/scaling-a-core-banking-engine-using-apache-kafka/
  • 26. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud “Transactions” in Apache Kafka 30 Exactly-Once Semantics (EOS) available since Kafka 0.11 (June 2017): https://cwiki.apache.org/confluence/display/KAFKA/KIP-98+-+Exactly+Once+Delivery+and+Transactional+Messaging https://www.confluent.io/kafka-summit-london18/dont-repeat-yourself-introducing-exactly-once-semantics-in-apache-kafka/
  • 27. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 28. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Royal Caribbean - Offline Edge for Swimming Retail Stores https://www.confluent.io/kafka-summit-lon19/seamless-guest-experience-with-kafka-streams/
  • 29. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud CRM 3rd party payment provider Context-specific real-time upsell Customer data Payment processing and fraud detection as a service Manager Get report API Customer Customer Customer data Train schedule Payment data Loyalty information Streams of real time events Customer data Train schedule Payment data Loyalty information Streams of real time events Customer data Train schedule Payment data Loyalty information Streams of real time events Hybrid Retail Architecture
  • 30. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Point of Sale (POS) Loyalty System Local Inventory Management Payment Discount Customer data Train schedule Payment data Loyalty information Streams of real time events Global Inventory Management Event Streaming at the Edge in the Smart Retail Store Item Availability
  • 31. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Omnichannel Retail Time P C3 C2 C1 Sales Talk on site in Car Dealership Right now Location-based Customer Action Customer 360 (Website, Mobile App, On Site in Store, In-Car) Car Configurator 10 and 8 days ago Context-specific Marketing Campaign 90 and 60 days ago
  • 32. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Agenda 1) Resilient enterprise architectures 2) Real-time data streaming with the Apache Kafka ecosystem 3) Cloud-first and serverless Industrial IoT in automotive 4) Multi-region infrastructure for core banking 5) Hybrid cloud for customer experiences in retail 6) Disconnected edge for safety and security in the public sector
  • 33. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Data Processing at the Edge Time P C1 C2 C3 Know-your-customer Loyalty app, predictive behavior, … Estimated time of arrival Connect to the gaming server for kids Play games, earn rewards, communicate with other kids in the train, … Always on (even “offline”) Replayability Cost-efficiency Low latency
  • 34. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Devon Energy Oil & Gas Industry Improve drilling and well completion operations Edge stream processing/analytics + closed-loop control ready Vendor agnostic (pumping, wireline, coil, offset wells, drilling operations, producing wells) Replication to the cloud in real-time at scale Cloud agnostic (AWS, GCP, Azure) Source: Energy in Data - Powered by AAPG, SEG & SPE: energyindata.org
  • 35. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Smart Soldiers at the Edge 39 Sensor A Sensor B Sensor X MQTT Confluent Platform (Single Broker) Single Kafka broker deployed on a small computer and leveraging Cluster Linking to publish sensor data to the Command Post. Command Post running Confluent Platform aggregating information from soliders and other sensor data Weather Personnel Logistics Targets Sensor data published to Command Post when connected to network Enhanced Situational Awareness
  • 36. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Why people choose Confluent for building resilient architectures?
  • 37. kai-waehner.de @KaiWaehner – Resilient Data Streaming at Edge and Hybrid Cloud Car Engine Car Self-driving Car Confluent completes Apache Kafka. Cloud-native. Everywhere.