Domain Driven Data
Apache Kafka and the Data Mesh
James Gollan, Senior Solutions Engineer at Confluent
Why are we talking about Data Mesh at a Kafka meetup?
Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
Producer Guarantees
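[Diagram: producer P writes to Topic1 partition1, which is replicated across Broker 1, Broker 2 and Broker 3; one replica is the leader, the others are followers, and an ack is returned to the producer.]
Producer Properties
acks=all
min.insync.replicas=2 (set on the broker or topic, not on the producer)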
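The interplay of these two settings can be sketched in a few lines of Python. This is a toy model, not Kafka client code: `accept_write` and its parameters are hypothetical names chosen for illustration. It only mirrors the documented rule that, for a producer using acks=all, the leader rejects a write when fewer than min.insync.replicas replicas are in sync.

```python
def accept_write(acks: str, in_sync_replicas: int, min_insync_replicas: int) -> bool:
    """Toy model: does the partition leader accept a produce request?"""
    if in_sync_replicas < 1:
        # No in-sync replica means no leader, so nothing can be written.
        return False
    if acks == "all":
        # min.insync.replicas is only enforced for acks=all requests.
        return in_sync_replicas >= min_insync_replicas
    # acks=0 or acks=1: the leader alone is enough.
    return True


# Three brokers as on the slide, min.insync.replicas=2,
# with the in-sync replica set shrinking from 3 to 1.
for isr in (3, 2, 1):
    print(isr, accept_write("all", isr, 2))
```

With three replicas and min.insync.replicas=2, the partition tolerates one broker failure and still accepts acks=all writes; a second failure makes it reject them rather than risk losing acknowledged data.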
What problem are we trying to solve?
Monolithic datastores
Centralised processing and governance
Bottlenecks for processing and analysing data for the business
Data scientists don’t have a full understanding of the data’s context
Results in a data swamp
Data mesh is an attempt to solve these problems
Data Mesh
Concept introduced by Zhamak Dehghani of ThoughtWorks
Break apart the ‘data monolith’
Treats domains as first-class citizens when dealing with data
Domains encouraged to stop treating data as an asset, and to start treating data as a product
Emphasis on a self-service data platform
Federated governance of organisational data
This seems kinda familiar - where have I heard this before...
Could it be microservices?
Microservices and Domain Driven Design
This problem has been solved before for the monolithic application
The monolith was broken down into microservices
Creation of these microservices emphasises business domains
The bounded context provides the public interfaces for the domain
Within the bounded context, domain-specific language and business logic are used
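A minimal sketch of that split between a domain's internal language and its public interface, in Python. Every name here (`Shipment`, `ShipmentDispatched`, `publish`, and the fields) is hypothetical and chosen only for illustration; the talk does not prescribe any of them.

```python
from dataclasses import dataclass, asdict


@dataclass
class Shipment:
    """Internal model: uses the shipping domain's own vocabulary."""
    waybill_no: str
    carrier_code: str
    dock_slot: int  # internal detail that is never published


@dataclass(frozen=True)
class ShipmentDispatched:
    """Public event: the only shape other domains may depend on."""
    shipment_id: str
    carrier: str


def publish(shipment: Shipment) -> dict:
    """Translate the internal model into the public contract."""
    event = ShipmentDispatched(
        shipment_id=shipment.waybill_no,
        carrier=shipment.carrier_code,
    )
    return asdict(event)


print(publish(Shipment("WB-1", "ACME", dock_slot=7)))
```

The internal model can change freely as long as the published event keeps its shape, which is the same property a data product needs.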
whiteboard...
Summing up
Kafka’s role in the data mesh
Kafka
Kafka facilitates the data mesh by acting as a central hub for events
Infinite storage in Kafka allows it to be used as the source of truth within the organisation
Kafka’s role in the data mesh
Connect
Domain-based Connect workers allow domain owners to integrate source and sink connectors
One of these sinks might be a data warehouse where the domain can analyse its own data sets, potentially combined with organisation-wide data sets from other domains
Kafka’s role in the data mesh
ksqlDB
Distributed ksqlDB allows domains to run their own real-time stream processing
This may be used to prepare data from multiple topics for publication across the organisation
It may also be used for advanced stream processing, such as real-time fraud detection
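As a rough illustration of the kind of logic such a fraud-detection query expresses, here is a plain Python stand-in: count payments per card in fixed (tumbling) time windows and flag cards that exceed a threshold. This is not ksqlDB code or its API; `flag_suspicious`, the window size, the threshold and the sample data are all assumptions made for the sketch.

```python
from collections import defaultdict


def flag_suspicious(events, window_ms=60_000, threshold=3):
    """Return card ids with more than `threshold` payments in any tumbling window.

    `events` is an iterable of (timestamp_ms, card_id) tuples.
    """
    counts = defaultdict(int)
    for ts, card in events:
        # Start of the tumbling window this event falls into.
        window_start = ts - ts % window_ms
        counts[(card, window_start)] += 1
    return sorted({card for (card, _), n in counts.items() if n > threshold})


payments = [
    (1_000, "A"), (2_000, "A"), (3_000, "A"), (4_000, "A"),
    (5_000, "B"), (70_000, "A"),
]
print(flag_suspicious(payments))
```

In a real deployment the same grouping and windowing would run continuously inside the domain's own stream processor rather than over an in-memory list.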
Kafka’s role in the data mesh
Schema registry
With event-driven architecture, the schema is the API
Schema registry ensures consistency in event structure, and enables forward and backward compatibility across schema changes
It may be extended to provide more data governance features, such as field-level tagging, data catalog functionality, etc.
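The two compatibility directions can be sketched with a deliberately simplified check in Python. This is not the Schema Registry API and it ignores most real resolution rules (type promotion, aliases, nested records); the schema shape and the function names are assumptions for illustration. The idea it shows: backward compatible means a reader on the new schema can decode data written with the old one, and forward compatible means the reverse.

```python
def can_read(reader: dict, writer: dict) -> bool:
    """True if data written with `writer` can be decoded using `reader`.

    A schema is {field_name: {"type": str, "default": ...}}; "default" is optional.
    """
    for name, spec in reader.items():
        if name in writer:
            if writer[name]["type"] != spec["type"]:
                return False
        elif "default" not in spec:
            # The reader needs a field the writer never wrote.
            return False
    return True


def is_backward_compatible(new: dict, old: dict) -> bool:
    return can_read(reader=new, writer=old)


def is_forward_compatible(new: dict, old: dict) -> bool:
    return can_read(reader=old, writer=new)


v1 = {"id": {"type": "string"}}
v2 = {"id": {"type": "string"}, "email": {"type": "string", "default": ""}}
v3 = {"id": {"type": "string"}, "email": {"type": "string"}}  # no default

print(is_backward_compatible(v2, v1), is_forward_compatible(v2, v1))
print(is_backward_compatible(v3, v1), is_forward_compatible(v3, v1))
```

Adding a field with a default (v2) is safe in both directions, while adding a required field (v3) breaks consumers that upgrade to the new schema before all old data has been read.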
Converters
Questions
(or did we run out of time?)