SlideShare a Scribd company logo
Jessy Jordan, Sta
ff
Software Engineer at Meroxa, @jayjayjpg
Building on top of Kafka
How Meroxa leveraged Kafka to build a Stream Processing
Application Platform
Apache, Apache Kafka, Kafka and the Kafka logo are trademarks of the Apache Software Foundation. The Apache Software Foundation has no a
ffi
liation with
and does not endorse the materials provided at this event.
@meroxadata
🤔
@meroxadata
If you wanted to build a stream processing application
platform…
🤔
…that allowed you to process real-time data e
ffi
ciently for
hundreds and thousands of users….
@meroxadata
🤔
Including a great UX: wouldn’t you want to build on the
shoulders of giants, such as Apache Kafka?
@meroxadata
What is a Stream Processing Application Platform?
Allowing users to sync,
transform and persist
real-time data
Application Platform →
code-
fi
rst user interface
@meroxadata
What is a Stream Processing Application Platform?
@meroxadata
Building an Stream Processing Application Platform
Why use Kafka?
@meroxadata
Real-time stream processing essential for modern data engineering
Vast majority of Fortune 100
companies already relies on
event streaming to make use
of real-time data
Mission to build tools
equipped for real-time data
use cases
→ Kafka as de facto standard
for (event) streaming
https://kafka.apache.org/
@meroxadata
Building a Streaming Platform as a Service:
Why use Kafka?
Robust
Scalable
Easy to observe
Easy to extend
@meroxadata
Building a Streaming Platform as a Service:
Why use Kafka?
Robust
Scalable
Easy to observe
Easy to extend
Team expertise working
with Kafka
@meroxadata
Apache Kafka as integral part of the Meroxa platform
Technology Stack
@meroxadata
Control Plane Data Plane
REST API
Provisioner
Microservice
MSK
CRD
CRD
CRD CRD
Controller
Microservice
Kafka
Connect
@meroxadata
Orchestrating Data Applications as part of the Meroxa
platform
Kubernetes Operator
@meroxadata
Control Plane Data Plane
REST API
Provisioner
Microservice
MSK
CRD
CRD
CRD CRD
Controller
Microservice
Kafka
Connect
@meroxadata
Provisioning
Reliance on AWS Cloud Services
Provisioning of Data Plane via AWS
Cloudformation, incl.
Managed Kubernetes Service (EKS)
Fully Managed Apache Kafka on AWS
(MSK)
AWS S3, ECR (Streaming App Build)
Data Plane provisioning to external end user
VPC possible
@meroxadata
Kafka cluster setup and scaling
Setup: 8 partitions, 3 replications, brokers
across several AZs
Horizontal + Vertical scaling of Kafka
Connect pods
Fully managed Kafka cluster
Time + Cost-e
ffi
ciency
Ability to focus on product development,
instead of data infrastructure operation
@meroxadata
Challenges of using Cloud-Hosted Kafka
Up- and down-scaling of storage limited
Still some operational burden for Kafka that
goes into maintenance in contrast to Cloud
Native
upgrades
scaling
monitoring
Limitation: air-gapped environments for
external deployment
@meroxadata
Deploy, destroy and modify Kafka connectors
Connector CRDs - platform
connectors based on Kafka Connect
Other CRDs also used for custom
connectors (Conduit)
Controllers create, delete, modify
connectors
Data Plane
MSK
CRD
CRD
CRD CRD
Controller
Microservice
Kafka
Connect
@meroxadata
@meroxadata
@meroxadata
CLI allows user to initialize data app as a local code repository (ex: Go)
@meroxadata
User deploys data application via the CLI
@meroxadata
Control Plane Data Plane
REST API
Provisioner
Microservice
Controller
Microservice
CRD CRD
CRD
Kafka
Connect
MSK?
HTTP Request
POST /applications
@meroxadata
Control Plane Data Plane
REST API
Provisioner
Microservice
Controller
Microservice
CRD CRD
CRD CRD
Kafka
Connect
MSK?
POST /applications
{
"spec": {
"connectors": [{
"collection": "user_activity",
"type": "source",
"resource": "my-postgres",
"con
fi
g": {
"logical_replication": true
}
},
{
"collection": "user_activity_enriched",
"type": "destination",
"resource": "my-kafka-cluster",
}],
"functions": [
{
"name": "user_activity_enriched",
"image": "ftorres/enrich:9"
}
],
"metadata": {
"turbine": {
"language": "go",
"version": "0.1.0",
},
}
}
}
Client sends an intermediary representation of the user’s data
application to the server
@meroxadata
Control Plane Data Plane
REST API
Provisioner
Microservice
MSK
CRD
CRD
CRD CRD
Controller
Microservice
Kafka
Connect
HTTP Request
@meroxadata
Control Plane Data Plane
REST API
Provisioner
Microservice
+
Data Plane
API
Controller
Microservice
CRD CRD
CRD CRD
Kafka
Connect
MSK?
HTTP Request
Upsert custom
resources
@meroxadata
Using Kubernetes to deploy, destroy and modify Kafka connectors
Controller
Microservice
Connector
Custom
Resource
De
fi
nition
Kafka Connect
MSK
Upsert custom
resources
Connector Controller
@meroxadata
Expanding Functionality of the Meroxa Platform
Extensibility
@meroxadata
Apache Kafka connector ecosystem
Apache Kafka as OSS promotes shared & faster development of compatible data integrations
120+ pre-built 🔌
Con
fl
uent
9 🔌
Debezium
Custom
(Kafka Connect)
@meroxadata
Extending the platform with community and custom connectors
Meroxa platform uses:
@meroxadata
Extending the platform with community and custom connectors
Meroxa platform uses:
Debezium connectors
@meroxadata
Extending the platform with community and custom connectors
Meroxa platform uses:
Debezium connectors
Custom Kafka Connect
connectors
@meroxadata
Apache Kafka connector ecosystem
Apache Kafka as OSS promotes shared & faster development of compatible data integrations
Con
fl
uent Debezium Kafka Connect
@meroxadata
Conduit connector ecosystem
Conduit as data integration OSS with its own connector ecosystem
Connector SDK (technically
language agnostic framework)
gRPC interface
OpenCDC Schema Format
@meroxadata
Conduit as alternative data connector framework
Enabling Kafka Connect - Conduit connector data pipelines
@meroxadata
Conduit as alternative data connector framework
Enabling Kafka Connect - Conduit connector data pipelines
@meroxadata
Extending the platform with community and custom connectors
Meroxa platform uses:
Debezium connectors
Custom Kafka Connect
connectors
Custom connectors
(Conduit)
@meroxadata
Extending the platform with community and custom connectors
Apache Kafka with open-source
connector ecosystem
Debezium providing open-source
platform for CDC, incl.
connectors
Conduit with its own connector
ecosystem
Including connectors integrating
back to end user’s Kafka clusters
https://conduit.io/
@meroxadata
Monitoring Streaming on the Meroxa Platform
Observability
@meroxadata
Observability for Meroxa platform end users
@meroxadata
Observability for Meroxa platform end users: Connector State
Running
Failed
Pending
@meroxadata
Observability for Meroxa platform end users: Connector state
Controller Microservice: Connector
Controller polls for connector
status
Running
Failed
Pending
Controller
Microservice
Connector
Custom
Resource
De
fi
nition
Kafka Connect
Read state custom
resources
Connector Controller
Conduit server
Conduit Controllers
@meroxadata
Observability for Meroxa platform end users: Connector Logs
@meroxadata
Observability for Meroxa platform end users
@meroxadata
Observability for Meroxa platform end users
Simple aggregation
and
formatting
@meroxadata
How we monitor Kafka cluster and connectors
MSK metrics tracked with Prometheus Data
Plane instance
Transfer of metrics across plane
components with multiple Prometheus
instances
Aggregation of metrics in DataDog
→Prometheus as cost-e
ffi
cient metric tool
(open-source)
@meroxadata
Sum Up
@meroxadata
How Meroxa leveraged Kafka to build a Stream Processing Application
Platform
Why Kafka?
Scalable, robust and well-supported foundation for building
modern data engineering software
Creating scalable data infrastructure using Kafka
Extensibility of our platform with Debezium, Kafka Connect and
Conduit
Internal and end-user observability with Kafka Connect and logging
+ metrics tooling
@meroxadata
Thank you!
@meroxadata
discord.meroxa.com
🌐 meroxa.com
@meroxadata

More Related Content

Similar to Leverage Kafka to build a stream processing platform (20)

PDF
How to Write Great Kafka Connectors
confluent
 
PDF
Confluent Partner Tech Talk with Reply
confluent
 
PPTX
Kafka Tutorial - Introduction to Apache Kafka (Part 1)
Jean-Paul Azar
 
PDF
Kafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
Guido Schmutz
 
PPTX
Kafka Tutorial: Streaming Data Architecture
Jean-Paul Azar
 
PDF
Beyond the brokers - Un tour de l'écosystème Kafka
Florent Ramiere
 
PPTX
Confluent and Syncsort Webinar August 2016
Precisely
 
PPTX
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Michael Noll
 
PDF
Beyond the brokers - A tour of the Kafka ecosystem
Damien Gasparina
 
PDF
Beyond the Brokers: A Tour of the Kafka Ecosystem
confluent
 
PDF
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Kai Wähner
 
PPTX
messaging.pptx
NParakh1
 
PPTX
Current and Future of Apache Kafka
Joe Stein
 
PDF
Data pipeline with kafka
Mole Wong
 
PDF
Show Me Kafka Tools That Will Increase My Productivity! (Stephane Maarek, Dat...
confluent
 
PDF
Partner Development Guide for Kafka Connect
confluent
 
PPTX
apidays LIVE India - REST the Events - REST APIs for Event-Driven Architectur...
apidays
 
PDF
Diving into the Deep End - Kafka Connect
confluent
 
PPTX
apidays LIVE Singapore 2021 - REST the Events - REST APIs for Event-Driven Ar...
apidays
 
PPTX
Introduction to kafka connector
Knoldus Inc.
 
How to Write Great Kafka Connectors
confluent
 
Confluent Partner Tech Talk with Reply
confluent
 
Kafka Tutorial - Introduction to Apache Kafka (Part 1)
Jean-Paul Azar
 
Kafka Connect & Kafka Streams/KSQL - the ecosystem around Kafka
Guido Schmutz
 
Kafka Tutorial: Streaming Data Architecture
Jean-Paul Azar
 
Beyond the brokers - Un tour de l'écosystème Kafka
Florent Ramiere
 
Confluent and Syncsort Webinar August 2016
Precisely
 
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Michael Noll
 
Beyond the brokers - A tour of the Kafka ecosystem
Damien Gasparina
 
Beyond the Brokers: A Tour of the Kafka Ecosystem
confluent
 
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Kai Wähner
 
messaging.pptx
NParakh1
 
Current and Future of Apache Kafka
Joe Stein
 
Data pipeline with kafka
Mole Wong
 
Show Me Kafka Tools That Will Increase My Productivity! (Stephane Maarek, Dat...
confluent
 
Partner Development Guide for Kafka Connect
confluent
 
apidays LIVE India - REST the Events - REST APIs for Event-Driven Architectur...
apidays
 
Diving into the Deep End - Kafka Connect
confluent
 
apidays LIVE Singapore 2021 - REST the Events - REST APIs for Event-Driven Ar...
apidays
 
Introduction to kafka connector
Knoldus Inc.
 

More from confluent (20)

PDF
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
confluent
 
PPTX
Webinar Think Right - Shift Left - 19-03-2025.pptx
confluent
 
PDF
Migration, backup and restore made easy using Kannika
confluent
 
PDF
Five Things You Need to Know About Data Streaming in 2025
confluent
 
PDF
Data in Motion Tour Seoul 2024 - Keynote
confluent
 
PDF
Data in Motion Tour Seoul 2024 - Roadmap Demo
confluent
 
PDF
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
confluent
 
PDF
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
confluent
 
PDF
Data in Motion Tour 2024 Riyadh, Saudi Arabia
confluent
 
PDF
Build a Real-Time Decision Support Application for Financial Market Traders w...
confluent
 
PDF
Strumenti e Strategie di Stream Governance con Confluent Platform
confluent
 
PDF
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
confluent
 
PDF
Building Real-Time Gen AI Applications with SingleStore and Confluent
confluent
 
PDF
Unlocking value with event-driven architecture by Confluent
confluent
 
PDF
Il Data Streaming per un’AI real-time di nuova generazione
confluent
 
PDF
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
PDF
Break data silos with real-time connectivity using Confluent Cloud Connectors
confluent
 
PDF
Building API data products on top of your real-time data infrastructure
confluent
 
PDF
Speed Wins: From Kafka to APIs in Minutes
confluent
 
PDF
Evolving Data Governance for the Real-time Streaming and AI Era
confluent
 
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
confluent
 
Webinar Think Right - Shift Left - 19-03-2025.pptx
confluent
 
Migration, backup and restore made easy using Kannika
confluent
 
Five Things You Need to Know About Data Streaming in 2025
confluent
 
Data in Motion Tour Seoul 2024 - Keynote
confluent
 
Data in Motion Tour Seoul 2024 - Roadmap Demo
confluent
 
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
confluent
 
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
confluent
 
Data in Motion Tour 2024 Riyadh, Saudi Arabia
confluent
 
Build a Real-Time Decision Support Application for Financial Market Traders w...
confluent
 
Strumenti e Strategie di Stream Governance con Confluent Platform
confluent
 
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
confluent
 
Building Real-Time Gen AI Applications with SingleStore and Confluent
confluent
 
Unlocking value with event-driven architecture by Confluent
confluent
 
Il Data Streaming per un’AI real-time di nuova generazione
confluent
 
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
Break data silos with real-time connectivity using Confluent Cloud Connectors
confluent
 
Building API data products on top of your real-time data infrastructure
confluent
 
Speed Wins: From Kafka to APIs in Minutes
confluent
 
Evolving Data Governance for the Real-time Streaming and AI Era
confluent
 
Ad

Recently uploaded (20)

PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
PPTX
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PPTX
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
PDF
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
What’s my job again? Slides from Mark Simos talk at 2025 Tampa BSides
Mark Simos
 
Agentforce World Tour Toronto '25 - MCP with MuleSoft
Alexandra N. Martinez
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Designing_the_Future_AI_Driven_Product_Experiences_Across_Devices.pptx
presentifyai
 
The Rise of AI and IoT in Mobile App Tech.pdf
IMG Global Infotech
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
Ad

Leverage Kafka to build a stream processing platform

  • 1. Jessy Jordan, Sta ff Software Engineer at Meroxa, @jayjayjpg Building on top of Kafka How Meroxa leveraged Kafka to build a Stream Processing Application Platform Apache, Apache Kafka, Kafka and the Kafka logo are trademarks of the Apache Software Foundation. The Apache Software Foundation has no a ffi liation with and does not endorse the materials provided at this event. @meroxadata
  • 2. 🤔 @meroxadata If you wanted to build a stream processing application platform…
  • 3. 🤔 …that allowed you to process real-time data e ffi ciently for hundreds and thousands of users…. @meroxadata
  • 4. 🤔 Including a great UX: wouldn’t you want to build on the shoulders of giants, such as Apache Kafka? @meroxadata
  • 5. What is a Stream Processing Application Platform? Allowing users to sync, transform and persist real-time data Application Platform → code- fi rst user interface @meroxadata
  • 6. What is a Stream Processing Application Platform? @meroxadata
  • 7. Building an Stream Processing Application Platform Why use Kafka? @meroxadata
  • 8. Real-time stream processing essential for modern data engineering Vast majority of Fortune 100 companies already relies on event streaming to make use of real-time data Mission to build tools equipped for real-time data use cases → Kafka as de facto standard for (event) streaming https://kafka.apache.org/ @meroxadata
  • 9. Building a Streaming Platform as a Service: Why use Kafka? Robust Scalable Easy to observe Easy to extend @meroxadata
  • 10. Building a Streaming Platform as a Service: Why use Kafka? Robust Scalable Easy to observe Easy to extend Team expertise working with Kafka @meroxadata
  • 11. Apache Kafka as integral part of the Meroxa platform Technology Stack @meroxadata
  • 12. Control Plane Data Plane REST API Provisioner Microservice MSK CRD CRD CRD CRD Controller Microservice Kafka Connect @meroxadata
  • 13. Orchestrating Data Applications as part of the Meroxa platform Kubernetes Operator @meroxadata
  • 14. Control Plane Data Plane REST API Provisioner Microservice MSK CRD CRD CRD CRD Controller Microservice Kafka Connect @meroxadata
  • 15. Provisioning Reliance on AWS Cloud Services Provisioning of Data Plane via AWS Cloudformation, incl. Managed Kubernetes Service (EKS) Fully Managed Apache Kafka on AWS (MSK) AWS S3, ECR (Streaming App Build) Data Plane provisioning to external end user VPC possible @meroxadata
  • 16. Kafka cluster setup and scaling Setup: 8 partitions, 3 replications, brokers across several AZs Horizontal + Vertical scaling of Kafka Connect pods Fully managed Kafka cluster Time + Cost-e ffi ciency Ability to focus on product development, instead of data infrastructure operation @meroxadata
  • 17. Challenges of using Cloud-Hosted Kafka Up- and down-scaling of storage limited Still some operational burden for Kafka that goes into maintenance in contrast to Cloud Native upgrades scaling monitoring Limitation: air-gapped environments for external deployment @meroxadata
  • 18. Deploy, destroy and modify Kafka connectors Connector CRDs - platform connectors based on Kafka Connect Other CRDs also used for custom connectors (Conduit) Controllers create, delete, modify connectors Data Plane MSK CRD CRD CRD CRD Controller Microservice Kafka Connect @meroxadata
  • 21. CLI allows user to initialize data app as a local code repository (ex: Go) @meroxadata
  • 22. User deploys data application via the CLI @meroxadata
  • 23. Control Plane Data Plane REST API Provisioner Microservice Controller Microservice CRD CRD CRD Kafka Connect MSK? HTTP Request POST /applications @meroxadata
  • 24. Control Plane Data Plane REST API Provisioner Microservice Controller Microservice CRD CRD CRD CRD Kafka Connect MSK? POST /applications { "spec": { "connectors": [{ "collection": "user_activity", "type": "source", "resource": "my-postgres", "con fi g": { "logical_replication": true } }, { "collection": "user_activity_enriched", "type": "destination", "resource": "my-kafka-cluster", }], "functions": [ { "name": "user_activity_enriched", "image": "ftorres/enrich:9" } ], "metadata": { "turbine": { "language": "go", "version": "0.1.0", }, } } } Client sends an intermediary representation of the user’s data application to the server @meroxadata
  • 25. Control Plane Data Plane REST API Provisioner Microservice MSK CRD CRD CRD CRD Controller Microservice Kafka Connect HTTP Request @meroxadata
  • 26. Control Plane Data Plane REST API Provisioner Microservice + Data Plane API Controller Microservice CRD CRD CRD CRD Kafka Connect MSK? HTTP Request Upsert custom resources @meroxadata
  • 27. Using Kubernetes to deploy, destroy and modify Kafka connectors Controller Microservice Connector Custom Resource De fi nition Kafka Connect MSK Upsert custom resources Connector Controller @meroxadata
  • 28. Expanding Functionality of the Meroxa Platform Extensibility @meroxadata
  • 29. Apache Kafka connector ecosystem Apache Kafka as OSS promotes shared & faster development of compatible data integrations 120+ pre-built 🔌 Con fl uent 9 🔌 Debezium Custom (Kafka Connect) @meroxadata
  • 30. Extending the platform with community and custom connectors Meroxa platform uses: @meroxadata
  • 31. Extending the platform with community and custom connectors Meroxa platform uses: Debezium connectors @meroxadata
  • 32. Extending the platform with community and custom connectors Meroxa platform uses: Debezium connectors Custom Kafka Connect connectors @meroxadata
  • 33. Apache Kafka connector ecosystem Apache Kafka as OSS promotes shared & faster development of compatible data integrations Con fl uent Debezium Kafka Connect @meroxadata
  • 34. Conduit connector ecosystem Conduit as data integration OSS with its own connector ecosystem Connector SDK (technically language agnostic framework) gRPC interface OpenCDC Schema Format @meroxadata
  • 35. Conduit as alternative data connector framework Enabling Kafka Connect - Conduit connector data pipelines @meroxadata
  • 36. Conduit as alternative data connector framework Enabling Kafka Connect - Conduit connector data pipelines @meroxadata
  • 37. Extending the platform with community and custom connectors Meroxa platform uses: Debezium connectors Custom Kafka Connect connectors Custom connectors (Conduit) @meroxadata
  • 38. Extending the platform with community and custom connectors Apache Kafka with open-source connector ecosystem Debezium providing open-source platform for CDC, incl. connectors Conduit with its own connector ecosystem Including connectors integrating back to end user’s Kafka clusters https://conduit.io/ @meroxadata
  • 39. Monitoring Streaming on the Meroxa Platform Observability @meroxadata
  • 40. Observability for Meroxa platform end users @meroxadata
  • 41. Observability for Meroxa platform end users: Connector State Running Failed Pending @meroxadata
  • 42. Observability for Meroxa platform end users: Connector state Controller Microservice: Connector Controller polls for connector status Running Failed Pending Controller Microservice Connector Custom Resource De fi nition Kafka Connect Read state custom resources Connector Controller Conduit server Conduit Controllers @meroxadata
  • 43. Observability for Meroxa platform end users: Connector Logs @meroxadata
  • 44. Observability for Meroxa platform end users @meroxadata
  • 45. Observability for Meroxa platform end users Simple aggregation and formatting @meroxadata
  • 46. How we monitor Kafka cluster and connectors MSK metrics tracked with Prometheus Data Plane instance Transfer of metrics across plane components with multiple Prometheus instances Aggregation of metrics in DataDog →Prometheus as cost-e ffi cient metric tool (open-source) @meroxadata
  • 48. How Meroxa leveraged Kafka to build a Stream Processing Application Platform Why Kafka? Scalable, robust and well-supported foundation for building modern data engineering software Creating scalable data infrastructure using Kafka Extensibility of our platform with Debezium, Kafka Connect and Conduit Internal and end-user observability with Kafka Connect and logging + metrics tooling @meroxadata