SlideShare a Scribd company logo
—
Kafka & SingleStore: Better Together
to Power Modern Real-Time Data
Architecture
Rohit Reddy
Principal Solutions Engineer
SingleStore
2
Increasing Focus on Cloud and Real-Time Analytics
90% 75% 30%
By 2022,
public cloud services
will be essential for
90% of data and
analytics innovation
By 2022, 75% of all
workloads will move
to Hybrid-Cloud
By 2025, nearly 30%
of all data generated
will be real-time
Gartner Top 10 Trends in Data
and Analytics for 2020
McKinsey - Unlocking business acceleration in a
hybrid cloud world, Aug 2019
IDC - Data Age 2025
3
—
Traditional Data Architecture
System of Record
System of Engagement
Data Warehouse
ODS
ETL
CDC
Reporting
Visualization
Batch Engine
Real-Time Engine
Data Lake
4
—
Modern Real-Time Data Architecture
System of Record System of Engagement
Smart Apps
System of Insight
System of Intelligence
CDC
5
The Unified Database for Fast Analytics
Data
Warehouse
Operational
Database
Transactional Workloads
Fast Queries | Large Data Size
Aggregation
Fast Lookup | High Concurrency
Simplifies the support of diverse workloads by
reducing operational complexity
Analytical Workloads
6
Ultra-Fast Ingest
Parallel, high-scale
streaming data Ingest
Super-Low Latency High Concurrency
Blazing Fast
Queries
Unparalleled
Scalability
Billions of events/sec for
immediate availability
Sub-second latencies with
immediate consistency
Millions of real-time queries across
tens of thousands of users
Fast Analytics
Fast analytics on dynamic data
for complex analytical queries
SingleStore - Key Capabilities for Fast Analytics
7
—
Kafka & SingleStore Meet Demands of Operational Analytics
Real-Time
Millions of records per second
Consistent
Exactly-once semantic
Distributed, Fault Tolerant
Parallel ingest
Developer-Friendly
Pub-Sub & SQL
8
—
Anatomy of SingleStore Pipelines Sequence
SingleStore pulls for
changes from a data
source system.
SingleStore pulls the data into its
memory space (no commit) where a
transform can be applied.
The data is committed in a
transaction (and in parallel)
Pipelines
Kafka
SingleStore
Data can be directly inserted
into tables or pre-processed
by a stored procedure
Write to Kafka
9
SingleStore Pipelines Creation
10M
UPSERTS PER SECOND
WITH KAFKA + SINGLESTORE
CREATE OR REPLACE PIPELINE load_trade_data
AS LOAD DATA KAFKA 'hostname:9092/trades'
WITH TRANSFORM ('score_data.py','','') -- optional
INTO TABLE live_predictions -- directly into tables
INTO PROCEDURE trade_proc -- via a stored procedure
FIELDS TERMINATED BY ',';
START PIPELINE load_trade_data;
;
—
10
—
SingleStore Kafka Pipelines
SingleStore Cluster
11
—
SingleStore Transforms
● Build transforms using any language!
● Transforms are an optional user-defined
program that receives data from a pipeline’s
extractor and outputs modified data (JSON,
Avro, CSV)
○ Examples: Data modification,
aggregation, feature engineering,
model execution, and more!
● Linux distribution must have the required
dependencies to execute the transform
● Data streamed to the transform is
byte-length encoded
Stream Transform Load
12
—
SingleStore Stored Procedures
DELIMITER //
CREATE OR REPLACE PROCEDURE tweets_proc(batch QUERY(tweet JSON))
AS
BEGIN
INSERT IGNORE INTO tweets(tweet_id, tweet_user_id, tweet_text)
SELECT tweet::tweet_id, tweet::tweet_user_id, tweet::tweet_text
FROM batch;
INSERT INTO retweets_counter(user_id, num_retweets)
SELECT tweet::retweet_user_id, 1
FROM batch
WHERE tweet::retweet_user_id IS NOT NULL
ON DUPLICATE KEY UPDATE num_retweets = num_retweets + 1;
END //
DELIMITER ;
;
● Preprocess incoming data: cleansing,
aggregation, filtering…
● Dispatch to multiple tables
● Cross-reference with dimension tables
● Integrity check
● Push to Kafka
13
—
SingleStore Push to Kafka
● Allows users to leverage SingleStore as a true Operational Data Hub with downstream
decisioning
● “SELECT … INTO KAFKA …” runs a SELECT query, constructs Kafka message for each
row in the result set, and publishes the messages to a Kafka topic
● Includes every column value in the result set’s row and separates the column values
by a delimiter
● Configure security credentials within the statement easily
SELECT col1, col2, col3 FROM t
ORDER BY col1
INTO KAFKA 'host.example.com:9092/test-topic'
FIELDS TERMINATED BY ',' ENCLOSED BY '"' ESCAPED BY "t"
LINES TERMINATED BY '}' STARTING BY '{';
14
—
SingleStore Confluent Kafka Connector
● SingleStore Kafka Connect Connector on the
Confluent Hub
● Integration with Confluent Kafka Connect to stream
data into SingleStore
● Management and deployment capabilities of Confluent
make this incredibly easy to get started
● Cloud-first: Kafka Connector sits Kafka-side,
eliminating many potential security constraints
5X
THAN JDBC CONNECTOR
FASTER
Real-time fraud analytics
for Credit card swipes in
less than 50ms.
Real-time geospatial
insights with massive
concurrency to
manage 24/7
operations
300K
Events per
second
Streaming analytics to
drive proactive care and
real-time
recommendations
IoT Analytics ingesting
and analyzing data from
over 1.2 Million smart
meters
13x data growth
moving from batch to
near-real time
visibility and analytics
3500+
Users
1.2M
Smart meters
analyzed
10M
Upserts per
second
Tier-1 US
Bank
50ms
Real-time Fraud
Detection
Top Energy
Company
SingleStore is the Unified Database
for Fast Analytics on Any Data, Anywhere
17
Learn Your Way
—
Get Started with
$500 in Free Credits
Today
Go to
singlestore.com/managed-service-trial
● Learn by Reading
○ docs.singlestore.com
● Learn by Engaging Peers
○ singlestore.com/forum
● Learn by Watching
○ youtube.com/singlestore
● Learn through Training
○ training.singlestore.com
—
18
Thank You
Sales
Please fill out the form if you need to learn
more.
For immediate sales help, call us at
1-855-463-6775 or email us at
team@singlestore.com.
Enterprise Edition Support
Are you encountering an issue and
have an enterprise support?
Submit a support request.
U.S. Office Locations
San Francisco (HQ)
534 Fourth Street
San Francisco, CA 94107
Seattle
96 Union Street
Seattle, WA 98101
Portland
700 SW Fifth Ave
Portland, OR 97204

More Related Content

What's hot (20)

PDF
Apache Iceberg - A Table Format for Hige Analytic Datasets
Alluxio, Inc.
 
PDF
Achieving Lakehouse Models with Spark 3.0
Databricks
 
PDF
Time to Talk about Data Mesh
LibbySchulze
 
PDF
Introducing Databricks Delta
Databricks
 
PPTX
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
confluent
 
PPTX
Search and analyze your data with elasticsearch
Anton Udovychenko
 
PPTX
APACHE KAFKA / Kafka Connect / Kafka Streams
Ketan Gote
 
PPTX
Azure Synapse Analytics Overview (r2)
James Serra
 
PDF
PySpark Programming | PySpark Concepts with Hands-On | PySpark Training | Edu...
Edureka!
 
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
PPTX
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
HostedbyConfluent
 
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
PDF
Run Apache Spark on Kubernetes in Large Scale_ Challenges and Solutions-2.pdf
Anya Bida
 
PDF
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks
 
PPTX
Scaling Data Quality @ Netflix
Michelle Ufford
 
PDF
Speeding Time to Insight with a Modern ELT Approach
Databricks
 
PPTX
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
DataWorks Summit
 
PDF
Data Pipline Observability meetup
Omid Vahdaty
 
PPTX
TechEvent Databricks on Azure
Trivadis
 
PPTX
Apache NiFi in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
 
Apache Iceberg - A Table Format for Hige Analytic Datasets
Alluxio, Inc.
 
Achieving Lakehouse Models with Spark 3.0
Databricks
 
Time to Talk about Data Mesh
LibbySchulze
 
Introducing Databricks Delta
Databricks
 
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
confluent
 
Search and analyze your data with elasticsearch
Anton Udovychenko
 
APACHE KAFKA / Kafka Connect / Kafka Streams
Ketan Gote
 
Azure Synapse Analytics Overview (r2)
James Serra
 
PySpark Programming | PySpark Concepts with Hands-On | PySpark Training | Edu...
Edureka!
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Databricks
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
HostedbyConfluent
 
Data Lakehouse Symposium | Day 1 | Part 2
Databricks
 
Run Apache Spark on Kubernetes in Large Scale_ Challenges and Solutions-2.pdf
Anya Bida
 
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks
 
Scaling Data Quality @ Netflix
Michelle Ufford
 
Speeding Time to Insight with a Modern ELT Approach
Databricks
 
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
DataWorks Summit
 
Data Pipline Observability meetup
Omid Vahdaty
 
TechEvent Databricks on Azure
Trivadis
 
Apache NiFi in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
 

Similar to SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architecture | Rohit Reddy, SingleStore (20)

PPTX
Streaming Data Ingest and Processing with Apache Kafka
Attunity
 
PDF
Citi Tech Talk: Hybrid Cloud
confluent
 
PDF
Streaming Time Series Data With Kenny Gorman and Elena Cuevas | Current 2022
HostedbyConfluent
 
PDF
Building Event Streaming Architectures on Scylla and Kafka
ScyllaDB
 
PPTX
Webinar: Data Streaming with Apache Kafka & MongoDB
MongoDB
 
PPTX
Data Streaming with Apache Kafka & MongoDB - EMEA
Andrew Morgan
 
PDF
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
confluent
 
PDF
Leveraging Mainframe Data for Modern Analytics
confluent
 
PDF
Stream Processing with Flink and Stream Sharing
confluent
 
PDF
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
HostedbyConfluent
 
PPTX
Streaming Data and Stream Processing with Apache Kafka
confluent
 
PDF
Reinventing Kafka in the Data Streaming Era - Jun Rao
confluent
 
PDF
The Never Landing Stream with HTAP and Streaming
Timothy Spann
 
PDF
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Lviv Startup Club
 
PDF
Building scalable data with kafka and spark
babatunde ekemode
 
PPTX
IIoT with Kafka and Machine Learning for Supply Chain Optimization In Real Ti...
Kai Wähner
 
PPTX
Apache Kafka® + Machine Learning for Supply Chain 
confluent
 
PDF
Devoxx university - Kafka de haut en bas
Florent Ramiere
 
PDF
How Kafka and MemSQL Became the Dynamic Duo (Sarung Tripathi, MemSQL) Kafka S...
HostedbyConfluent
 
PDF
Cloud-Native Patterns for Data-Intensive Applications
VMware Tanzu
 
Streaming Data Ingest and Processing with Apache Kafka
Attunity
 
Citi Tech Talk: Hybrid Cloud
confluent
 
Streaming Time Series Data With Kenny Gorman and Elena Cuevas | Current 2022
HostedbyConfluent
 
Building Event Streaming Architectures on Scylla and Kafka
ScyllaDB
 
Webinar: Data Streaming with Apache Kafka & MongoDB
MongoDB
 
Data Streaming with Apache Kafka & MongoDB - EMEA
Andrew Morgan
 
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
confluent
 
Leveraging Mainframe Data for Modern Analytics
confluent
 
Stream Processing with Flink and Stream Sharing
confluent
 
Qlik and Confluent Success Stories with Kafka - How Generali and Skechers Kee...
HostedbyConfluent
 
Streaming Data and Stream Processing with Apache Kafka
confluent
 
Reinventing Kafka in the Data Streaming Era - Jun Rao
confluent
 
The Never Landing Stream with HTAP and Streaming
Timothy Spann
 
Serhii Kholodniuk: What you need to know, before migrating data platform to G...
Lviv Startup Club
 
Building scalable data with kafka and spark
babatunde ekemode
 
IIoT with Kafka and Machine Learning for Supply Chain Optimization In Real Ti...
Kai Wähner
 
Apache Kafka® + Machine Learning for Supply Chain 
confluent
 
Devoxx university - Kafka de haut en bas
Florent Ramiere
 
How Kafka and MemSQL Became the Dynamic Duo (Sarung Tripathi, MemSQL) Kafka S...
HostedbyConfluent
 
Cloud-Native Patterns for Data-Intensive Applications
VMware Tanzu
 
Ad

More from HostedbyConfluent (20)

PDF
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
 
PDF
Renaming a Kafka Topic | Kafka Summit London
HostedbyConfluent
 
PDF
Evolution of NRT Data Ingestion Pipeline at Trendyol
HostedbyConfluent
 
PDF
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
HostedbyConfluent
 
PDF
Exactly-once Stream Processing with Arroyo and Kafka
HostedbyConfluent
 
PDF
Fish Plays Pokemon | Kafka Summit London
HostedbyConfluent
 
PDF
Tiered Storage 101 | Kafla Summit London
HostedbyConfluent
 
PDF
Building a Self-Service Stream Processing Portal: How And Why
HostedbyConfluent
 
PDF
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
HostedbyConfluent
 
PDF
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
HostedbyConfluent
 
PDF
Navigating Private Network Connectivity Options for Kafka Clusters
HostedbyConfluent
 
PDF
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
HostedbyConfluent
 
PDF
Explaining How Real-Time GenAI Works in a Noisy Pub
HostedbyConfluent
 
PDF
TL;DR Kafka Metrics | Kafka Summit London
HostedbyConfluent
 
PDF
A Window Into Your Kafka Streams Tasks | KSL
HostedbyConfluent
 
PDF
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
HostedbyConfluent
 
PDF
Data Contracts Management: Schema Registry and Beyond
HostedbyConfluent
 
PDF
Code-First Approach: Crafting Efficient Flink Apps
HostedbyConfluent
 
PDF
Debezium vs. the World: An Overview of the CDC Ecosystem
HostedbyConfluent
 
PDF
Beyond Tiered Storage: Serverless Kafka with No Local Disks
HostedbyConfluent
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
HostedbyConfluent
 
Renaming a Kafka Topic | Kafka Summit London
HostedbyConfluent
 
Evolution of NRT Data Ingestion Pipeline at Trendyol
HostedbyConfluent
 
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
HostedbyConfluent
 
Exactly-once Stream Processing with Arroyo and Kafka
HostedbyConfluent
 
Fish Plays Pokemon | Kafka Summit London
HostedbyConfluent
 
Tiered Storage 101 | Kafla Summit London
HostedbyConfluent
 
Building a Self-Service Stream Processing Portal: How And Why
HostedbyConfluent
 
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
HostedbyConfluent
 
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
HostedbyConfluent
 
Navigating Private Network Connectivity Options for Kafka Clusters
HostedbyConfluent
 
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
HostedbyConfluent
 
Explaining How Real-Time GenAI Works in a Noisy Pub
HostedbyConfluent
 
TL;DR Kafka Metrics | Kafka Summit London
HostedbyConfluent
 
A Window Into Your Kafka Streams Tasks | KSL
HostedbyConfluent
 
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
HostedbyConfluent
 
Data Contracts Management: Schema Registry and Beyond
HostedbyConfluent
 
Code-First Approach: Crafting Efficient Flink Apps
HostedbyConfluent
 
Debezium vs. the World: An Overview of the CDC Ecosystem
HostedbyConfluent
 
Beyond Tiered Storage: Serverless Kafka with No Local Disks
HostedbyConfluent
 
Ad

Recently uploaded (20)

PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PDF
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
DOCX
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PDF
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PDF
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
AI Agents in the Cloud: The Rise of Agentic Cloud Architecture
Lilly Gracia
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
NASA A Researcher’s Guide to International Space Station : Physical Sciences ...
Dr. PANKAJ DHUSSA
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
Future-Proof or Fall Behind? 10 Tech Trends You Can’t Afford to Ignore in 2025
DIGITALCONFEX
 
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
Safe Software
 

SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architecture | Rohit Reddy, SingleStore

  • 1. — Kafka & SingleStore: Better Together to Power Modern Real-Time Data Architecture Rohit Reddy Principal Solutions Engineer SingleStore
  • 2. 2 Increasing Focus on Cloud and Real-Time Analytics 90% 75% 30% By 2022, public cloud services will be essential for 90% of data and analytics innovation By 2022, 75% of all workloads will move to Hybrid-Cloud By 2025, nearly 30% of all data generated will be real-time Gartner Top 10 Trends in Data and Analytics for 2020 McKinsey - Unlocking business acceleration in a hybrid cloud world, Aug 2019 IDC - Data Age 2025
  • 3. 3 — Traditional Data Architecture System of Record System of Engagement Data Warehouse ODS ETL CDC Reporting Visualization Batch Engine Real-Time Engine Data Lake
  • 4. 4 — Modern Real-Time Data Architecture System of Record System of Engagement Smart Apps System of Insight System of Intelligence CDC
  • 5. 5 The Unified Database for Fast Analytics Data Warehouse Operational Database Transactional Workloads Fast Queries | Large Data Size Aggregation Fast Lookup | High Concurrency Simplifies the support of diverse workloads by reducing operational complexity Analytical Workloads
  • 6. 6 Ultra-Fast Ingest Parallel, high-scale streaming data Ingest Super-Low Latency High Concurrency Blazing Fast Queries Unparalleled Scalability Billions of events/sec for immediate availability Sub-second latencies with immediate consistency Millions of real-time queries across tens of thousands of users Fast Analytics Fast analytics on dynamic data for complex analytical queries SingleStore - Key Capabilities for Fast Analytics
  • 7. 7 — Kafka & SingleStore Meet Demands of Operational Analytics Real-Time Millions of records per second Consistent Exactly-once semantic Distributed, Fault Tolerant Parallel ingest Developer-Friendly Pub-Sub & SQL
  • 8. 8 — Anatomy of SingleStore Pipelines Sequence SingleStore pulls for changes from a data source system. SingleStore pulls the data into its memory space (no commit) where a transform can be applied. The data is committed in a transaction (and in parallel) Pipelines Kafka SingleStore Data can be directly inserted into tables or pre-processed by a stored procedure Write to Kafka
  • 9. 9 SingleStore Pipelines Creation 10M UPSERTS PER SECOND WITH KAFKA + SINGLESTORE CREATE OR REPLACE PIPELINE load_trade_data AS LOAD DATA KAFKA 'hostname:9092/trades' WITH TRANSFORM ('score_data.py','','') -- optional INTO TABLE live_predictions -- directly into tables INTO PROCEDURE trade_proc -- via a stored procedure FIELDS TERMINATED BY ','; START PIPELINE load_trade_data; ; —
  • 11. 11 — SingleStore Transforms ● Build transforms using any language! ● Transforms are an optional user-defined program that receives data from a pipeline’s extractor and outputs modified data (JSON, Avro, CSV) ○ Examples: Data modification, aggregation, feature engineering, model execution, and more! ● Linux distribution must have the required dependencies to execute the transform ● Data streamed to the transform is byte-length encoded Stream Transform Load
  • 12. 12 — SingleStore Stored Procedures DELIMITER // CREATE OR REPLACE PROCEDURE tweets_proc(batch QUERY(tweet JSON)) AS BEGIN INSERT IGNORE INTO tweets(tweet_id, tweet_user_id, tweet_text) SELECT tweet::tweet_id, tweet::tweet_user_id, tweet::tweet_text FROM batch; INSERT INTO retweets_counter(user_id, num_retweets) SELECT tweet::retweet_user_id, 1 FROM batch WHERE tweet::retweet_user_id IS NOT NULL ON DUPLICATE KEY UPDATE num_retweets = num_retweets + 1; END // DELIMITER ; ; ● Preprocess incoming data: cleansing, aggregation, filtering… ● Dispatch to multiple tables ● Cross-reference with dimension tables ● Integrity check ● Push to Kafka
  • 13. 13 — SingleStore Push to Kafka ● Allows users to leverage SingleStore as a true Operational Data Hub with downstream decisioning ● “SELECT … INTO KAFKA …” runs a SELECT query, constructs Kafka message for each row in the result set, and publishes the messages to a Kafka topic ● Includes every column value in the result set’s row and separates the column values by a delimiter ● Configure security credentials within the statement easily SELECT col1, col2, col3 FROM t ORDER BY col1 INTO KAFKA 'host.example.com:9092/test-topic' FIELDS TERMINATED BY ',' ENCLOSED BY '"' ESCAPED BY "t" LINES TERMINATED BY '}' STARTING BY '{';
  • 14. 14 — SingleStore Confluent Kafka Connector ● SingleStore Kafka Connect Connector on the Confluent Hub ● Integration with Confluent Kafka Connect to stream data into SingleStore ● Management and deployment capabilities of Confluent make this incredibly easy to get started ● Cloud-first: Kafka Connector sits Kafka-side, eliminating many potential security constraints 5X THAN JDBC CONNECTOR FASTER
  • 15. Real-time fraud analytics for Credit card swipes in less than 50ms. Real-time geospatial insights with massive concurrency to manage 24/7 operations 300K Events per second Streaming analytics to drive proactive care and real-time recommendations IoT Analytics ingesting and analyzing data from over 1.2 Million smart meters 13x data growth moving from batch to near-real time visibility and analytics 3500+ Users 1.2M Smart meters analyzed 10M Upserts per second Tier-1 US Bank 50ms Real-time Fraud Detection Top Energy Company
  • 16. SingleStore is the Unified Database for Fast Analytics on Any Data, Anywhere
  • 17. 17 Learn Your Way — Get Started with $500 in Free Credits Today Go to singlestore.com/managed-service-trial ● Learn by Reading ○ docs.singlestore.com ● Learn by Engaging Peers ○ singlestore.com/forum ● Learn by Watching ○ youtube.com/singlestore ● Learn through Training ○ training.singlestore.com
  • 18. — 18 Thank You Sales Please fill out the form if you need to learn more. For immediate sales help, call us at 1-855-463-6775 or email us at [email protected]. Enterprise Edition Support Are you encountering an issue and have an enterprise support? Submit a support request. U.S. Office Locations San Francisco (HQ) 534 Fourth Street San Francisco, CA 94107 Seattle 96 Union Street Seattle, WA 98101 Portland 700 SW Fifth Ave Portland, OR 97204