What’s New in the MariaDB AX Platform
Dipti Joshi
Director Product Management
MariaDB AX
Analytics made easy –
simple, fast, scalable…
and open source
MariaDB AX
MariaDB Server
MariaDB MaxScale
MariaDB ColumnStore
Parallel queries
Distributed storage
No indexes
Automatic partitioning
Read optimized
High compression
Low disk IO
[Diagram: MariaDB MaxScale in front of MariaDB Server / ColumnStore UM nodes, over ColumnStore PM nodes on distributed shared-nothing storage]
MariaDB AX
What was there
MariaDB ColumnStore 1.0
Manual import
Manual backup/restore
Window functions
Aggregate functions
User-defined functions
Cross-engine joins
[Diagram: Applications / Spark, MariaDB MaxScale, MariaDB Server (ColumnStore UM and InnoDB), ColumnStore PM]
Goals for next MariaDB AX
1. Expand high availability/disaster recovery options
2. Make it easier to perform custom, complex analytics
3. Streamline and simplify the process of ingesting data
MariaDB AX
What’s new
MariaDB ColumnStore 1.1
Streaming data adapters
Bulk data adapters
User-defined functions: window functions, distributed aggregates
Spark support: read via JDBC, publish via data adapters
High availability: local storage (GlusterFS), parallel backup/restore
[Diagram: Applications / Spark, MariaDB MaxScale, MariaDB Server (ColumnStore UM and InnoDB), ColumnStore PM]
What’s new in MariaDB AX
INGESTION: Applications, Apache Kafka, MariaDB MaxScale
ANALYTICS: User-defined aggregate and window functions
HA / DR: GlusterFS support, parallel backup/restore
DATA TYPES: Text, BLOB columns
SECURITY: Auditing
BI CERTIFICATION: Tableau
Extend high availability
and disaster recovery options
High availability for local storage
GlusterFS can replicate files within a volume - HA without the need for a SAN
ColumnStore storage nodes can read one another's files within a volume - simple, automatic failover
[Diagram: GlusterFS volume replication. Two MariaDB Server / ColumnStore UM nodes above ColumnStore PM 1 (dbroot1), PM 2 (dbroot2) and PM 3 (dbroot3). Storage labels, in order: /dbroot1, /dbroot2, /dbroot2, /dbroot3, /dbroot3, /dbroot1]
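The failover idea on this slide can be sketched in a few lines. This is a minimal illustration, not ColumnStore's actual failover logic. It assumes each dbroot has copies on two PMs, paired as the storage labels in the diagram suggest; the names `REPLICAS` and `assign_dbroots` and the host names `pm1` to `pm3` are hypothetical.

```python
# Hypothetical replica layout: which PMs hold a copy of each dbroot.
# The first PM in each list is the usual owner, the second the fallback.
# The pairing is an assumption read off the diagram labels.
REPLICAS = {
    "dbroot1": ["pm1", "pm3"],
    "dbroot2": ["pm2", "pm1"],
    "dbroot3": ["pm3", "pm2"],
}


def assign_dbroots(replicas, alive):
    """Map each dbroot to the first live PM that holds a copy of it.

    Raises RuntimeError if every PM holding a given dbroot is down.
    """
    assignment = {}
    for dbroot, holders in replicas.items():
        live = [pm for pm in holders if pm in alive]
        if not live:
            raise RuntimeError(f"no live replica for {dbroot}")
        assignment[dbroot] = live[0]
    return assignment


normal = assign_dbroots(REPLICAS, alive={"pm1", "pm2", "pm3"})
after_pm2_loss = assign_dbroots(REPLICAS, alive={"pm1", "pm3"})
print(normal)
print(after_pm2_loss)
```

With all three PMs up, each dbroot stays with its usual owner. When `pm2` is lost, `dbroot2` moves to `pm1`, which already holds a replicated copy, so no shared SAN is needed for the handover.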
Parallel Backup/Restore
Parallel backup/restore using rsync - faster backup and restore
Supports incremental backup and restore - faster backup and restore
Consolidates data from multiple storage nodes in a single backup location - simplified, automatic backups and restores
/home/user/columnstoreBackupData/pm1dbroot1
/home/user/columnstoreBackupData/pm2dbroot2
/home/user/columnstoreBackupData/pm3dbroot3
[Diagram: the backup and restore tool runs one rsync per storage node (/data1/*, /data2/*, /data3/*) against ColumnStore PM 1, PM 2 and PM 3, alongside two MariaDB Server / ColumnStore UM nodes]
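A rough sketch of the parallel pattern, assuming one rsync pull per PM into the backup directories listed on the slide. This is not the backup/restore tool itself: the host names `pm1` to `pm3`, the helper names and the choice of the `-a` flag are illustrative assumptions. The `run` parameter exists so the commands can be inspected without touching a real cluster.

```python
from concurrent.futures import ThreadPoolExecutor
import subprocess

BACKUP_ROOT = "/home/user/columnstoreBackupData"

# (PM host, dbroot number, source directory); host names are hypothetical.
PM_SOURCES = [
    ("pm1", 1, "/data1/"),
    ("pm2", 2, "/data2/"),
    ("pm3", 3, "/data3/"),
]


def rsync_command(host, dbroot, source_dir):
    """Build one rsync call that pulls a PM's data into its own backup directory."""
    dest = f"{BACKUP_ROOT}/{host}dbroot{dbroot}/"
    # -a preserves permissions and timestamps; rsync skips files that look
    # unchanged since the last run, which is what makes repeat runs incremental.
    return ["rsync", "-a", f"{host}:{source_dir}", dest]


def backup_all(sources, run=subprocess.run):
    """Run one rsync per PM concurrently and return the exit codes in order."""
    commands = [rsync_command(*src) for src in sources]
    with ThreadPoolExecutor(max_workers=len(commands)) as pool:
        results = list(pool.map(lambda cmd: run(cmd, check=False), commands))
    return [result.returncode for result in results]
```

Calling `backup_all(PM_SOURCES)` would start the three transfers at the same time, so total backup time is bounded by the slowest node rather than the sum of all nodes.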
Make it easier to perform
custom, complex analytics
User-defined distributed aggregate and window functions
User-defined distributed aggregate functions - custom analytical functions and better performance
User-defined window functions
Example: calculate a weighted sum (WSUM) of revenue, weighting each amount by its bracket
$1-10: 0.5
$11-100: 1.0
Over $100: 1.5
[Diagram: two MariaDB Server / ColumnStore UM nodes above three ColumnStore PM nodes; each PM shows its rows with a partial WSUM, and the partials add up to the total]

PM       Column -> WSUM                            Partial
First    $10 -> $5,  $100 -> $100, $200 -> $300    WSUM = $405
Second   $4 -> $2,   $8 -> $4,     $20 -> $20      WSUM = $26
Third    $12 -> $6,  $60 -> $60,   $300 -> $450    WSUM = $516

Total: WSUM = $947
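The two-phase shape of a distributed aggregate can be sketched as follows: each PM reduces its own rows to a partial result, and the UM combines the partials. This is a plain Python illustration, not the ColumnStore UDAF interface; `weight`, `partial_wsum` and `final_wsum` are hypothetical names. Note that applying the brackets exactly as stated gives the $12 row a weight of 1.0, so this sketch yields 522 for the third node and 953 overall, whereas the slide's figures of $516 and $947 count that row as $6.

```python
def weight(amount):
    """Bracket weights from the slide: $1-10 -> 0.5, $11-100 -> 1.0, above $100 -> 1.5."""
    if amount <= 10:
        return 0.5
    if amount <= 100:
        return 1.0
    return 1.5


def partial_wsum(rows):
    """Runs on one PM: weighted sum over that node's rows only."""
    return sum(amount * weight(amount) for amount in rows)


def final_wsum(partials):
    """Runs on the UM: combine the per-PM partial results."""
    return sum(partials)


# Row values per PM, taken from the slide.
pm_rows = [
    [10, 100, 200],
    [4, 8, 20],
    [12, 60, 300],
]
partials = [partial_wsum(rows) for rows in pm_rows]
total = final_wsum(partials)
print(partials, total)
```

Because a sum of partial sums equals the sum over all rows, only one number per PM has to travel to the UM, which is where the performance benefit of distributing the aggregate comes from.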
Streamline and simplify
the process of data ingestion
Motivation
Organizations need to make data available for analysis as soon as it arrives
Machine learning results need to be stored where other business/data analysts work with them
Time to insight and time to action are now competitive differentiators for businesses
Bulk data adapters
Applications can use the bulk data adapter SDK to collect and write data - on-demand data loading
No need to copy CSV files to a UM or PM - simpler
Bypasses the SQL interface, parser and optimizer - faster writes
C++
Python
Java
[Diagram: an application embedding the Bulk Data Adapter writes through the Write API on each ColumnStore PM, alongside two MariaDB Server / ColumnStore UM nodes]
1. For each row
   a. For each column: bulkInsert->setColumn
   b. bulkInsert->writeRow
2. bulkInsert->commit
* Buffers 100,000 rows by default
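The write loop above, together with the buffering note, can be illustrated with a small stand-in class. This is not the SDK: `SketchBulkInsert`, its `sink` callback and the `buffer_rows` parameter are hypothetical, and the assumption that a full buffer is sent before commit is a guess at what "buffer 100,000 rows by default" implies. Only the call order (set each column, write the row, commit at the end) comes from the slide.

```python
class SketchBulkInsert:
    """Hypothetical stand-in for a bulk insert handle; not the real SDK class."""

    def __init__(self, column_count, sink, buffer_rows=100_000):
        self.column_count = column_count
        self.sink = sink                # callable that receives a list of rows
        self.buffer_rows = buffer_rows  # rows held in memory before a flush
        self._row = [None] * column_count
        self._buffer = []

    def set_column(self, index, value):
        """Stage one column value for the row being built."""
        self._row[index] = value
        return self

    def write_row(self):
        """Finish the current row; flush when the buffer is full."""
        self._buffer.append(tuple(self._row))
        self._row = [None] * self.column_count
        if len(self._buffer) >= self.buffer_rows:
            self._flush()
        return self

    def commit(self):
        """Send whatever is still buffered."""
        self._flush()

    def _flush(self):
        if self._buffer:
            self.sink(self._buffer)
            self._buffer = []


batches = []
bulk = SketchBulkInsert(column_count=2, sink=batches.append, buffer_rows=2)
for row in [(1, "a"), (2, "b"), (3, "c")]:
    for index, value in enumerate(row):
        bulk.set_column(index, value)
    bulk.write_row()
bulk.commit()
print(batches)
```

The tiny `buffer_rows=2` makes the batching visible: the first two rows go out together when the buffer fills, and the third is sent by the final commit.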
Deep dive session: Ingesting Data with the New Bulk Data Adapters Today at 5 pm
Streaming data adapters – MaxScale CDC
Stream writes from MariaDB TX to MariaDB AX automatically and continuously - ensures analytical data is up to date, with no need for batch jobs, manual processes or human intervention
[Diagram: MariaDB Server (InnoDB) -> MariaDB MaxScale (Binlog-Avro CDC Router) -> Streaming Data Adapter (CDC Client) -> Write API on each ColumnStore PM, alongside two MariaDB Server / ColumnStore UM nodes]
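As a rough illustration of the CDC client's job, the sketch below turns a stream of JSON change records into rows that a bulk writer could load. It is not the adapter's code. The metadata field names and the `insert` / `update_before` event types are assumptions about the change-record format and should be checked against the stream actually received; `split_record` and `rows_to_apply` are hypothetical helpers, and only inserts are handled here.

```python
import json

# Metadata keys assumed to accompany each change record (an assumption).
METADATA_KEYS = {"domain", "server_id", "sequence", "event_number",
                 "timestamp", "event_type"}


def split_record(line):
    """Parse one JSON change record into (metadata, column values)."""
    record = json.loads(line)
    meta = {k: v for k, v in record.items() if k in METADATA_KEYS}
    data = {k: v for k, v in record.items() if k not in METADATA_KEYS}
    return meta, data


def rows_to_apply(lines, columns):
    """Keep inserted rows only, in arrival order, as tuples in column order."""
    rows = []
    for line in lines:
        meta, data = split_record(line)
        if meta.get("event_type") == "insert":
            rows.append(tuple(data.get(name) for name in columns))
    return rows


# Made-up sample records for illustration.
stream = [
    '{"domain": 0, "server_id": 1, "sequence": 7, "event_number": 1, '
    '"timestamp": 1519000000, "event_type": "insert", "id": 1, "amount": 10}',
    '{"domain": 0, "server_id": 1, "sequence": 8, "event_number": 1, '
    '"timestamp": 1519000001, "event_type": "update_before", "id": 1, "amount": 10}',
    '{"domain": 0, "server_id": 1, "sequence": 9, "event_number": 1, '
    '"timestamp": 1519000002, "event_type": "insert", "id": 2, "amount": 20}',
]
rows = rows_to_apply(stream, ["id", "amount"])
print(rows)
```

A continuous adapter would run this loop against a live connection and hand each batch of rows to the write API, which is what removes the need for scheduled batch jobs.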
Deep dive session: Real-time Analytics With The New Streaming Data Adapters
Tomorrow at 8:40 am
Streaming data adapters – Apache Kafka
Stream all messages published to Apache Kafka topics to MariaDB AX automatically and continuously - enables data from many sources to be streamed and collected for analysis without complex code
[Diagram: Apache Kafka topics -> Streaming Data Adapter (Kafka Client) -> Write API on each ColumnStore PM, alongside two MariaDB Server / ColumnStore UM nodes]
Deep dive session: Real-time Analytics With The New Streaming Data Adapters
Tomorrow at 8:40 am
The big picture – putting it all together
[Diagram with three areas: Operations, Ingestion and Analytics]
Operations, Transaction (OLTP): Web/Mobile Services, MariaDB MaxScale, MariaDB Server (InnoDB)
Ingestion: Apache Kafka with Streaming Data Adapters; Data Services with Bulk Data Adapters; Spark / Python / ML with Bulk Data Adapters
Analytics (OLAP): MariaDB MaxScale, MariaDB ColumnStore
Resources
Documentation: https://mariadb.com/kb/en/library/mariadb-columnstore/
Blogs: https://mariadb.com/blog-tags/columnstore and https://mariadb.com/blog-tags/big-data
Reach me: dipti.joshi@mariadb.com
Download
MariaDB ColumnStore 1.1: https://mariadb.com/downloads/mariadb-ax
MariaDB MaxScale: https://mariadb.com/downloads/mariadb-ax/maxscale
Bulk Data Adapters and Streaming Data Adapters: https://mariadb.com/downloads/mariadb-ax/data-adapters
MariaDB ColumnStore Backup/Restore Tool: https://mariadb.com/downloads/mariadb-ax/tools-ax
What’s new in MariaDB AX: summary
Complex, custom analytics: user-defined aggregate functions, user-defined window functions, text and binary columns
Spark integration: JDBC (SQL), direct (data adapter)
Improved HA/DR: GlusterFS support, parallel backup/restore
Streamlined data ingestion: streaming data adapters, bulk data adapters
Thank you!

M|18 Analyzing Data with the MariaDB AX Platform
