1Confidential
Leveraging
Mainframe Data
for Modern Analytics
Today’s Speakers
Jordan Martz, Director of Technology Solutions, Attunity
David Tucker, Director of Partner Engineering, Confluent
Keith Reid, Principal, Insights and Data: Client Engagement and Practice Leader, Capgemini
Agenda
• A quick history of the mainframe
• A quick history of data migration
• Attunity - data migration with CDC
• Confluent streaming platform powered by Apache Kafka™
• Putting it all together
• Use cases and Replicate demo
• Answering your questions
Image: © Mark Richards
History of the Mainframe
Big businesses with big needs required big
computers. Demand increased just as
“second generation” transistor-based computers
were replacing vacuum-tube machines in the late
1950s, spurring developments in hardware and
software. Manufacturers commonly built small
numbers of each model, targeting narrowly
defined markets.
Why are they called “Mainframes”?
Nobody knows for sure. There was no mainframe
“inventor” who coined the term.
Probably “main frame” originally referred to the
frames (designed for telephone switches) holding
processor circuits and main memory, separate
from racks or cabinets holding other components.
Over time, main frame became mainframe and
came to mean “big computer.”
Source: The Computer Museum
Source: © International Business Machines
Corporation (IBM), 1965
Death of the mainframe?
What Became of Mainframes?
“Mainframes will soon be extinct”, pundits have
announced regularly. Yet nobody told the
mainframes, which remain alive and well, the
backbone of world banking and other business
systems.
Reliable and secure, mainframes are seldom in
the limelight. But one probably approved your
last ATM withdrawal or reserved your last
airplane ticket.
Source: The Computer Museum
Source: © International Business
Machines Corporation (IBM), 2001
A quick history of data movement
[Diagram: Source → Batch → Data Warehouse (Source History)]
A quick history of data movement
[Diagram: Source → CDC → Data Warehouse (Source History)]
A quick history of data movement – the ODS
[Diagram: Source → CDC → Source ODS (latest view)]
This all changes with streaming / big data platforms (e.g., Kafka and Hadoop)
[Diagram: Source → CDC → Streaming Platform, with CEP, In-Memory Analytics (latest view and events), Source History, and a Data Lake (History: Point in Time, End of Day)]
So why does CDC work in a Big Data world?
Big Data likes volume and likes history
• Storage isn't an issue
• History helps machine learning
Re-creating any point in time is simple
• As simple as 8 lines of Scala code
Easiest way to get data without large system performance impacts
• Reduces concerns about data integration
Enables very rapid response to transactional events
• Fraud detection and even consumer response become much simpler
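The point-in-time claim above can be illustrated with a minimal sketch. It is written in Python rather than the Scala the slide mentions, and it is not the speakers' code. The change-event shape (timestamp, operation, key, row) and the operation codes "I", "U", and "D" are assumptions made for illustration, not the format of any particular CDC tool.

```python
from typing import Any, Dict, Iterable, Optional, Tuple

# Hypothetical change-event shape: (timestamp, operation, key, row).
# "I" = insert, "U" = update, "D" = delete; these codes are illustrative only.
ChangeEvent = Tuple[int, str, str, Optional[Dict[str, Any]]]


def state_as_of(events: Iterable[ChangeEvent], as_of: int) -> Dict[str, Dict[str, Any]]:
    """Replay change events in timestamp order, up to and including `as_of`."""
    table: Dict[str, Dict[str, Any]] = {}
    for ts, op, key, row in sorted(events, key=lambda e: e[0]):
        if ts > as_of:
            break  # everything after this point is "the future"
        if op == "D":
            table.pop(key, None)
        else:
            # Insert or update: keep the latest image of the row
            table[key] = dict(row or {})
    return table


history = [
    (1, "I", "acct-1", {"balance": 100}),
    (2, "I", "acct-2", {"balance": 50}),
    (3, "U", "acct-1", {"balance": 80}),
    (4, "D", "acct-2", None),
]

print(state_as_of(history, 2))
print(state_as_of(history, 4))
```

Because the full change history is retained, the same function yields an end-of-day snapshot or the latest view simply by choosing a different `as_of` value.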
© 2016 Attunity
Attunity Platform for Enterprise Data Management
• Attunity Replicate: Universal Data Availability (integrate new platforms)
• Attunity Compose: Data Warehouse Automation (automate ETL/EDW)
• Attunity Visibility: Metrics Driven Data Management (optimize performance and cost)
On Premises / Cloud: Hadoop, Files, RDBMS, EDW, SAP, Mainframe
Attunity Replicate
Attunity Replicate
No manual coding or scripting
Automated end-to-end
Optimized and configurable
Sources: Hadoop, Files, RDBMS, EDW, Mainframe
• Target schema creation
• Heterogeneous data type mapping
• Batch to CDC transition
• DDL change propagation
• Filtering
• Transformations
Targets: Hadoop, Files, RDBMS, EDW, Kafka
Data replication and ingest made easy
Zero-footprint Architecture
Lower impact on IT
• No software agents on sources and targets for mainstream databases
• Replicate data from hundreds of source systems with easy configuration
• No software upgrades required at each database source or target
Sources: Hadoop, Files, RDBMS, EDW, Mainframe
• Log based
• Source specific optimization
Targets: Hadoop, Files, RDBMS, EDW, Kafka
Heterogeneous – Broad support for sources and targets
Sources
• RDBMS: Oracle, SQL Server, DB2 LUW, DB2 iSeries, DB2 z/OS, MySQL, Sybase ASE, Informix
• Data Warehouse: Exadata, Teradata, Netezza, Vertica, Actian Vector, Actian Matrix
• Hadoop: Hortonworks, Cloudera, MapR, Pivotal
• Legacy: IMS/DB, SQL M/P, Enscribe, RMS, VSAM
• Cloud: AWS RDS, Salesforce
Targets
• RDBMS: Oracle, SQL Server, DB2 LUW, MySQL, PostgreSQL, Sybase ASE, Informix
• Data Warehouse: AWS Redshift, Azure SQL DW, Exadata, Teradata, Netezza, Vertica, Pivotal DB (Greenplum), Pivotal HAWQ, Actian Vector, Actian Matrix, Sybase IQ
• Hadoop: Hortonworks, Cloudera, MapR, Pivotal
• NoSQL: MongoDB
• Cloud: AWS RDS/Redshift/S3, Azure SQL Data Warehouse, Azure SQL Database, Google Cloud SQL, Google Cloud Dataproc
• Message Broker: Kafka
Effective: 12/10/2015
Real-time data migration of mainframe data
Confluent: Open source enterprise streaming built on Apache Kafka
[Diagram: Confluent Platform. Legend: Open Source, Commercial, External]
• Confluent Platform components: Control Center, Auto-data Balancing, Multi-Data Center Replication, 24/7 Support, Supported Connectors, Clients, Schema Registry, REST Proxy
• Apache Kafka: Kafka Core, Kafka Connect, Kafka Streams
• Connected systems: CRM, Data Warehouse, Database, Hadoop, Data Integration, Mainframe
• Applications: Monitoring, Analytics, Custom Apps, Transformations, Real-time Applications, …
• Data feeds: Database Changes, Log Events, IoT Data, Web Events, …
From Big Data to Stream Data
Apache Kafka is the Enabling Technology of this Transition
• Big Data was: The More the Better (chart: value of data vs. volume of data)
• Stream Data is: The Faster the Better (chart: value of data vs. age of data)
• Stream Data can be: Big or Fast (Lambda) (diagram: Streams and Hadoop feeding a Speed Table and a Batch Table in a DB)
• Stream Data will be: Big AND Fast (Kappa) (diagram: Streams with Job 1 and Job 2 feeding Table 1 and Table 2 in a DB)
Apache Kafka™ Connect
Effective Streaming Data Capture
Apache Kafka™ Connect – Streaming Data Capture
[Diagram: Sources (JDBC, Mongo, MySQL) → Connectors → Kafka Pipeline (Kafka Connect API) → Connectors → Sinks (Elastic, Cassandra, HDFS)]
• Fault tolerant
• Manage hundreds of data sources and sinks
• Preserves data schema
• Part of Apache Kafka project
• Integrated within Confluent Platform’s Control Center
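As a rough sketch of how a connector is typically set up, the snippet below builds the JSON body that the Kafka Connect REST API accepts on `POST /connectors`. The connector name, topic name, JDBC URL, and key column are hypothetical placeholders, and the configuration keys should be checked against the JDBC sink connector documentation for the version in use.

```python
import json
from typing import Dict


def connector_request(name: str, config: Dict[str, str]) -> str:
    """Build the JSON body for POST /connectors on the Kafka Connect REST API."""
    return json.dumps({"name": name, "config": config}, indent=2, sort_keys=True)


# Hypothetical JDBC sink: topic, URL, and key column below are placeholders.
jdbc_sink = {
    "connector.class": "io.confluent.connect.jdbc.JdbcSinkConnector",
    "tasks.max": "1",
    "topics": "mainframe.accounts",
    "connection.url": "jdbc:postgresql://reporting-db:5432/analytics",
    "insert.mode": "upsert",      # apply each change as an upsert on the key
    "pk.mode": "record_key",      # take the primary key from the Kafka record key
    "pk.fields": "account_id",
    "auto.create": "true",        # let the connector create the target table
}

body = connector_request("accounts-jdbc-sink", jdbc_sink)
print(body)
```

The body would then be sent to a Connect worker with any HTTP client; the request itself is left out so the sketch runs without a cluster.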
Kafka Connect Library of Connectors
Connector categories: Databases, Datastore/File Store, Analytics, Applications / Other
* Denotes connectors developed at Confluent and distributed with the Confluent Platform. Extensive validation and testing have been performed.
Apache Kafka™ Streams
Distributed Stream Processing Made Easy
Architecture of Kafka Streams, a Part of Apache Kafka
[Diagram: a Producer writes to Topics in the Kafka Cluster; Kafka Streams processes data between Topics; Consumers read from them]
Key benefits
• No additional cluster
• Easy to run as a service
• Supports large aggregations and joins
• Security and permissions fully integrated from Kafka
Example Use Cases
• Microservices
• Continuous queries
• Continuous transformations
• Event-triggered processes
Kafka Streams: the Easiest Way to Process Data in Apache Kafka™
Example Use Cases
• Microservices
• Large-scale continuous queries and transformations
• Event-triggered processes
• Reactive applications
• Customer 360-degree view, fraud detection, location-based marketing, smart electrical grids, fleet management, …
Key Benefits of Apache Kafka’s Streams API
• Build Apps, Not Clusters: no additional cluster required
• Elastic, highly-performant, distributed, fault-tolerant, secure
• Equally viable for small, medium, and large-scale use cases
• “Run Everywhere”: integrates with your existing deployment strategies such as containers, automation, cloud
[Diagram: Your App with Kafka Streams embedded]
Architecture Example
Before: Complexity for development and operations, heavy footprint
1. Capture business events in Kafka
2. Must process events with separate, special-purpose clusters (Your Processing Job)
3. Write results back to Kafka
Architecture Example
With Kafka Streams: App-centric architecture that blends well into your existing infrastructure
1. Capture business events in Kafka
2. Process events fast, reliably, securely with standard Java applications (Your App with Kafka Streams)
3a. Write results back to Kafka
3b. External apps can directly query the latest results
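Steps 3a and 3b can be pictured with a toy, in-process sketch. This is plain Python, not the Kafka Streams API (which is a Java library); the class name, the event shape, and the `output` list standing in for a results topic are all assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


class RunningTotals:
    """Toy stand-in for a stream processor with a queryable local state store."""

    def __init__(self) -> None:
        self._totals: Dict[str, float] = defaultdict(float)
        # Stands in for the results topic written in step 3a
        self.output: List[Tuple[str, float]] = []

    def process(self, events: Iterable[Tuple[str, float]]) -> None:
        for key, amount in events:
            self._totals[key] += amount
            # 3a: emit the updated aggregate downstream
            self.output.append((key, self._totals[key]))

    def query(self, key: str) -> float:
        # 3b: serve the latest result directly from local state
        return self._totals.get(key, 0.0)


app = RunningTotals()
app.process([("cust-1", 20.0), ("cust-2", 5.0), ("cust-1", 7.5)])
print(app.query("cust-1"))
```

The point of the design is that the application owns both the processing and the state, so no separate processing cluster or external lookup store is needed to answer "what is the latest value?".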
Putting it all together
CDC with Attunity on Confluent Enterprise
Back to the high-level platform integration …
[Diagram: Mainframe → CDC → Streaming Platform, with CEP, In-Memory Analytics (latest view and events), Source History, and a Data Lake (History: Point in Time, End of Day)]
… made real in Attunity / Confluent Data Flow
Topic Data Flow
• Attunity publishes DB changes to Kafka
• “Raw” connectors (e.g., FileSink or HDFS) persist change records where needed
• K-Streams app reads CDC topic and transforms (as necessary) for other data systems
• Sink connectors (JDBC or K-V as needed) persist that transformed data for other uses
[Diagram: Attunity Replicate (Producer) → Topics in the Kafka Cluster → Kafka Streams; Consumers: Raw Sink, and Sink → Data System]
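The transform step in this flow might look roughly like the sketch below. The change-record envelope (`op`, `key`, `ts`, `after`) and the field names are hypothetical and do not represent Attunity Replicate's actual message format; the renaming map is likewise an assumption.

```python
from typing import Any, Dict

# Hypothetical mapping from source column names to target field names
FIELD_MAP = {"CUST_NM": "customer_name", "BAL_AMT": "balance"}


def to_sink_record(change: Dict[str, Any]) -> Dict[str, Any]:
    """Flatten one change record into the row a sink connector would persist."""
    record: Dict[str, Any] = {
        "customer_id": change["key"],
        "changed_at": change["ts"],
        "deleted": change["op"] == "D",
    }
    # Deletes carry no "after" image, so only the key and marker are emitted
    for source_name, value in (change.get("after") or {}).items():
        target_name = FIELD_MAP.get(source_name, source_name.lower())
        record[target_name] = float(value) if target_name == "balance" else value
    return record


update = {"op": "U", "key": "0001", "ts": 1480000000,
          "after": {"CUST_NM": "ACME", "BAL_AMT": "125.50"}}
delete = {"op": "D", "key": "0001", "ts": 1480000100, "after": None}

print(to_sink_record(update))
print(to_sink_record(delete))
```

Keeping the raw change records in one topic and the transformed rows in another lets the raw sink and the downstream data system evolve independently.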
Use Cases
Query off-load
• Mainframe system accepts operational updates
• Attunity CDC publishes table updates to Kafka
• Certified Confluent Connectors replicate tables to other data systems for read-only queries
Business Value
Greater analytics flexibility at lower cost, without disrupting the operational system
Enhanced security
• Mainframe audit trails published to Kafka
• Syslog and other access events published to other topics
• Event correlation via LogStash or similar tools
Business Value
Enhanced threat detection and end-to-end workflow auditing
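The event-correlation idea can be sketched as a time-window match between two event feeds. This is a simplified illustration in plain Python, not how LogStash works; the event fields (`user`, `ts`, `action`, `source`) and the 60-second window are assumptions.

```python
from typing import Any, Dict, List, Tuple


def correlate(audit_events: List[Dict[str, Any]],
              access_events: List[Dict[str, Any]],
              window_seconds: int) -> List[Tuple[str, str]]:
    """Pair each audit event with access events by the same user within the window."""
    matches: List[Tuple[str, str]] = []
    for audit in audit_events:
        for access in access_events:
            same_user = audit["user"] == access["user"]
            close_in_time = abs(audit["ts"] - access["ts"]) <= window_seconds
            if same_user and close_in_time:
                matches.append((audit["action"], access["source"]))
    return matches


audit_trail = [
    {"user": "jdoe", "ts": 1000, "action": "UPDATE ACCOUNTS"},
    {"user": "asmith", "ts": 5000, "action": "READ PAYROLL"},
]
syslog = [
    {"user": "jdoe", "ts": 990, "source": "vpn-gateway"},
    {"user": "asmith", "ts": 100, "source": "web-portal"},
]

print(correlate(audit_trail, syslog, window_seconds=60))
```

The nested loop is fine for a sketch; a production correlator would index events by user and time rather than compare every pair.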
Cross-system integration
• K-Streams application joins customer data from the mainframe with customer-specific mobile information
• External applications use interactive queries to leverage up-to-the-second customer state
Business Value
Improved customer engagement, more efficient marketing spend
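The cross-system join can be sketched as a stream-table join: a table of the latest customer rows (as maintained from the mainframe CDC topic) enriches each incoming mobile event. This is plain Python for illustration, not the Kafka Streams API, and all field names and values are hypothetical.

```python
from typing import Any, Dict, Iterable, Iterator


def enrich(mobile_events: Iterable[Dict[str, Any]],
           customers: Dict[str, Dict[str, Any]]) -> Iterator[Dict[str, Any]]:
    """Stream-table join: attach the latest customer row to each mobile event."""
    for event in mobile_events:
        customer = customers.get(event["customer_id"])
        if customer is not None:  # inner join: skip events with no matching customer
            yield {**event, **customer}


# Latest customer state, keyed by customer id (hypothetical rows)
customer_table = {
    "c-1": {"name": "ACME", "segment": "gold"},
}
mobile_stream = [
    {"customer_id": "c-1", "event": "app_open"},
    {"customer_id": "c-2", "event": "app_open"},
]

enriched = list(enrich(mobile_stream, customer_table))
print(enriched)
```

Whether unmatched events are dropped (inner join, as here) or passed through with empty customer fields (left join) is a design choice that depends on the downstream use.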
Attunity Replicate Demo
Thanks!
Any questions?
References:
• http://discover.attunity.com/knowledge-brief-leveraging-mainframe-data-for-modern-analytics.html
• http://confluent.io/product/connectors
• https://www.capgemini.com/resources/video/transform-to-a-modern-data-landscape

Leveraging Mainframe Data for Modern Analytics

  • 2. 2Confidential Today’s Speakers Jordan Martz, Director of Technology Solutions, Attunity David Tucker, Director of Partner Engineering, Confluent Keith Reid, Principal, Insights and Data: Client Engagement and Practice Leader, Capgemini
  • 3. 3Confidential Agenda • A quick history of the mainframe • A quick history of data migration • Attunity - data migration with CDC • Confluent streaming platform powered by Apache KafkaTM • Putting it all together • Use cases and Replicate demo • Answering your questions Image: © Mark Richards
  • 4. 4Confidential History of the Mainframe Big businesses with big needs required big computers. Demands increased just when “second generation” transistor-based computers were replacing vacuum-tube machines in the late 1950s, spurred developments in hardware and software. Manufacturers commonly built small numbers of each model, targeting narrowly defined markets. Why are they called “Mainframes”? Nobody knows for sure. There was no mainframe “inventor” who coined the term. Probably “main frame” originally referred to the frames (designed for telephone switches) holding processor circuits and main memory, separate from racks or cabinets holding other components. Over time, main frame became mainframe and came to mean “big computer.” Source: The Computer Museum Source: © International Business Machines Corporation (IBM), 1965
  • 5. 5Confidential Death of the mainframe? What Became of Mainframes? “Mainframes will soon be extinct”, pundits have announced regularly. Yet nobody told the mainframes, which remain alive and well, the backbone of world banking and other business systems. Reliable and secure, mainframes are seldom in the limelight. But one probably approved your last ATM withdrawal or reserved your last airplane ticket. Source: The Computer Museum Source: © International Business Machines Corporation (IBM), 2001
  • 6. 6Confidential A quick history of data movement Source Data Warehouse Batch Source History
  • 7. 7Confidential A quick history of data movement Source Data Warehouse CDC Source History
  • 8. 8Confidential A quick history of data movement – the ODS Source CDC Source ODS (latest view)
  • 9. 9Confidential This all changes with streaming / big data platforms (eg Kafka and Hadoop) Source CDC Source History History In-Memory Analytics (latest view and events) Point in Time End of Day Data Lake Streaming Platform CEP
  • 10. 10Confidential So why does CDC work in a Big Data world? Big Data likes volume and likes history • Storage isn't an issue • History helps machine learning Re-creating any point in time is simple • 8 lines of Scala code simple Easiest way to get data without large system performance impacts • Reduces concerns on data integration Enables very rapid response to transactional events • Fraud detection and even consumer response becomes much simpler
  • 11. © 2016 Attunity Attunity Platform for Enterprise Data Management Attunity Replicate Attunity Compose Attunity Visibility Universal Data Availability Data Warehouse Automation Metrics Driven Data Management Integrate new platforms Automate ETL/EDW Optimize performance and cost On Premises / Cloud Hadoop FilesRDBMS EDW SAP Mainframe
  • 13. © 2016 Attunity Attunity Replicate No manual coding or scripting Automated end-to-end Optimized and configurable Hadoop Files RDBMS EDW Mainframe • Target schema creation • Heterogeneous data type mapping • Batch to CDC transition • DDL change propagation • Filtering • Transformations Hadoop Files RDBMS EDW Kafka
  • 14. © 2016 Attunity Data replication and ingest made easy
  • 15. © 2016 Attunity Zero-footprint Architecture Lower impact on IT • No software agents on sources and targets for mainstream databases • Replicate data from 100’s of source systems with easy configuration • No software upgrades required at each database source or target Hadoop Files RDBMS EDW Mainframe • Log based • Source specific optimization Hadoop Files RDBMS EDW Kafka
  • 16. © 2016 Attunity Heterogeneous – Broad support for sources and targets RDBMS Oracle SQL Server DB2 LUW DB2 iSeries DB2 z/OS MySQL Sybase ASE Informix Data Warehouse Exadata Teradata Netezza Vertica Actian Vector Actian Matrix Hortonworks Cloudera MapR Pivotal Hadoop IMS/DB SQL M/P Enscribe RMS VSAM Legacy AWS RDS Salesforce Cloud RDBMS Oracle SQL Server DB2 LUW MySQL PostgreSQL Sybase ASE Informix Data Warehouse AWS Redshift Azure SQL DW Exadata Teradata Netezza Vertica Pivotal DB (Greenplum) Pivotal HAWQ Actian Vector Actian Matrix Sybase IQ Hortonworks Cloudera MapR Pivotal Hadoop MongoDB NoSQL AWS RDS/Redshift/S3 Azure SQL Data Warehouse Azure SQL Database Google Cloud SQL Google Cloud Dataproc Cloud Effective: 12/10/2015 Kafka Message Broker targets sources
  • 17. © 2016 Attunity Real-time data migration of mainframe data
  • 18. 18Confidential Confluent: Open source enterprise streaming built on Apache Kafka Open Source ExternalCommercial Confluent Platform Monitoring Analytics Custom Apps Transformations Real-time Applications … CRM Data Warehouse Database Hadoop Data Integration Mainframe Control Center Auto-data Balancing Multi-Data Center Replication 24/7 Support Supported Connectors Clients Schema Registry REST Proxy Apache Kafka Kafka Connect Kafka Streams Kafka Core Database Changes Log Events loT Data Web Events …
  • 19. 19Confidential Stream Data is The Faster the Better Stream Data can be Big or Fast (Lambda) Stream Data will be Big AND Fast (Kappa) From Big Data to Stream Data Apache Kafka is the Enabling Technology of this Transition Big Data was The More the Better ValueofData Volume of Data ValueofData Age of Data Job 1 Job 2 Streams Table 1 Table 2 DB Speed Table Batch Table DB Streams Hadoop
  • 21. 21Confidential Apache KafkaTM Connect – Streaming Data Capture JDBC Mongo MySQL Elastic Cassandra HDFS Kafka Connect API Kafka Pipeline Connector Connector Connector Connector Connector Connector Sources Sinks Fault tolerant Manage hundreds of data sources and sinks Preserves data schema Part of Apache Kafka project Integrated within Confluent Platform’s Control Center
  • 22. Kafka Connect Library of Connectors. Connector categories: Databases, Datastore/File Store, Analytics, Applications / Other. * Denotes connectors developed at Confluent and distributed with the Confluent Platform; extensive validation and testing have been performed.
  • 24. Architecture of Kafka Streams, a Part of Apache Kafka
      • Key benefits
          • No additional cluster
          • Easy to run as a service
          • Supports large aggregations and joins
          • Security and permissions fully integrated from Kafka
      • Example use cases
          • Microservices
          • Continuous queries
          • Continuous transformations
          • Event-triggered processes
      • Diagram labels: Producer, Kafka Cluster (Topic, Topic, Topic), Kafka Streams, Consumer, Consumer
  • 25. Kafka Streams: the Easiest Way to Process Data in Apache Kafka™
      • Example use cases
          • Microservices
          • Large-scale continuous queries and transformations
          • Event-triggered processes
          • Reactive applications
          • Customer 360-degree view, fraud detection, location-based marketing, smart electrical grids, fleet management, …
      • Key benefits of Apache Kafka’s Streams API
          • Build Apps, Not Clusters: no additional cluster required
          • Elastic, highly performant, distributed, fault-tolerant, secure
          • Equally viable for small, medium, and large-scale use cases
          • “Run Everywhere”: integrates with your existing deployment strategies such as containers, automation, cloud
      • Diagram labels: Your App, Kafka Streams
  • 26. Architecture Example. Before: complexity for development and operations, heavy footprint
      • 1: Capture business events in Kafka
      • 2: Must process events with separate, special-purpose clusters
      • 3: Write results back to Kafka
      • Diagram label: Your Processing Job
  • 27. Architecture Example. With Kafka Streams: app-centric architecture that blends well into your existing infrastructure
      • 1: Capture business events in Kafka
      • 2: Process events fast, reliably, securely with standard Java applications
      • 3a: Write results back to Kafka
      • 3b: External apps can directly query the latest results
      • Diagram labels: Your App, Kafka Streams, App, App
  • 28. Putting it all together: CDC with Attunity on Confluent Enterprise
  • 29. Back to the high-level platform integration … Diagram labels: Mainframe, CDC Source, History, History, In-Memory Analytics (latest view and events), Point in Time, End of Day, Data Lake, Streaming Platform, CEP.
  • 30. … made real in Attunity / Confluent Data Flow
      • Data flow
          • Attunity publishes DB changes to Kafka.
          • “Raw” connectors (e.g., FileSink or HDFS) persist change records where needed.
          • K-Streams app reads CDC topic and transforms (as necessary) for other data systems.
          • Sink connectors (JDBC or K-V as needed) persist that transformed data for other uses.
      • Diagram labels: Attunity Replicate, Producer, Kafka Cluster (Topics), Kafka Streams, Consumer, Consumer, Raw Sink, Data System Sink
  • 31. Use Cases
      • Query off-load
          • Mainframe system accepts operational updates
          • Attunity CDC publishes table updates to Kafka
          • Certified Confluent Connectors replicate tables to other data systems for read-only queries
          • Business value: greater analytics flexibility at lower cost, without disrupting the operational system
      • Enhanced security
          • Mainframe audit trails published to Kafka
          • Syslog and other access events published to other topics
          • Event correlation via LogStash or similar tools
          • Business value: enhanced threat detection and end-to-end workflow auditing
      • Cross-system integration
          • K-Streams application joins customer data from the mainframe with customer-specific mobile information
          • External applications use interactive queries to leverage up-to-the-second customer state
          • Business value: improved customer engagement, more efficient marketing spend
  • 33. Thanks! Any questions? References:
      • http://discover.attunity.com/knowledge-brief-leveraging-mainframe-data-for-modern-analytics.html
      • http://confluent.io/product/connectors
      • https://www.capgemini.com/resources/video/transform-to-a-modern-data-landscape
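
The transform step in the slide 30 data flow (a streams application reads the CDC topic and reshapes change records for other data systems) can be illustrated with a minimal sketch. It uses plain Python with no Kafka client, so it only models the logic, not the Kafka Streams API itself. The change-record shape (`table`, `op`, `key`, `after`), the sample data, and the function names are all hypothetical; they are assumptions made for illustration and are not the actual message format that Attunity Replicate publishes.

```python
import json
from typing import Dict, Iterable, Optional, Tuple

# Hypothetical change records, as they might arrive on a CDC topic.
# This shape is an assumption, NOT the real Attunity Replicate format.
SAMPLE_CHANGES = [
    '{"table": "CUSTOMER", "op": "INSERT", "key": "C001", "after": {"name": "Ann", "balance": 100}}',
    '{"table": "CUSTOMER", "op": "UPDATE", "key": "C001", "after": {"name": "Ann", "balance": 250}}',
    '{"table": "CUSTOMER", "op": "INSERT", "key": "C002", "after": {"name": "Bob", "balance": 75}}',
    '{"table": "CUSTOMER", "op": "DELETE", "key": "C002", "after": null}',
]


def transform(raw: str) -> Tuple[str, Optional[dict]]:
    """Parse one change record and return (key, flat_row).

    flat_row is None for a delete, so a downstream sink can remove the row.
    """
    change = json.loads(raw)
    if change["op"] == "DELETE":
        return change["key"], None
    # Flatten the record into the row layout a sink might expect.
    row = {"customer_id": change["key"], **change["after"]}
    return change["key"], row


def apply_changes(raw_changes: Iterable[str]) -> Dict[str, dict]:
    """Replay changes in order, keeping only the latest row per key."""
    latest: Dict[str, dict] = {}
    for raw in raw_changes:
        key, row = transform(raw)
        if row is None:
            latest.pop(key, None)  # a delete removes the key from the view
        else:
            latest[key] = row      # an insert or update overwrites the row
    return latest


latest_view = apply_changes(SAMPLE_CHANGES)
print(latest_view)
```

Replaying the changes in order yields a "latest view" keyed by customer, which is the kind of state that the interactive queries mentioned on slides 27 and 31 would read. In a real deployment this logic would live in a stream processing application and the results would be written back to Kafka for sink connectors to persist.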