SlideShare a Scribd company logo
Spring Cloud Data Flow + Geode
Sabby Anandan | Product Manager | @sabbyanandan
Stream Batch
Spring Cloud Data Flow
Spring Cloud Stream Spring Cloud Task
Shell; DSL; REST-
APIs
Drag & Drop UI Security
OOTB
Connectors
Reactive Data Science
Dataflow
Server
Admin / Flo UI
Shell
CURL
??X
Stream/Task Spring Boot Apps
YARN
Why are we here?
Data Pipelines requiring:
• Low latency and in-memory processing
• High Throughput SLAs
• Correlation between reference-data and data-
in-flight
• Frequent data-shuffling
http | transform | log
| = ?
http | transform | log
| = Binder
Binders
Region Data Buckets
http | transform | log
Geode Cluster
transform-processor.jar
PARTITION
transform-processor.jar
PARTITION
log-sink.jar
PARTITION
log-sink.jar
PARTITION
http-source.jar
PARTITION_PROXY
http-source.jar
PARTITION_PROXY
What’s next?
•+/- scaling and automatic re-partitioning
•Stream / Task metadata-repository
•Key-Value store for OOTB Counters
•Partition level local-state and SQL-like
stream processing
11
Join the Apache Geode Community!
• Check out http://geode.incubator.apache.org
• Subscribe: user-subscribe@geode.incubator.apache.org
• Download: http://geode.incubator.apache.org/releases/

More Related Content

What's hot (20)

PDF
#GeodeSummit - Redis to Geode Adaptor
PivotalOpenSourceHub
 
PDF
Spark on Mesos
Jen Aman
 
PPTX
Operationalizing YARN based Hadoop Clusters in the Cloud
DataWorks Summit/Hadoop Summit
 
PPTX
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
Yahoo Developer Network
 
PDF
Presto on Apache Spark: A Tale of Two Computation Engines
Databricks
 
PDF
Operational Tips for Deploying Spark by Miklos Christine
Spark Summit
 
PPTX
Building Efficient Pipelines in Apache Spark
Jeremy Beard
 
PPTX
Building Effective Near-Real-Time Analytics with Spark Streaming and Kudu
Jeremy Beard
 
PPTX
Simplified Cluster Operation & Troubleshooting
DataWorks Summit/Hadoop Summit
 
PPTX
HDInsight for Architects
Ashish Thapliyal
 
PPTX
Achieve big data analytic platform with lambda architecture on cloud
Scott Miao
 
PPTX
5 Apache Spark Tips in 5 Minutes
Cloudera, Inc.
 
PDF
IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
In-Memory Computing Summit
 
PPTX
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
DataWorks Summit
 
PDF
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
HBaseCon
 
PPTX
Apache sqoop with an use case
Davin Abraham
 
PDF
Getting Started with Apache Cassandra and Apache Zeppelin (DuyHai DOAN, DataS...
DataStax
 
PPTX
Apache Geode Offheap Storage
PivotalOpenSourceHub
 
PDF
Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi
Databricks
 
PDF
Spark Uber Development Kit
DataWorks Summit/Hadoop Summit
 
#GeodeSummit - Redis to Geode Adaptor
PivotalOpenSourceHub
 
Spark on Mesos
Jen Aman
 
Operationalizing YARN based Hadoop Clusters in the Cloud
DataWorks Summit/Hadoop Summit
 
October 2016 HUG: Architecture of an Open Source RDBMS powered by HBase and ...
Yahoo Developer Network
 
Presto on Apache Spark: A Tale of Two Computation Engines
Databricks
 
Operational Tips for Deploying Spark by Miklos Christine
Spark Summit
 
Building Efficient Pipelines in Apache Spark
Jeremy Beard
 
Building Effective Near-Real-Time Analytics with Spark Streaming and Kudu
Jeremy Beard
 
Simplified Cluster Operation & Troubleshooting
DataWorks Summit/Hadoop Summit
 
HDInsight for Architects
Ashish Thapliyal
 
Achieve big data analytic platform with lambda architecture on cloud
Scott Miao
 
5 Apache Spark Tips in 5 Minutes
Cloudera, Inc.
 
IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
In-Memory Computing Summit
 
Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters
DataWorks Summit
 
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
HBaseCon
 
Apache sqoop with an use case
Davin Abraham
 
Getting Started with Apache Cassandra and Apache Zeppelin (DuyHai DOAN, DataS...
DataStax
 
Apache Geode Offheap Storage
PivotalOpenSourceHub
 
Apache Kylin: Speed Up Cubing with Apache Spark with Luke Han and Shaofeng Shi
Databricks
 
Spark Uber Development Kit
DataWorks Summit/Hadoop Summit
 

Viewers also liked (19)

PDF
#GeodeSummit: Architecting Data-Driven, Smarter Cloud Native Apps with Real-T...
PivotalOpenSourceHub
 
PDF
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
PivotalOpenSourceHub
 
PDF
#GeodeSummit: Democratizing Fast Analytics with Ampool (Powered by Apache Geode)
PivotalOpenSourceHub
 
PDF
#GeodeSummit: Easy Ways to Become a Contributor to Apache Geode
PivotalOpenSourceHub
 
PDF
#GeodeSummit: Combining Stream Processing and In-Memory Data Grids for Near-R...
PivotalOpenSourceHub
 
PDF
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
PivotalOpenSourceHub
 
PDF
#GeodeSummit - Modern manufacturing powered by Spring XD and Geode
PivotalOpenSourceHub
 
PDF
#GeodeSummit - Large Scale Fraud Detection using GemFire Integrated with Gree...
PivotalOpenSourceHub
 
PDF
Redis, a 2 minutes introduction
Mirko Calvaresi
 
PDF
Pivotal Cloud Foundry: A Technical Overview
VMware Tanzu
 
PPTX
Building Services with WSO2 Application Server and WSO2 Microservices Framewo...
Sagara Gunathunga
 
PDF
Cloud Native Runtime Platform
VMware Tanzu
 
PDF
Spring Cloud Into Production
Todd Miller
 
PDF
Devops Recto-Verso @ DevoxxMA
Arnaud Héritier
 
PPTX
WSO2ConUS 2015 - Introduction to WSO2 Microservices Server (MSS)
Afkham Azeez
 
PDF
QCon SP 2016 - Construindo Microservices Auto-curáveis com Spring Cloud e Net...
Rodrigo Cândido da Silva
 
PDF
Spring Cloud Servicesの紹介 #pcf_tokyo
Toshiaki Maki
 
PDF
Apache Geode Meetup, London
Apache Geode
 
PDF
Build your first Internet of Things app today with Open Source
Apache Geode
 
#GeodeSummit: Architecting Data-Driven, Smarter Cloud Native Apps with Real-T...
PivotalOpenSourceHub
 
#GeodeSummit - Wall St. Derivative Risk Solutions Using Geode
PivotalOpenSourceHub
 
#GeodeSummit: Democratizing Fast Analytics with Ampool (Powered by Apache Geode)
PivotalOpenSourceHub
 
#GeodeSummit: Easy Ways to Become a Contributor to Apache Geode
PivotalOpenSourceHub
 
#GeodeSummit: Combining Stream Processing and In-Memory Data Grids for Near-R...
PivotalOpenSourceHub
 
#GeodeSummit - Using Geode as Operational Data Services for Real Time Mobile ...
PivotalOpenSourceHub
 
#GeodeSummit - Modern manufacturing powered by Spring XD and Geode
PivotalOpenSourceHub
 
#GeodeSummit - Large Scale Fraud Detection using GemFire Integrated with Gree...
PivotalOpenSourceHub
 
Redis, a 2 minutes introduction
Mirko Calvaresi
 
Pivotal Cloud Foundry: A Technical Overview
VMware Tanzu
 
Building Services with WSO2 Application Server and WSO2 Microservices Framewo...
Sagara Gunathunga
 
Cloud Native Runtime Platform
VMware Tanzu
 
Spring Cloud Into Production
Todd Miller
 
Devops Recto-Verso @ DevoxxMA
Arnaud Héritier
 
WSO2ConUS 2015 - Introduction to WSO2 Microservices Server (MSS)
Afkham Azeez
 
QCon SP 2016 - Construindo Microservices Auto-curáveis com Spring Cloud e Net...
Rodrigo Cândido da Silva
 
Spring Cloud Servicesの紹介 #pcf_tokyo
Toshiaki Maki
 
Apache Geode Meetup, London
Apache Geode
 
Build your first Internet of Things app today with Open Source
Apache Geode
 
Ad

Similar to #GeodeSummit - Integration & Future Direction for Spring Cloud Data Flow & Geode (17)

PPTX
Real-time Analysis of Data Processing Pipelines with Spring Cloud Data Flow a...
VMware Tanzu
 
PDF
Cloud-Native Patterns for Data-Intensive Applications
VMware Tanzu
 
PPTX
Spring Data and In-Memory Data Management in Action
John Blum
 
PPTX
Building cloud native data microservice
Nilanjan Roy
 
PPTX
Introducing Apache Geode and Spring Data GemFire
John Blum
 
PDF
Spring Cloud Data Flow Overview
VMware Tanzu
 
PDF
Pivoting Spring XD to Spring Cloud Data Flow with Sabby Anandan
PivotalOpenSourceHub
 
POTX
Building Effective Apache Geode Applications with Spring Data GemFire
John Blum
 
PPTX
Sweet Streams (Are made of this)
Corneil du Plessis
 
PDF
Spring Data (GemFire) Overview
John Blum
 
PPTX
Getting Started with Apache Geode
John Blum
 
PPTX
Building Highly Scalable Spring Applications using In-Memory Data Grids
John Blum
 
PDF
Session State Caching with Spring
VMware Tanzu
 
PPTX
Data Engineer's Lunch #56: Spring Cloud Data Flow with Cassandra
Anant Corporation
 
PDF
Resilient Microservices with Spring Cloud
VMware Tanzu
 
PPTX
Data Microservices In The Cloud + 日本語コメント
Takuya Saeki
 
PDF
Apache Geode Meetup, Cork, Ireland at CIT
Apache Geode
 
Real-time Analysis of Data Processing Pipelines with Spring Cloud Data Flow a...
VMware Tanzu
 
Cloud-Native Patterns for Data-Intensive Applications
VMware Tanzu
 
Spring Data and In-Memory Data Management in Action
John Blum
 
Building cloud native data microservice
Nilanjan Roy
 
Introducing Apache Geode and Spring Data GemFire
John Blum
 
Spring Cloud Data Flow Overview
VMware Tanzu
 
Pivoting Spring XD to Spring Cloud Data Flow with Sabby Anandan
PivotalOpenSourceHub
 
Building Effective Apache Geode Applications with Spring Data GemFire
John Blum
 
Sweet Streams (Are made of this)
Corneil du Plessis
 
Spring Data (GemFire) Overview
John Blum
 
Getting Started with Apache Geode
John Blum
 
Building Highly Scalable Spring Applications using In-Memory Data Grids
John Blum
 
Session State Caching with Spring
VMware Tanzu
 
Data Engineer's Lunch #56: Spring Cloud Data Flow with Cassandra
Anant Corporation
 
Resilient Microservices with Spring Cloud
VMware Tanzu
 
Data Microservices In The Cloud + 日本語コメント
Takuya Saeki
 
Apache Geode Meetup, Cork, Ireland at CIT
Apache Geode
 
Ad

More from PivotalOpenSourceHub (13)

PPTX
Zettaset Elastic Big Data Security for Greenplum Database
PivotalOpenSourceHub
 
PPTX
New Security Framework in Apache Geode
PivotalOpenSourceHub
 
PPTX
Apache Geode Clubhouse - WAN-based Replication
PivotalOpenSourceHub
 
PDF
Building Apps with Distributed In-Memory Computing Using Apache Geode
PivotalOpenSourceHub
 
PPTX
GPORCA: Query Optimization as a Service
PivotalOpenSourceHub
 
PPTX
Apache Zeppelin Meetup Christian Tzolov 1/21/16
PivotalOpenSourceHub
 
PPTX
Build & test Apache Hawq
PivotalOpenSourceHub
 
PDF
Postgre sql linuxcontainers by Jignesh Shah
PivotalOpenSourceHub
 
PPTX
kafka for db as postgres
PivotalOpenSourceHub
 
PPTX
Geode Transactions by Swapnil Bawaskar
PivotalOpenSourceHub
 
PPTX
Greenplum Database Open Source December 2015
PivotalOpenSourceHub
 
PPTX
MADlib Architecture and Functional Demo on How to Use MADlib/PivotalR
PivotalOpenSourceHub
 
PDF
Data Science Perspective and DS demo
PivotalOpenSourceHub
 
Zettaset Elastic Big Data Security for Greenplum Database
PivotalOpenSourceHub
 
New Security Framework in Apache Geode
PivotalOpenSourceHub
 
Apache Geode Clubhouse - WAN-based Replication
PivotalOpenSourceHub
 
Building Apps with Distributed In-Memory Computing Using Apache Geode
PivotalOpenSourceHub
 
GPORCA: Query Optimization as a Service
PivotalOpenSourceHub
 
Apache Zeppelin Meetup Christian Tzolov 1/21/16
PivotalOpenSourceHub
 
Build & test Apache Hawq
PivotalOpenSourceHub
 
Postgre sql linuxcontainers by Jignesh Shah
PivotalOpenSourceHub
 
kafka for db as postgres
PivotalOpenSourceHub
 
Geode Transactions by Swapnil Bawaskar
PivotalOpenSourceHub
 
Greenplum Database Open Source December 2015
PivotalOpenSourceHub
 
MADlib Architecture and Functional Demo on How to Use MADlib/PivotalR
PivotalOpenSourceHub
 
Data Science Perspective and DS demo
PivotalOpenSourceHub
 

Recently uploaded (20)

PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
DOCX
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
Cryptography Quiz: test your knowledge of this important security concept.
Rajni Bhardwaj Grover
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 

#GeodeSummit - Integration & Future Direction for Spring Cloud Data Flow & Geode

  • 1. Spring Cloud Data Flow + Geode Sabby Anandan | Product Manager | @sabbyanandan
  • 2. Stream Batch Spring Cloud Data Flow Spring Cloud Stream Spring Cloud Task Shell; DSL; REST- APIs Drag & Drop UI Security OOTB Connectors Reactive Data Science
  • 3. Dataflow Server Admin / Flo UI Shell CURL ??X Stream/Task Spring Boot Apps YARN
  • 4. Why are we here? Data Pipelines requiring: • Low latency and in-memory processing • High Throughput SLAs • Correlation between reference-data and data- in-flight • Frequent data-shuffling
  • 6. | = ? http | transform | log
  • 9. Region Data Buckets http | transform | log Geode Cluster transform-processor.jar PARTITION transform-processor.jar PARTITION log-sink.jar PARTITION log-sink.jar PARTITION http-source.jar PARTITION_PROXY http-source.jar PARTITION_PROXY
  • 10. What’s next? •+/- scaling and automatic re-partitioning •Stream / Task metadata-repository •Key-Value store for OOTB Counters •Partition level local-state and SQL-like stream processing
  • 11. 11 Join the Apache Geode Community! • Check out http://geode.incubator.apache.org • Subscribe: [email protected] • Download: http://geode.incubator.apache.org/releases/