What Is Apache Flink?
● A stream processing framework
● Open source / Apache 2.0 license
● Written in Java and Scala
● For batch and stream processing
● For high-volume, low-latency workloads
● Develop in Java, Scala, Python, SQL
● Programs are automatically compiled and optimized into data flows
How Does Flink Work?
● Processes unbounded and bounded data
● Uses file systems to consume and persistently store data, e.g.
– local, Hadoop-compatible, Amazon S3, MapR FS, OpenStack Swift FS, Aliyun OSS and Azure Blob Storage
● Leverages in-memory performance
● Provides a rich function set for handling streams, state and time when building applications
● Provides layered APIs that balance conciseness and expressiveness (see next slide)
How Does Flink Work?
Flink layered APIs
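The idea of processing bounded and unbounded data with the same stateful logic can be illustrated with a minimal sketch in plain Python. It does not use Flink or any Flink API; `running_count` and `sensor_readings` are hypothetical names chosen for illustration, and a generator stands in for an unbounded source.

```python
from collections import defaultdict
from itertools import count, islice
from typing import Dict, Iterable, Iterator, Tuple


def running_count(events: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Emit (key, count so far) for every event; state is kept per key."""
    state: Dict[str, int] = defaultdict(int)  # keyed state, held in memory
    for key in events:
        state[key] += 1
        yield key, state[key]


def sensor_readings() -> Iterator[str]:
    """Hypothetical unbounded source: it never finishes on its own."""
    for i in count():
        yield "sensor-a" if i % 2 == 0 else "sensor-b"


# Bounded input: the stream ends, so the full result can be collected.
bounded_result = list(running_count(["a", "b", "a"]))

# Unbounded input: results arrive incrementally; take only the first four.
unbounded_prefix = list(islice(running_count(sensor_readings()), 4))
```

The same function handles both inputs; only the caller decides whether to wait for the end of the data or to consume results as they are produced.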
Flink APIs
● SQL & Table API
● DataStream API
● ProcessFunctions – event processing
● Flink also has libraries for common data processing use cases
– Complex Event Processing (CEP)
– DataSet API
– Gelly - library for scalable graph processing/analysis
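The trade-off between conciseness and expressiveness across API layers can be sketched in plain Python. This is an illustration only, not Flink code: `Click`, `clicks_per_user` and `WindowedClickCounter` are hypothetical names, and the class merely imitates the style of a per-event process function with explicit state and event-time windows.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Click:
    user: str
    timestamp: int  # event time in seconds (assumed unit)


def clicks_per_user(events: Iterable[Click]) -> Dict[str, int]:
    """High-level style: one declarative aggregation, no visible state."""
    return dict(Counter(e.user for e in events))


class WindowedClickCounter:
    """Low-level style: explicit per-key state and event-time windows."""

    def __init__(self, window_size: int) -> None:
        self.window_size = window_size
        # (user, window start) -> number of clicks in that window
        self.state: Dict[Tuple[str, int], int] = defaultdict(int)

    def process_element(self, event: Click) -> None:
        # Assign the event to a tumbling window based on its own timestamp.
        window_start = event.timestamp - event.timestamp % self.window_size
        self.state[(event.user, window_start)] += 1

    def results(self) -> List[Tuple[str, int, int]]:
        return sorted((u, w, c) for (u, w), c in self.state.items())


events = [Click("ann", 1), Click("bob", 4), Click("ann", 9), Click("ann", 12)]
totals = clicks_per_user(events)

counter = WindowedClickCounter(window_size=10)
for event in events:
    counter.process_element(event)
windowed = counter.results()
```

The declarative version is shorter but fixes what can be computed; the per-event version is longer but gives full control over state and time.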
Flink Used By
Flink Deployment
● Deploy Flink with the following cluster managers
– YARN
– Mesos
– Kubernetes
– Standalone
● All application control communication takes place via REST calls
● Deploy at any scale
– multiple trillions of events per day
– multiple terabytes of state
– thousands of cores
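As a rough sketch of what control via REST calls can look like, the snippet below builds and (optionally) queries a job overview URL using only the Python standard library. The base URL, port 8081 and the `/jobs/overview` path are assumptions here; check them against the REST API documentation for the Flink version in use. The function names are hypothetical.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen


def jobs_overview_url(base_url: str) -> str:
    """Build the URL of the (assumed) job overview endpoint."""
    return urljoin(base_url.rstrip("/") + "/", "jobs/overview")


def fetch_jobs_overview(base_url: str, timeout: float = 5.0) -> dict:
    """Fetch the overview as parsed JSON; needs a reachable cluster."""
    with urlopen(jobs_overview_url(base_url), timeout=timeout) as response:
        return json.load(response)


# Assumed address of a locally running cluster; no request is sent here.
url = jobs_overview_url("http://localhost:8081")
```

Only `jobs_overview_url` is exercised above; `fetch_jobs_overview` is left uncalled because it requires a running cluster.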
Flink Architecture
Flink Stateful Functions
● Simplifies building distributed stateful applications
● Provides a runtime built for serverless architectures
● Key Benefits
– Dynamic Messaging
– Consistent State
– Multi-language Support
– No Database Required
– Cloud Native
– "Stateless" Operation
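The benefits listed above (dynamic messaging, consistent state, "stateless" operation) can be pictured with a toy runtime in plain Python. This is not the Stateful Functions SDK: `Runtime`, `greeter` and `printer` are hypothetical, and the sketch only assumes that functions are addressed by a type and an id, that the runtime owns their state, and that functions communicate by sending messages.

```python
from collections import defaultdict, deque
from typing import Callable, Deque, Dict, List, Tuple

Address = Tuple[str, str]  # (function type, instance id)


class Runtime:
    """Toy runtime: routes messages and owns all state for the functions."""

    def __init__(self) -> None:
        self.functions: Dict[str, Callable] = {}
        self.state: Dict[Address, dict] = defaultdict(dict)
        self.mailbox: Deque[Tuple[Address, object]] = deque()
        self.egress: List[str] = []

    def register(self, function_type: str, fn: Callable) -> None:
        self.functions[function_type] = fn

    def send(self, target: Address, message: object) -> None:
        self.mailbox.append((target, message))

    def run(self) -> None:
        # Deliver messages in FIFO order, handing each function its own state.
        while self.mailbox:
            target, message = self.mailbox.popleft()
            self.functions[target[0]](self, target, self.state[target], message)


def greeter(runtime: Runtime, address: Address, state: dict, name: str) -> None:
    # State is passed in by the runtime; the function itself holds none.
    state["seen"] = state.get("seen", 0) + 1
    runtime.send(("printer", "out"), f"Hello {name}, visit #{state['seen']}")


def printer(runtime: Runtime, address: Address, state: dict, text: str) -> None:
    runtime.egress.append(text)


runtime = Runtime()
runtime.register("greeter", greeter)
runtime.register("printer", printer)
for user in ["ann", "bob", "ann"]:
    runtime.send(("greeter", user), user)
runtime.run()
```

Each greeter instance sees only the state for its own id, which is why the second message for "ann" reports a second visit while "bob" stays at one.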
Flink Use Cases
● Event-driven applications, e.g.
– Fraud detection
– Anomaly detection
● Data analytics applications, e.g.
– Quality monitoring of telco networks
– Analysis of product updates and experiment evaluation in mobile applications
● Data pipeline applications, e.g.
– Real-time search index building in e-commerce
– Continuous ETL in e-commerce
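To make the fraud detection use case concrete, here is a minimal event-driven sketch in plain Python rather than Flink. The rule, the thresholds and all names are assumptions for illustration: an alert is raised when a small transaction is followed by a large one on the same account within a short time span, and events are assumed to arrive in timestamp order.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List

SMALL_AMOUNT = 1.00     # assumed threshold for a "test" transaction
LARGE_AMOUNT = 500.00   # assumed threshold for a suspicious transaction
WINDOW_SECONDS = 60     # assumed maximum gap between the two


@dataclass(frozen=True)
class Transaction:
    account: str
    amount: float
    timestamp: int  # event time in seconds


def detect_fraud(transactions: Iterable[Transaction]) -> List[Transaction]:
    """Return large transactions that closely follow a small one."""
    last_small_at: Dict[str, int] = {}  # per-account keyed state
    alerts: List[Transaction] = []
    for tx in transactions:
        small_at = last_small_at.get(tx.account)
        if (
            tx.amount > LARGE_AMOUNT
            and small_at is not None
            and tx.timestamp - small_at <= WINDOW_SECONDS
        ):
            alerts.append(tx)
        # Remember only the most recent transaction if it was small.
        if tx.amount < SMALL_AMOUNT:
            last_small_at[tx.account] = tx.timestamp
        else:
            last_small_at.pop(tx.account, None)
    return alerts


alerts = detect_fraud([
    Transaction("acct1", 0.50, 0),
    Transaction("acct2", 900.00, 5),
    Transaction("acct1", 700.00, 30),
    Transaction("acct3", 0.20, 10),
    Transaction("acct3", 800.00, 200),
])
```

Only the large transaction on acct1 is flagged: acct2 had no preceding small transaction, and on acct3 the large one arrived after the assumed 60-second span.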
Available Books
● See “Big Data Made Easy”
– Apress, Jan 2015
● See “Mastering Apache Spark”
– Packt, Oct 2015
● See “Complete Guide to Open Source Big Data Stack”
– Apress, Jan 2018
● Find the author on Amazon
– www.amazon.com/Michael-Frampton/e/B00NIQDOOM/
● Connect on LinkedIn
– www.linkedin.com/in/mike-frampton-38563020
Connect
● Feel free to connect on LinkedIn
– www.linkedin.com/in/mike-frampton-38563020
● See my open source blog at
– open-source-systems.blogspot.com/
● I am always interested in
– New technology
– Opportunities
– Technology-based issues
– Big data integration

Apache Flink
