SlideShare a Scribd company logo
What’s new in
Apache FlinkTM 1.0
Kostas Tzoumas
@kostas_tzoumas
Flink 1.0
• March 8, 2016
• First release in 1.x.y series
• Initiates backwards compatibility for selected APIs
• More than 64 contributors
• More than 450 JIRAs resolved
2
Flink 1.0: major features
• Out of core state
• Savepoints
• CEP library
• Improved monitoring & Kafka 0.9 support
3
Out of core state
4
Out of core state
• Alternative to in-memory state
• Powered by RocksDB instances in Flink TMs
• Enabled by using the RocksDBStateBackend
• State limited by disk space only
• State checkpoints save RocksDB databases in
reliable store
5
Savepoints
6
Production deployments
• Maintaining stateful applications in production
settings comes with its own challenges
• Failures, code upgrades, cluster maintenance, …
• Streaming jobs cannot be simply stopped and
restarted
7
Reminder: fault tolerance
• At least once, at most once, exactly once
• Flink guarantees exactly-once processing
• Flink guarantees end to end exactly-once with
selected sources and sinks
• e.g., Kafka —> Flink —> HDFS
How? Checkpoints
• Flink guarantees fault tolerance by regularly taking checkpoints
of the application state without ever stopping the execution
• At failure, input stream is rewinded to the logical time of the last
checkpoint
9
Introducing savepoints
• A savepoint is a Flink checkpoint that (1) is taken by
the user, (2) is accessible externally, and (3) never
expires
• Command line save & resume interface
• Save: flink savepoint <JobID>
• Resume: flink run -s
<path/to/savepoint> <jobJar>
10
Savepoints and versions
• A savepoint saves a version of a stateful application at a
well-defined time
• E.g.: take snapshots of one application at well-defined
times
11
“Like git for state”
• Branch off from savepoints creating a tree of
running application versions
12
Essential for production
deployments
• Application code upgrades
• Flink version upgrades
• Maintenance, migration, debugging
• What-if simulations
• A/B testing
• Time travel
13
Complex Event
Processing
14
FlinkCEP
• What is Complex Event Processing?
• A catch-all term
• In our context: easily detect patterns in streams
15
16
Pattern API
17
18
19
Other features in 1.0
• Support for Kafka 0.9 API (and hence MapR
Streams)
• Monitoring console: job submission, checkpoint
statistics, detecting bottlenecks
• See
http://flink.apache.org/news/2016/03/08/release-
1.0.0.html
20
Closing
21
Summary
• Flink 1.0: Initiating backwards compatibility and
pushing the envelope even further for production
streaming deployments
22
What’s next
• SQL
• Dynamic scaling (+ savepoints)
• Hybrid in-memory/out-of-core state backend
• Query-able state
• Support for Apache Mesos
• More connectors and sinks (Kinesis, Cassandra, …)
23
Join the community
• Follow: @ApacheFlink, @dataArtisans
• Read: flink.apache.org/blog, data-artisans.com/blog
• Subscribe: (news | dev | user)@flink.apache.org

More Related Content

What's hot (20)

PPTX
The Evolution of (Open Source) Data Processing
Aljoscha Krettek
 
PDF
Jamie Grier - Robust Stream Processing with Apache Flink
Flink Forward
 
PDF
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Apache Flink Taiwan User Group
 
PPTX
Taking a look under the hood of Apache Flink's relational APIs.
Fabian Hueske
 
PPTX
Robust Stream Processing With Apache Flink
Jamie Grier
 
PPTX
Aljoscha Krettek - The Future of Apache Flink
Flink Forward
 
PPTX
Apache Flink Community Updates November 2016 @ Berlin Meetup
Robert Metzger
 
PDF
Introduction to Apache Flink
datamantra
 
PPTX
Apache flink 1.7 and Beyond
Till Rohrmann
 
PPTX
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
Flink Forward
 
PDF
Stream Processing with Apache Flink
C4Media
 
PPTX
From Apache Flink® 1.3 to 1.4
Till Rohrmann
 
PPTX
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica
 
PPTX
Apache Flink: Past, Present and Future
Gyula Fóra
 
PPTX
Data Stream Processing with Apache Flink
Fabian Hueske
 
PPTX
Apache Flink Berlin Meetup May 2016
Stephan Ewen
 
PPTX
The Stream Processor as the Database - Apache Flink @ Berlin buzzwords
Stephan Ewen
 
PPTX
GOTO Night Amsterdam - Stream processing with Apache Flink
Robert Metzger
 
PPTX
Flink Forward Berlin 2017: Hao Wu - Large Scale User Behavior Analytics by Flink
Flink Forward
 
PPTX
Kostas Tzoumas - Apache Flink®: State of the Union and What's Next
Ververica
 
The Evolution of (Open Source) Data Processing
Aljoscha Krettek
 
Jamie Grier - Robust Stream Processing with Apache Flink
Flink Forward
 
Stream Processing with Apache Flink (Flink.tw Meetup 2016/07/19)
Apache Flink Taiwan User Group
 
Taking a look under the hood of Apache Flink's relational APIs.
Fabian Hueske
 
Robust Stream Processing With Apache Flink
Jamie Grier
 
Aljoscha Krettek - The Future of Apache Flink
Flink Forward
 
Apache Flink Community Updates November 2016 @ Berlin Meetup
Robert Metzger
 
Introduction to Apache Flink
datamantra
 
Apache flink 1.7 and Beyond
Till Rohrmann
 
Flink Forward Berlin 2017: Till Rohrmann - From Apache Flink 1.3 to 1.4
Flink Forward
 
Stream Processing with Apache Flink
C4Media
 
From Apache Flink® 1.3 to 1.4
Till Rohrmann
 
Stephan Ewen - Experiences running Flink at Very Large Scale
Ververica
 
Apache Flink: Past, Present and Future
Gyula Fóra
 
Data Stream Processing with Apache Flink
Fabian Hueske
 
Apache Flink Berlin Meetup May 2016
Stephan Ewen
 
The Stream Processor as the Database - Apache Flink @ Berlin buzzwords
Stephan Ewen
 
GOTO Night Amsterdam - Stream processing with Apache Flink
Robert Metzger
 
Flink Forward Berlin 2017: Hao Wu - Large Scale User Behavior Analytics by Flink
Flink Forward
 
Kostas Tzoumas - Apache Flink®: State of the Union and What's Next
Ververica
 

Viewers also liked (7)

PDF
Apache Flink Training Workshop @ HadoopCon2016 - #4 Advanced Stream Processing
Apache Flink Taiwan User Group
 
PPTX
Flink Streaming @BudapestData
Gyula Fóra
 
PPTX
Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...
Stephan Ewen
 
PPTX
Streaming in the Wild with Apache Flink
DataWorks Summit/Hadoop Summit
 
PDF
Thomas Lamirault_Mohamed Amine Abdessemed -A brief history of time with Apac...
Flink Forward
 
PPTX
Overview of Apache Flink: Next-Gen Big Data Analytics Framework
Slim Baltagi
 
PPTX
Flink vs. Spark
Slim Baltagi
 
Apache Flink Training Workshop @ HadoopCon2016 - #4 Advanced Stream Processing
Apache Flink Taiwan User Group
 
Flink Streaming @BudapestData
Gyula Fóra
 
Apache Flink - Overview and Use cases of a Distributed Dataflow System (at pr...
Stephan Ewen
 
Streaming in the Wild with Apache Flink
DataWorks Summit/Hadoop Summit
 
Thomas Lamirault_Mohamed Amine Abdessemed -A brief history of time with Apac...
Flink Forward
 
Overview of Apache Flink: Next-Gen Big Data Analytics Framework
Slim Baltagi
 
Flink vs. Spark
Slim Baltagi
 
Ad

Similar to Flink 1.0-slides (20)

PPTX
Apache flink 1.0.0 overview
MapR Technologies
 
PPTX
Flink 0.10 - Upcoming Features
Aljoscha Krettek
 
PDF
Consensus in Apache Kafka: From Theory to Production.pdf
Guozhang Wang
 
PDF
Flink forward-2017-netflix keystones-paas
Monal Daxini
 
PPTX
Serverless design with Fn project
Siva Rama Krishna Chunduru
 
PDF
Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul
HostedbyConfluent
 
PDF
Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul
HostedbyConfluent
 
PDF
IBM XL Compilers Performance Tuning 2016-11-18
Yaoqing Gao
 
PDF
Introducing the Apache Flink Kubernetes Operator
Flink Forward
 
PDF
Flink at netflix paypal speaker series
Monal Daxini
 
PDF
Tips and Tricks for Operating Apache Kafka
All Things Open
 
PPTX
Stream Processing @ Lyft
Jamie Grier
 
PDF
Streaming Processing with a Distributed Commit Log
Joe Stein
 
PDF
Ippevent : openshift Introduction
kanedafromparis
 
PPTX
Realtime traffic analyser
Alex Moskvin
 
PPTX
Apache Kafka
emreakis
 
PDF
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
Flink Forward
 
PPTX
Mainframe Virtual User Group Summer 2013
Serena Software
 
PPTX
A tour of Java and the JVM
Alex Birch
 
PPTX
Discover Quarkus and GraalVM
Romain Schlick
 
Apache flink 1.0.0 overview
MapR Technologies
 
Flink 0.10 - Upcoming Features
Aljoscha Krettek
 
Consensus in Apache Kafka: From Theory to Production.pdf
Guozhang Wang
 
Flink forward-2017-netflix keystones-paas
Monal Daxini
 
Serverless design with Fn project
Siva Rama Krishna Chunduru
 
Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul
HostedbyConfluent
 
Better Kafka Performance Without Changing Any Code | Simon Ritter, Azul
HostedbyConfluent
 
IBM XL Compilers Performance Tuning 2016-11-18
Yaoqing Gao
 
Introducing the Apache Flink Kubernetes Operator
Flink Forward
 
Flink at netflix paypal speaker series
Monal Daxini
 
Tips and Tricks for Operating Apache Kafka
All Things Open
 
Stream Processing @ Lyft
Jamie Grier
 
Streaming Processing with a Distributed Commit Log
Joe Stein
 
Ippevent : openshift Introduction
kanedafromparis
 
Realtime traffic analyser
Alex Moskvin
 
Apache Kafka
emreakis
 
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
Flink Forward
 
Mainframe Virtual User Group Summer 2013
Serena Software
 
A tour of Java and the JVM
Alex Birch
 
Discover Quarkus and GraalVM
Romain Schlick
 
Ad

Recently uploaded (20)

PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
PDF
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
PPTX
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
PPTX
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
PPTX
Day2 B2 Best.pptx
helenjenefa1
 
PPTX
GitOps_Without_K8s_Training_detailed git repository
DanialHabibi2
 
PDF
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
PDF
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
PPTX
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
PPTX
What is Shot Peening | Shot Peening is a Surface Treatment Process
Vibra Finish
 
PPTX
美国电子版毕业证南卡罗莱纳大学上州分校水印成绩单USC学费发票定做学位证书编号怎么查
Taqyea
 
PPTX
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PPTX
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
PDF
Zilliz Cloud Demo for performance and scale
Zilliz
 
PPTX
Damage of stability of a ship and how its change .pptx
ehamadulhaque
 
PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PDF
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
PDF
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
PDF
Ethics and Trustworthy AI in Healthcare – Governing Sensitive Data, Profiling...
AlqualsaDIResearchGr
 
PPTX
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
Heart Bleed Bug - A case study (Course: Cryptography and Network Security)
Adri Jovin
 
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
Day2 B2 Best.pptx
helenjenefa1
 
GitOps_Without_K8s_Training_detailed git repository
DanialHabibi2
 
MAD Unit - 2 Activity and Fragment Management in Android (Diploma IT)
JappanMavani
 
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
Worm gear strength and wear calculation as per standard VB Bhandari Databook.
shahveer210504
 
What is Shot Peening | Shot Peening is a Surface Treatment Process
Vibra Finish
 
美国电子版毕业证南卡罗莱纳大学上州分校水印成绩单USC学费发票定做学位证书编号怎么查
Taqyea
 
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
Shinkawa Proposal to meet Vibration API670.pptx
AchmadBashori2
 
Zilliz Cloud Demo for performance and scale
Zilliz
 
Damage of stability of a ship and how its change .pptx
ehamadulhaque
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
PORTFOLIO Golam Kibria Khan — architect with a passion for thoughtful design...
MasumKhan59
 
Ethics and Trustworthy AI in Healthcare – Governing Sensitive Data, Profiling...
AlqualsaDIResearchGr
 
artificial intelligence applications in Geomatics
NawrasShatnawi1
 

Flink 1.0-slides

  • 1. What’s new in Apache FlinkTM 1.0 Kostas Tzoumas @kostas_tzoumas
  • 2. Flink 1.0 • March 8, 2016 • First release in 1.x.y series • Initiates backwards compatibility for selected APIs • More than 64 contributors • More than 450 JIRAs resolved 2
  • 3. Flink 1.0: major features • Out of core state • Savepoints • CEP library • Improved monitoring & Kafka 0.9 support 3
  • 4. Out of core state 4
  • 5. Out of core state • Alternative to in-memory state • Powered by RocksDB instances in Flink TMs • Enabled by using the RocksDBStateBackend • State limited by disk space only • State checkpoints save RocksDB databases in reliable store 5
  • 7. Production deployments • Maintaining stateful applications in production settings comes with its own challenges • Failures, code upgrades, cluster maintenance, … • Streaming jobs cannot be simply stopped and restarted 7
  • 8. Reminder: fault tolerance • At least once, at most once, exactly once • Flink guarantees exactly-once processing • Flink guarantees end to end exactly-once with selected sources and sinks • e.g., Kafka —> Flink —> HDFS
  • 9. How? Checkpoints • Flink guarantees fault tolerance by regularly taking checkpoints of the application state without ever stopping the execution • At failure, input stream is rewinded to the logical time of the last checkpoint 9
  • 10. Introducing savepoints • A savepoint is a Flink checkpoint that (1) is taken by the user, (2) is accessible externally, and (3) never expires • Command line save & resume interface • Save: flink savepoint <JobID> • Resume: flink run -s <path/to/savepoint> <jobJar> 10
  • 11. Savepoints and versions • A savepoint saves a version of a stateful application at a well-defined time • E.g.: take snapshots of one application at well-defined times 11
  • 12. “Like git for state” • Branch off from savepoints creating a tree of running application versions 12
  • 13. Essential for production deployments • Application code upgrades • Flink version upgrades • Maintenance, migration, debugging • What-if simulations • A/B testing • Time travel 13
  • 15. FlinkCEP • What is Complex Event Processing? • A catch-all term • In our context: easily detect patterns in streams 15
  • 16. 16
  • 18. 18
  • 19. 19
  • 20. Other features in 1.0 • Support for Kafka 0.9 API (and hence MapR Streams) • Monitoring console: job submission, checkpoint statistics, detecting bottlenecks • See http://flink.apache.org/news/2016/03/08/release- 1.0.0.html 20
  • 22. Summary • Flink 1.0: Initiating backwards compatibility and pushing the envelope even further for production streaming deployments 22
  • 23. What’s next • SQL • Dynamic scaling (+ savepoints) • Hybrid in-memory/out-of-core state backend • Query-able state • Support for Apache Mesos • More connectors and sinks (Kinesis, Cassandra, …) 23
  • 24. Join the community • Follow: @ApacheFlink, @dataArtisans • Read: flink.apache.org/blog, data-artisans.com/blog • Subscribe: (news | dev | user)@flink.apache.org