SlideShare a Scribd company logo
Dataflow	with	
Apache	NiFi
Aldrin	Piri	- @aldrinpiri
Apache	NiFi Crash	Course
DataWorks Summit	2017	– Munich
6	April	2017
2 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Key:	'Apache	NiFi’
Value:	'PMC	Member'
Key:	'Work’
Value:	’Sr.	Member	of	Technical	Staff	@	Hortonworks'
Key:	'Working	with	NiFi Since’
Value:	'2010’
3 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
4 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
5 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Let’s	Connect	A	to	B
Producers	A.K.A	Things
Anything
AND	
Everything
Internet!
Consumers
• User
• Storage
• System
• …More	Things
6 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Moving	data	effectively	is	hard
Standards:		http://xkcd.com/927/
7 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Why	is	moving	data	effectively	hard?	
à Standards
à Formats
à “Exactly	Once”	Delivery
à Protocols
à Veracity	of	Information
à Validity	of	Information
à Ensuring	Security
à Overcoming	Security
à Compliance
à Schemas
à Consumers	Change
à Credential	Management
à “That [person|team|group]”
à Network
à “Exactly	Once”	Delivery
8 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Let’s	Connect	Lots	of	As	to	Bs to	As	to	Cs	to	Bs to	Δs to	Cs	to	ϕs
Let’s	consider	the	needs	of	a	courier	service
Physical	Store
Gateway	
Server
Mobile	Devices
Registers
Server	Cluster
Distribution	Center Core	Data	Center	at	HQ
Server	Cluster
On	Delivery	Routes
Trucks Deliverers
Delivery	Truck:	Creative	Stall,	https://thenounproject.com/creativestall/
Deliverer:	Rigo Peter,	https://thenounproject.com/rigo/
Cash	Register:	Sergey	Patutin,	https://thenounproject.com/bdesign.by/
Hand	Scanner:	Eric	Pearson,	https://thenounproject.com/epearson001/
9 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Great!	I	am	collecting	all	this	data!		Let’s	use	it!
Finding	our	needles	in	the	haystack
Physical	Store
Gateway	
Server
Mobile	Devices
Registers
Server	Cluster
Distribution	Center
Kafka
Core	Data	Center	at	HQ
Server	Cluster
Others
Storm	/	Spark	/	
Flink /	Apex
Kafka
Storm	/	Spark	/	Flink /	Apex
On	Delivery	Routes
Trucks Deliverers
Delivery	Truck:	Creative	Stall,	https://thenounproject.com/creativestall/
Deliverer:	Rigo Peter,	https://thenounproject.com/rigo/
Cash	Register:	Sergey	Patutin,	https://thenounproject.com/bdesign.by/
Hand	Scanner:	Eric	Pearson,	https://thenounproject.com/epearson001/
10 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Why	is	moving	data	effectively	hard	when	scoped	internally?	
à Standards
à Formats
à “Exactly	Once”	Delivery
à Protocols
à Veracity	of	Information
à Validity	of	Information
à Ensuring	Security
à Overcoming	Security
à Compliance
à Schemas
à Consumers	Change
à Credential	Management
à “That [person|team|group]”
à Network
à “Exactly	Once”	Delivery
11 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Let’s	Connect	Lots	of	As	to	Bs to	As	to	Cs	to	Bs to	Δs to	Cs	to	ϕs
Oh,	that	courier	service	is	global
12 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Why	is	moving	data	effectively	hard	when	scoped	globally?	
à Standards
à Formats
à “Exactly	Once”	Delivery
à Protocols
à Veracity	of	Information
à Validity	of	Information
à Ensuring	Security
à Overcoming	Security
à Compliance
à Schemas
à Consumers	Change
à Credential	Management
à “That [person|team|group]”
à Network
à “Exactly	Once”	Delivery
13 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
The	Unassuming	Line:		A	Case	Study
We’ve	seen	a	few	lines	show	up	in	the	wild	thus	far
Internet! Inter- &	Intra- connections	in
our	global	courier	enterprise
Spotlight:	Arthur	Lacôte,	https://thenounproject.com/turo/
14 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Dataflow	Line	Anatomy	101
Let’s	dissect	what	this	line	typically	represents
Fig	1.		Lineus Worldwidewebus.	Common	Name:	Internet!
Script	or	
Application
Script	or	
Application
Data Data
Disparate	Transport
Mechanisms
15 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Dataflow	Line	Anatomy	201
Sometimes	that	transport	is	just	more	lines
Fig	1.		Lineus Worldwidewebus.	Common	Name:	Internet!
Script	or	
Application
Script	or	
Application
Line	Inception
Data Data
16 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Dataflow	Line	Anatomy	301
But	those	lines	could	also	have	components…
Fig	1.		Lineus Worldwidewebus.	Common	Name:	Internet!
17 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
18 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Apache	NiFi
Key	Features
• Guaranteed	delivery
• Data	buffering	
- Backpressure
- Pressure	release
• Prioritized	queuing
• Flow	specific	QoS
- Latency	vs.	throughput
- Loss	tolerance
• Data	provenance
• Supports	push	and	pull	
models
• Recovery/recording	
a	rolling	log	of	fine-
grained	history
• Visual	command	and	
control
• Flow	templates
• Pluggable/multi-role	
security
• Designed	for	extension
• Clustering
19 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Apache	NiFi Subproject:	MiNiFi
à Let	me	get	the	key	parts	of	NiFi close	to	where	data	begins	and	provide	bidrectional
communication
à NiFi lives	in	the	data	center.		Give	it	an	enterprise	server	or	a	cluster	of	them.
à MiNiFi lives	as	close	to	where	data	is	born	and	is	a	guest	on	that	device	or	system
20 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Let’s	revisit	our	courier	service	from	the	perspective	of	NiFi
Physical	Store
Gateway	
Server
Mobile	Devices
Registers
Server	Cluster
Distribution	Center
Kafka
Core	Data	Center	at	HQ
Server	Cluster
Others
Storm	/	Spark	/	
Flink /	Apex
Kafka
Storm	/	Spark	/	Flink /	Apex
On	Delivery	Routes
Trucks Deliverers
Delivery	Truck:	Creative	Stall,	https://thenounproject.com/creativestall/
Deliverer:	Rigo Peter,	https://thenounproject.com/rigo/
Cash	Register:	Sergey	Patutin,	https://thenounproject.com/bdesign.by/
Hand	Scanner:	Eric	Pearson,	https://thenounproject.com/epearson001/
Client	
Libraries
Client	
Libraries
MiNiFi
MiNiFi
NiFi NiFi NiFi NiFi NiFi NiFi
Client	
Libraries
21 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Apache	NiFi Managed	Dataflow
SOURCES
REGIONAL	
INFRASTRUCTURE
CORE	
INFRASTRUCTURE
22 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
NiFi is	based	on	Flow	Based	Programming	(FBP)
FBP	Term NiFi Term Description
Information	
Packet
FlowFile Each object	moving	through	the	system.
Black Box FlowFile	
Processor
Performs	the	work, doing	some	combination	of	data	routing,	transformation,	
or	mediation	between	systems.
Bounded	
Buffer
Connection The	linkage between	processors, acting	as	queues	and	allowing	various	
processes	to	interact	at	differing	rates.
Scheduler Flow	
Controller
Maintains	the	knowledge	of	how	processes	are	connected, and	manages	the	
threads	and	allocations	thereof	which	all	processes	use.
Subnet Process	
Group
A	set	of	processes	and	their	connections,	which	can	receive	and	send	data	via	
ports.	A	process group	allows	creation	of	entirely	new	component	simply	by	
composition	of	its components.
23 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
FlowFiles &	Data	Agnosticism
à NiFi is	data	agnostic!
à But,	NiFi was	designed	understanding	that	users
can	care	about	specifics	and	provides	tooling	
to	interact	with	specific	formats,	protocols,	etc.
ISO	8601	- http://xkcd.com/1179/
Robustness	principle
Be	conservative	in	what	you	do,	
be	liberal	in	what	you	accept	from	others“
24 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
FlowFiles are	like	HTTP	data
HTTP	Data FlowFile
HTTP/1.1	200	OK
Date:	Sun,	10	Oct	2010	23:26:07	GMT
Server:	Apache/2.2.8	(CentOS)	OpenSSL/0.9.8g
Last-Modified:	Sun,	26	Sep	2010	22:04:35	GMT
ETag:	"45b6-834-49130cc1182c0"
Accept-Ranges:	bytes
Content-Length:	13
Connection:	close
Content-Type:	text/html
Hello	world!
Standard	FlowFile Attributes
Key:	'entryDate’ Value:	'Fri	Jun	17	17:15:04	EDT	2016'
Key:	'lineageStartDate’			Value:	'Fri	Jun	17	17:15:04	EDT	2016'
Key:	'fileSize’ Value:	'23609'
FlowFile Attribute	Map	Content
Key:	'filename’ Value:	'15650246997242'
Key:	'path’ Value:	'./’
Binary	Content	*
Header
Content
25 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Live	Demo
Community
26 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Extension	/	Integration	Points
NiFi Term Description
Flow File	
Processor
Push/Pull behavior.		Custom	UI
Reporting
Task
Used to	push	data	from	NiFi to	some	external	service	(metrics,	provenance,	
etc..)
Controller	
Service
Used	to	enable	reusable	components	/ shared	services	throughout	the	flow
REST	API Allows	clients	to	connect	to	pull	information,	change	behavior,	etc..
©	Hortonworks	Inc.	2011	–	2016.	All	Rights	ReservedX
Architecture
OS/Host
JVM
Flow	Controller
Web	Server
Processor	1 Extension	N
FlowFile

Repository
Content

Repository
Provenance

Repository
Local	Storage
Standalone
Cluster
27 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
NiFi	Architecture	– Repositories	- Pass	by	reference
FlowFile Content Provenance
F1à C1 C1 P1à F1
Excerpt	of	demo	flow… What’s	happening	inside	the	repositories…
BEFORE
AFTER
F2à C1 C1 P3à F2 – Clone	(F1)
F1à C1 P2à F1 – Route	
P1à F1 – Create
28 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
NiFi	Architecture	– Repositories	– Copy	on	Write
FlowFile Content Provenance
F1à C1 C1 P1à F1	- CREATE
Excerpt	of	demo	flow… What’s	happening	inside	the	repositories…
BEFORE
AFTER
F1à C1
F1.1à C2 C2	(encrypted)
C1	(plaintext)
P2à F1.1 - MODIFY
P1à F1	- CREATE
29 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Agenda
What	is	dataflow	and	what	are	the	challenges?
Apache	NiFi
Architecture
Demo
Community
30 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Learn,	Share	at	Birds	of	a	Feather
IOT,	STREAMING	&	DATA	FLOW
Thursday,	April	6
5:50	pm,	Room	5
31 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Why	NiFi?
à Moving	data	is	multifaceted	in	its	challenges	and	these	are	present	in	different	contexts	
at	varying	scopes
– Think	of	our	courier	example	and	organizations	like	it:	inter	vs intra,	domestically,	internationally
à Provide	common	tooling	and	extensions	that	are	commonly	needed	but	be	flexible	for	
extension
– Leverage	existing	libraries	and	expansive	Java	ecosystem	for	functionality
– Allow	organizations	to	integrate	with	their	existing	infrastructure	
à Empower	folks	managing	your	infrastructure	to	make	changes	and	reason	about	issues	
that	are	occurring
– Data	Provenance	to	show	context	and	data’s	journey
– User	Interface/Experience	a	key	component
32 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Learn	more	and	join	us!
Apache NiFi site
http://nifi.apache.org
Subproject MiNiFi site
http://nifi.apache.org/minifi/
Subscribe to and collaborate at
dev@nifi.apache.org
users@nifi.apache.org
Submit Ideas or Issues
https://issues.apache.org/jira/browse/NIFI
Follow us on Twitter
@apachenifi
33 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Our	Lab	for	Today
à We	will	be	exploring	some	examples	to	work	through	creating	a	dataflow	with	Apache	
NiFi
à Use	Case:			An	urban	planning	board	is	evaluating	the	need	for	a	new	highway,	
dependent	on	current	traffic	patterns,	particularly	as	other	roadwork	initiatives	are	
under	way.	Integrating	live	data	poses	a	problem	because	traffic	analysis	has	
traditionally	been	done	using	historical,	aggregated	traffic	counts.	To	improve	traffic	
analysis,	the	city	planner	wants	to	leverage	real-time	data	to	get	a	deeper	understanding	
of	traffic	patterns.	NiFi was	selected	for	for	this	real-time	data	integration.
à Labs	are	available	at	http://tinyurl.com/nificrashcourse
34 ©	Hortonworks	Inc.	2011	– 2016.	All	Rights	Reserved
Thank	You

More Related Content

What's hot (20)

PDF
Introduction to Apache NiFi dws19 DWS - DC 2019
Timothy Spann
 
PPTX
Apache NiFi Crash Course Intro
DataWorks Summit/Hadoop Summit
 
PDF
Data ingestion and distribution with apache NiFi
Lev Brailovskiy
 
PDF
Nifi workshop
Yifeng Jiang
 
PDF
Running Apache NiFi with Apache Spark : Integration Options
Timothy Spann
 
PDF
Apache Flink internals
Kostas Tzoumas
 
PPTX
Hive 3 - a new horizon
Thejas Nair
 
PDF
How Uber scaled its Real Time Infrastructure to Trillion events per day
DataWorks Summit
 
PPTX
Apache Flink and what it is used for
Aljoscha Krettek
 
PPTX
Flexible and Real-Time Stream Processing with Apache Flink
DataWorks Summit
 
PPTX
File Format Benchmark - Avro, JSON, ORC & Parquet
DataWorks Summit/Hadoop Summit
 
PPTX
Introduction to Apache ZooKeeper
Saurav Haloi
 
PDF
Apache Kafka Architecture & Fundamentals Explained
confluent
 
PDF
Introduction to data flow management using apache nifi
Anshuman Ghosh
 
PPTX
Apache Spark Architecture
Alexey Grishchenko
 
PDF
Introduction to Apache Flink
datamantra
 
PDF
Designing Apache Hudi for Incremental Processing With Vinoth Chandar and Etha...
HostedbyConfluent
 
PPTX
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Flink Forward
 
PPTX
The columnar roadmap: Apache Parquet and Apache Arrow
DataWorks Summit
 
PDF
Real time stock processing with apache nifi, apache flink and apache kafka
Timothy Spann
 
Introduction to Apache NiFi dws19 DWS - DC 2019
Timothy Spann
 
Apache NiFi Crash Course Intro
DataWorks Summit/Hadoop Summit
 
Data ingestion and distribution with apache NiFi
Lev Brailovskiy
 
Nifi workshop
Yifeng Jiang
 
Running Apache NiFi with Apache Spark : Integration Options
Timothy Spann
 
Apache Flink internals
Kostas Tzoumas
 
Hive 3 - a new horizon
Thejas Nair
 
How Uber scaled its Real Time Infrastructure to Trillion events per day
DataWorks Summit
 
Apache Flink and what it is used for
Aljoscha Krettek
 
Flexible and Real-Time Stream Processing with Apache Flink
DataWorks Summit
 
File Format Benchmark - Avro, JSON, ORC & Parquet
DataWorks Summit/Hadoop Summit
 
Introduction to Apache ZooKeeper
Saurav Haloi
 
Apache Kafka Architecture & Fundamentals Explained
confluent
 
Introduction to data flow management using apache nifi
Anshuman Ghosh
 
Apache Spark Architecture
Alexey Grishchenko
 
Introduction to Apache Flink
datamantra
 
Designing Apache Hudi for Incremental Processing With Vinoth Chandar and Etha...
HostedbyConfluent
 
Squirreling Away $640 Billion: How Stripe Leverages Flink for Change Data Cap...
Flink Forward
 
The columnar roadmap: Apache Parquet and Apache Arrow
DataWorks Summit
 
Real time stock processing with apache nifi, apache flink and apache kafka
Timothy Spann
 

Similar to Dataflow with Apache NiFi (20)

PDF
Apache NiFi Crash Course San Jose Hadoop Summit
Daniel Madrigal
 
PDF
Dataflow with Apache NiFi - Crash Course - HS16SJ
DataWorks Summit/Hadoop Summit
 
PPTX
Hadoop Summit Tokyo Apache NiFi Crash Course
DataWorks Summit/Hadoop Summit
 
PPTX
Apache NiFi Crash Course - San Jose Hadoop Summit
Aldrin Piri
 
PDF
Apache Nifi Crash Course
DataWorks Summit
 
PPTX
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Aldrin Piri
 
PDF
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
DataWorks Summit
 
PDF
Apache Nifi Crash Course
DataWorks Summit
 
PPTX
Connecting the Drops with Apache NiFi & Apache MiNiFi
DataWorks Summit
 
PDF
Intelligently collecting data at the edge—intro to Apache MiNiFi
DataWorks Summit
 
PPTX
The Avant-garde of Apache NiFi
DataWorks Summit/Hadoop Summit
 
PPTX
The Avant-garde of Apache NiFi
Joe Percivall
 
PPTX
State of the Apache NiFi Ecosystem & Community
Accumulo Summit
 
PPTX
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks
 
PPTX
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Haimo Liu
 
PDF
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
PDF
Apache NiFi - Flow Based Programming Meetup
Joseph Witt
 
PDF
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
Mats Johansson
 
PDF
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
PDF
Taking DataFlow Management to the Edge with Apache NiFi/MiNiFi
Bryan Bende
 
Apache NiFi Crash Course San Jose Hadoop Summit
Daniel Madrigal
 
Dataflow with Apache NiFi - Crash Course - HS16SJ
DataWorks Summit/Hadoop Summit
 
Hadoop Summit Tokyo Apache NiFi Crash Course
DataWorks Summit/Hadoop Summit
 
Apache NiFi Crash Course - San Jose Hadoop Summit
Aldrin Piri
 
Apache Nifi Crash Course
DataWorks Summit
 
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Aldrin Piri
 
Intelligently Collecting Data at the Edge - Intro to Apache MiNiFi
DataWorks Summit
 
Apache Nifi Crash Course
DataWorks Summit
 
Connecting the Drops with Apache NiFi & Apache MiNiFi
DataWorks Summit
 
Intelligently collecting data at the edge—intro to Apache MiNiFi
DataWorks Summit
 
The Avant-garde of Apache NiFi
DataWorks Summit/Hadoop Summit
 
The Avant-garde of Apache NiFi
Joe Percivall
 
State of the Apache NiFi Ecosystem & Community
Accumulo Summit
 
Hortonworks Data in Motion Webinar Series - Part 1
Hortonworks
 
Harnessing Data-in-Motion with HDF 2.0, introduction to Apache NIFI/MINIFI
Haimo Liu
 
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
Apache NiFi - Flow Based Programming Meetup
Joseph Witt
 
Hortonworks DataFlow & Apache Nifi @Oslo Hadoop Big Data
Mats Johansson
 
Dataflow Management From Edge to Core with Apache NiFi
DataWorks Summit
 
Taking DataFlow Management to the Edge with Apache NiFi/MiNiFi
Bryan Bende
 
Ad

More from DataWorks Summit/Hadoop Summit (20)

PPT
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
PPT
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
PDF
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
PDF
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
PDF
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
PDF
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
PDF
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
PPTX
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
PPTX
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
PDF
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
PPTX
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
PPTX
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
PPTX
HBase in Practice
DataWorks Summit/Hadoop Summit
 
PPTX
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
PDF
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
PPTX
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
PPTX
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 
PPTX
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
DataWorks Summit/Hadoop Summit
 
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
HBase in Practice
DataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
DataWorks Summit/Hadoop Summit
 
Ad

Recently uploaded (20)

PDF
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PDF
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
PDF
ICONIQ State of AI Report 2025 - The Builder's Playbook
Razin Mustafiz
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PPTX
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PPT
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
PDF
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Mastering Financial Management in Direct Selling
Epixel MLM Software
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
“Voice Interfaces on a Budget: Building Real-time Speech Recognition on Low-c...
Edge AI and Vision Alliance
 
ICONIQ State of AI Report 2025 - The Builder's Playbook
Razin Mustafiz
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
Seamless Tech Experiences Showcasing Cross-Platform App Design.pptx
presentifyai
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
Digital Circuits, important subject in CS
contactparinay1
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
Ericsson LTE presentation SEMINAR 2010.ppt
npat3
 
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 

Dataflow with Apache NiFi