1 HADOOP SUMMIT 2016 | CONNECTING EVERYTHING,
Hadoop Summit 2016 – “connecting everything”
Version 07032016
Patrick de Vries
Data Process Systems
• Intro KPN movie
• https://youtu.be/JHDobTF4_Dc
1) Based on continuing operations.
4) Estimated avoided energy consumption by customers from the use of our ICT solutions
compared with KPN's own energy consumption. For more information, please see the annual report.
Presenter
Patrick de Vries (42) is an OSS manager (Demand) and IT
architect with more than 8 years of experience in mobile
networks. He has a passion for data management and
processing.
Maanplein 55 | 2500 GC The Hague | The Netherlands
Mobile: +31653102171 | E-mail: patrick.devries@kpn.com
https://nl.linkedin.com/in/patrick-de-vries-570a8469
Network
[Figure: timeline 2000 to 2020 (2000, 2010, 2012, 2014, 2016, 2018, 2020) showing GSM, GPRS, UMTS, HSDPA, LTE, WiFi, Public free, LoRa, 5G, M2M / data only]
Data
The hunger for telecommunication capacity continues to grow.
# IT systems TCO
We are at the brink of the “Fourth Industrial Revolution”.
Data flow with current architecture with Hortonworks
HAAS solution
Data flow (OOZIE)
• Counter ready at source
• E(T)L retrieve counters
• Counter ready at HDFS via KNOX
• Move data to processing folder
• Workbook parsing XMLs
• Export ready in target data structure
• KPI calculation workbook
• Reports with raw counters
• Aggregates and KPI in target data structure
Roadmap topics data flow
• From ETL to ETL
• From batch to streaming
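The parsing and KPI calculation steps in the flow above could look roughly like the sketch below. It is an illustration only: the XML layout, the counter names (`call_attempts`, `call_drops`) and the drop-rate KPI are hypothetical and are not taken from the KPN implementation.

```python
import xml.etree.ElementTree as ET

# Hypothetical counter file; the real vendor XML format is not shown in the slides.
SAMPLE_XML = """
<measurements cell="CELL_001" period="2016-03-07T10:00">
  <counter name="call_attempts">200</counter>
  <counter name="call_drops">4</counter>
</measurements>
""".strip()


def parse_counters(xml_text):
    """Parse raw counters from one XML document into a name -> value dict."""
    root = ET.fromstring(xml_text)
    return {c.get("name"): float(c.text) for c in root.iter("counter")}


def drop_rate_kpi(counters):
    """Example KPI (assumed): dropped calls as a percentage of call attempts."""
    attempts = counters.get("call_attempts", 0.0)
    if attempts == 0:
        return None  # KPI is undefined when there were no attempts
    return 100.0 * counters.get("call_drops", 0.0) / attempts


counters = parse_counters(SAMPLE_XML)
kpi = drop_rate_kpi(counters)
print(counters, kpi)
```

Keeping the raw counters next to the derived KPI mirrors the two outputs named in the flow: reports with raw counters, and aggregates and KPIs in the target data structure.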
Lessons learned
• What is your corporate policy on the use of data?
• Just because it’s legal doesn’t mean it’s right…
• Start small but plan for big
• Strong executive sponsorship required
• Multifaceted business case
• Leverage existing resources and investments where possible
• Prepare for cultural challenges (data ownership and tools)
Thank you for your attention
Connected by KPN.
We believe that communication technology enriches life. It is our
mission to provide safe, reliable and future-proof networks and
services, enabling people, businesses and organizations to be
connected anytime, anywhere, adding value
to their lives.
Connected by Hadoop
The TELCO Hadoop community will be a non-profit association
for service providers and their suppliers in the
telecommunications sector. Members share operational support
use cases and lessons learned to help the community grow
in data and processes.
Join the TELCO community
For more information send an e-mail to
Patrick.devries@kpn.com

Data Process Systems, connecting everything
