SlideShare a Scribd company logo
Hadoop: What’s Next?
Mike Olson
Reflections On You
12+ months using on average
114.5TB average size
66 average nodes in Use
500+ certified on Hadoop in 1 year
60+PB Total
Data from pre-conference survey
Cloudera - Mike Olson - Hadoop World 2010
Immutable Law of Data
RDBMS
Hadoop
Volume, Variety, Velocity increase
Immutable Law of Data
RDBMS
Hadoop
Volume, Variety, Velocity increase
Geopbytes
Brontobytes
Yottabytes
Zettabytes
Exabytes
Terabytes
Linked
Complex
Unstructured
Pre-relational
Raw
Detailed
Heterogeneous
Dirty
Graphs
Large
Schemaless
Hadoop Was Built for Data.
Proven at Scale
Room to Grow
Open Source Wins.
Hadoop: The Core of a Platform
A Platform Built by You
Hue Hue SDK
OozieOozie
HBaseFlume, Sqoop
Zookeeper / Avro
Hive
Pig/
Hive
The Vendor Ecosystem
A Platform Enabling Applications…
Query &
Reporting
Complex
ETL
Trade
Compliance
POS
Analysis
Search
Quality
Click Stream
Analysis
Machine
Learning
Graph Analysis
And
More…
Fraud
Detection
Archive
Scientific Security
Solving Critical Business Problems
• Modeling true risk
• Customer churn
analysis
• Recommendation
engine
• Ad targeting
• PoS transaction analysis
• Analyzing network data
to predict failure
• Threat analysis
• Trade surveillance
• Search quality
• Data “sandbox”
• Capture critical IT data
• Monitoring usage
• Driving bottom line value
• Risk analysis
• Customer insight
• Drive growth
• Customer intimacy
• Precision targeting
• Driving top line growth
So Much To See Today!
• Optimizing search
• Advanced analytics in the Army
• Using Flume &Hive for log data
• Analyzing VOIP data with R
What’s Next?
Market
• Adoption
• Agility
• Flexibility
Technology
• Accelerated innovation
from community
• More tools e.g., monitoring
• More automation
• More stability
• More interfaces
• At the core of the open source platform for
data
• Four years old and going strong!
Cloudera - Mike Olson - Hadoop World 2010
Organizational Impact
• More knobs and dials
• Fine grain control
• Achieve previously impossible /
impractical
• Save money
• Save time
• Greater flexibility with data
Copyright 2010 Cloudera Inc. All rights reserved
Hadoop World Keynote (NOTES)
• Themes
– Hadoop is already a big deal
• Keep in mind the why
• Solving real problems now
– It is about the platform with Hadoop at the
core
• Why
• Helps you profit
• More accessible now than ever, real people with
enterprise ops and enterprise skills, no longer the
exclusive demand of the PhDs
– What’s on the Horizon for Hadoop
Copyright 2010 Cloudera Inc. All rights reserved
Hadoop is Having a
Transformative Impact (notes)
• Continued growth and excitement
• Transformative to your career, your enterprise, your market
– Star maker
– Get ready for Hadoop being a big deal for your companies
– Your market – hyper personalization
– Use data to interact in a more customized fashion
– “It’s hard not to have a TB of data” – Mike
– Operability and SLAs for a critical enterprise platform
– Education and training
– A new stack for analytics (CEP (flume) CDH (Sqoop) dbms/BI)
• Future is now
– Use cases now and impact it is having and where it will be, look at
Facebook, Yahoo, eBay etc.
Copyright 2010 Cloudera Inc. All rights reserved
What is on the Horizon for
Hadoop (notes)
• Continued growth and excitement
• Transformative to your career, your enterprise, your market
– Star maker –
• good for your career, help make critical changes in the way customers are supported, major new business opportunities etc.
• Pull cloudera certification #’s
– Get ready for Hadoop being a big deal for your companies
• Enterprise will be more agile and able capture and analyze more data to better target ads, find fraud, etc.
• Agility – impacts the things that matter to you
• What’s happened before the transaction
– Your market – hyper personalization
• 100s’s of vertical apps to be created (developers are you listening?)
• Trend that crosses? Any other trend we can compare to? DBMS growth? Improvements in operations,
• How detailed sources have changed
• Devices, understanding how people interact with your business – retail, online entertainment, fin serv, government
– Use data to interact in a more customized fashion
– “It’s hard not to have a TB of data” – Mike
– Operability and SLAs for a critical enterprise platform
– Education and training
– A new stack for analytics (CEP (flume) CDH (sqoop) dbms/BI)
• Future is now
– Use cases now and impact it is having and where it will be, look at Facebook, Yahoo, eBay etc.
Copyright 2010 Cloudera Inc. All rights reserved
Emerging Importance
of Data Scientist
• Able to impact business at many
levels
• New conference focused data and
data related roles — O’Reilly
Strata Conference
Copyright 2010 Cloudera Inc. All rights reserved
Unprecedented Data Volume,
Velocity and Variety
Data Growth
Out Pacing
Processing Power
Organizations
Swamped and
Turning to Hadoop
61% CAGR
42% CAGR
Data
Transistors
Copyright 2010 Cloudera Inc. All rights reserved
Transforming Analytic
Requirements
• Insight into this data needs more than simple
tabular analysis
– More is needed for meaningful answers
• You can and will do deeper and more
introspective analysis
– Machine learning, natural language processing, clustering,
sophisticated statistical analysis, modeling and back testing
• Looking for patterns
– You can see patterns in lots of data that are invisible in less
data. You need pattern discovery tools
Copyright 2010 Cloudera Inc. All rights reserved
Hadoop: Already a Big Deal!!
Massive Adoption
Vibrant & Growing Community
100’s of PB Under Management
1000’s of Implementations
Benefitting From a Dynamic
OS Community
• Community around
Hadoop is proliferating
and expanding
• > ½ Hadoop sub-projects
promoted to TLPs
• Dozens of related projects
• 100’s of developers
& growing
Copyright 2010 Cloudera Inc. All rights reserved
Interest in Hadoop Has Exploded
More are looking for it
Leading analysts report
significant growth
in inquiries
Major increase
in coverage
Copyright 2010 Cloudera Inc. All rights reserved
A Data Management Platform
Applications
Copyright 2010 Cloudera Inc. All rights reserved
Market Impact
• Hyper personalization
• Extreme targeting
• Expand competitive advantages
• Better retention of customers
• Improved risk analysis

More Related Content

PDF
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
StampedeCon
 
PPTX
Managing Growing Transaction Volumes Using Hadoop
Arvind Purushothaman
 
PDF
Analyzing Unstructured Data in Hadoop Webinar
Datameer
 
PPTX
Predictive analytics from a to z
alpinedatalabs
 
PPT
Web analyticsandbigdata techweek2011
Raghu Kashyap
 
PDF
Introduction to Hadoop
POSSCON
 
PPT
Gartner peer forum sept 2011 orbitz
Raghu Kashyap
 
PPTX
Becoming Data-Driven Through Cultural Change
Cloudera, Inc.
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
StampedeCon
 
Managing Growing Transaction Volumes Using Hadoop
Arvind Purushothaman
 
Analyzing Unstructured Data in Hadoop Webinar
Datameer
 
Predictive analytics from a to z
alpinedatalabs
 
Web analyticsandbigdata techweek2011
Raghu Kashyap
 
Introduction to Hadoop
POSSCON
 
Gartner peer forum sept 2011 orbitz
Raghu Kashyap
 
Becoming Data-Driven Through Cultural Change
Cloudera, Inc.
 

What's hot (18)

PPTX
Benchmarking Digital Readiness: Moving at the Speed of the Market
Apigee | Google Cloud
 
PPTX
Latest corp big data and acme
hooduku
 
PPTX
Hooduku - Big data analytics - case study
Sudhi Seshachala
 
PDF
Best Practices for Big Data Analytics with Machine Learning by Datameer
Datameer
 
PPTX
Modernizing Architecture for a Complete Data Strategy
Cloudera, Inc.
 
PDF
The Emerging Data Lake IT Strategy
Thomas Kelly, PMP
 
PDF
The paradox of big data - dataiku / oxalide APEROTECH
Dataiku
 
PDF
Earley Executive Roundtable Summary - Data Analytics
Earley Information Science
 
PPTX
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
PPTX
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
Dataiku
 
PPTX
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
Dataiku
 
PDF
Customer Case Studies of Self-Service Big Data Analytics
Datameer
 
PPTX
Unlocking data science in the enterprise - with Oracle and Cloudera
Cloudera, Inc.
 
PDF
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
PPTX
Platfora Girl Geek Dinner
Platfora
 
PDF
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
PDF
Intro to Data Science on Hadoop
Caserta
 
PDF
Enabling digital business with governed data lake
Karan Sachdeva
 
Benchmarking Digital Readiness: Moving at the Speed of the Market
Apigee | Google Cloud
 
Latest corp big data and acme
hooduku
 
Hooduku - Big data analytics - case study
Sudhi Seshachala
 
Best Practices for Big Data Analytics with Machine Learning by Datameer
Datameer
 
Modernizing Architecture for a Complete Data Strategy
Cloudera, Inc.
 
The Emerging Data Lake IT Strategy
Thomas Kelly, PMP
 
The paradox of big data - dataiku / oxalide APEROTECH
Dataiku
 
Earley Executive Roundtable Summary - Data Analytics
Earley Information Science
 
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
Dataiku
 
BreizhJUG - Janvier 2014 - Big Data - Dataiku - Pages Jaunes
Dataiku
 
Customer Case Studies of Self-Service Big Data Analytics
Datameer
 
Unlocking data science in the enterprise - with Oracle and Cloudera
Cloudera, Inc.
 
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Platfora Girl Geek Dinner
Platfora
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
Intro to Data Science on Hadoop
Caserta
 
Enabling digital business with governed data lake
Karan Sachdeva
 
Ad

Viewers also liked (6)

PDF
Apache Hadoop Talk at QCon
Cloudera, Inc.
 
PDF
Apache Hadoop an Introduction - Todd Lipcon - Gluecon 2010
Cloudera, Inc.
 
PDF
EclipseCon Keynote: Apache Hadoop - An Introduction
Cloudera, Inc.
 
PDF
Hw09 Welcome To Hadoop World
Cloudera, Inc.
 
PPT
Module 3: Working with Jazz Source Control
IBM Rational software
 
PPTX
Data Science at Scale Using Apache Spark and Apache Hadoop
Cloudera, Inc.
 
Apache Hadoop Talk at QCon
Cloudera, Inc.
 
Apache Hadoop an Introduction - Todd Lipcon - Gluecon 2010
Cloudera, Inc.
 
EclipseCon Keynote: Apache Hadoop - An Introduction
Cloudera, Inc.
 
Hw09 Welcome To Hadoop World
Cloudera, Inc.
 
Module 3: Working with Jazz Source Control
IBM Rational software
 
Data Science at Scale Using Apache Spark and Apache Hadoop
Cloudera, Inc.
 
Ad

Similar to Cloudera - Mike Olson - Hadoop World 2010 (20)

PDF
Getting Started with Big Data for Business Managers
Datameer
 
PDF
Creating a Next-Generation Big Data Architecture
Perficient, Inc.
 
PDF
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
email2jl
 
PDF
How to implement Hadoop successfully
Adir Sharabi
 
PPTX
How to implement hadoop successfuly
Adir Sharabi
 
PDF
Are You Prepared For The Future Of Data Technologies?
Dell World
 
PPTX
Big-Data-Seminar-6-Aug-2014-Koenig
Manish Chopra
 
PPTX
Retail & CPG
Tata Consultancy Services
 
PDF
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
 
PDF
Complement Your Existing Data Warehouse with Big Data & Hadoop
Datameer
 
PDF
Incorporating the Data Lake into Your Analytic Architecture
Caserta
 
PDF
Big dataservicesatfidel
Fidel Softech P. Ltd
 
PDF
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
kcmallu
 
PPT
Data Discovery, Visualization, and Apache Hadoop
Hortonworks
 
PDF
Big Data & SQL: The On-Ramp to Hadoop
Inside Analysis
 
PPTX
Introduction To Big Data & Hadoop
Blackvard
 
PPT
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
PDF
02 a holistic approach to big data
Raul Chong
 
PDF
Operationalizing Data Analytics
VMware Tanzu
 
PDF
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 
Getting Started with Big Data for Business Managers
Datameer
 
Creating a Next-Generation Big Data Architecture
Perficient, Inc.
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
email2jl
 
How to implement Hadoop successfully
Adir Sharabi
 
How to implement hadoop successfuly
Adir Sharabi
 
Are You Prepared For The Future Of Data Technologies?
Dell World
 
Big-Data-Seminar-6-Aug-2014-Koenig
Manish Chopra
 
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
Hortonworks
 
Complement Your Existing Data Warehouse with Big Data & Hadoop
Datameer
 
Incorporating the Data Lake into Your Analytic Architecture
Caserta
 
Big dataservicesatfidel
Fidel Softech P. Ltd
 
The Practice of Big Data - The Hadoop ecosystem explained with usage scenarios
kcmallu
 
Data Discovery, Visualization, and Apache Hadoop
Hortonworks
 
Big Data & SQL: The On-Ramp to Hadoop
Inside Analysis
 
Introduction To Big Data & Hadoop
Blackvard
 
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
02 a holistic approach to big data
Raul Chong
 
Operationalizing Data Analytics
VMware Tanzu
 
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 

More from Cloudera, Inc. (20)

PPTX
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
 
PPTX
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
 
PPTX
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
 
PPTX
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
 
PPTX
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
 
PPTX
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
 
PPTX
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
 
PPTX
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
 
PPTX
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
 
PPTX
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
PPTX
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
 
PPTX
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
PPTX
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
 
PPTX
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
 
PPTX
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
 
PPTX
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
 
PPTX
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
PPTX
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
 
PPTX
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
 
PPTX
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
 
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
 

Recently uploaded (20)

PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PDF
Software Development Methodologies in 2025
KodekX
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PDF
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
PPTX
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
The Future of Artificial Intelligence (AI)
Mukul
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
Software Development Methodologies in 2025
KodekX
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Doc9.....................................
SofiaCollazos
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
AI-Cloud-Business-Management-Platforms-The-Key-to-Efficiency-Growth.pdf
Artjoker Software Development Company
 
Agile Chennai 18-19 July 2025 Ideathon | AI Powered Microfinance Literacy Gui...
AgileNetwork
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 

Cloudera - Mike Olson - Hadoop World 2010

  • 2. Reflections On You 12+ months using on average 114.5TB average size 66 average nodes in Use 500+ certified on Hadoop in 1 year 60+PB Total Data from pre-conference survey
  • 4. Immutable Law of Data RDBMS Hadoop Volume, Variety, Velocity increase
  • 5. Immutable Law of Data RDBMS Hadoop Volume, Variety, Velocity increase Geopbytes Brontobytes Yottabytes Zettabytes Exabytes Terabytes
  • 7. Hadoop Was Built for Data.
  • 11. Hadoop: The Core of a Platform
  • 12. A Platform Built by You Hue Hue SDK OozieOozie HBaseFlume, Sqoop Zookeeper / Avro Hive Pig/ Hive
  • 14. A Platform Enabling Applications… Query & Reporting Complex ETL Trade Compliance POS Analysis Search Quality Click Stream Analysis Machine Learning Graph Analysis And More… Fraud Detection Archive Scientific Security
  • 15. Solving Critical Business Problems • Modeling true risk • Customer churn analysis • Recommendation engine • Ad targeting • PoS transaction analysis • Analyzing network data to predict failure • Threat analysis • Trade surveillance • Search quality • Data “sandbox”
  • 16. • Capture critical IT data • Monitoring usage • Driving bottom line value
  • 17. • Risk analysis • Customer insight • Drive growth
  • 18. • Customer intimacy • Precision targeting • Driving top line growth
  • 19. So Much To See Today! • Optimizing search • Advanced analytics in the Army • Using Flume &Hive for log data • Analyzing VOIP data with R
  • 20. What’s Next? Market • Adoption • Agility • Flexibility Technology • Accelerated innovation from community • More tools e.g., monitoring • More automation • More stability • More interfaces
  • 21. • At the core of the open source platform for data • Four years old and going strong!
  • 23. Organizational Impact • More knobs and dials • Fine grain control • Achieve previously impossible / impractical • Save money • Save time • Greater flexibility with data Copyright 2010 Cloudera Inc. All rights reserved
  • 24. Hadoop World Keynote (NOTES) • Themes – Hadoop is already a big deal • Keep in mind the why • Solving real problems now – It is about the platform with Hadoop at the core • Why • Helps you profit • More accessible now than ever, real people with enterprise ops and enterprise skills, no longer the exclusive demand of the PhDs – What’s on the Horizon for Hadoop Copyright 2010 Cloudera Inc. All rights reserved
  • 25. Hadoop is Having a Transformative Impact (notes) • Continued growth and excitement • Transformative to your career, your enterprise, your market – Star maker – Get ready for Hadoop being a big deal for your companies – Your market – hyper personalization – Use data to interact in a more customized fashion – “It’s hard not to have a TB of data” – Mike – Operability and SLAs for a critical enterprise platform – Education and training – A new stack for analytics (CEP (flume) CDH (Sqoop) dbms/BI) • Future is now – Use cases now and impact it is having and where it will be, look at Facebook, Yahoo, eBay etc. Copyright 2010 Cloudera Inc. All rights reserved
  • 26. What is on the Horizon for Hadoop (notes) • Continued growth and excitement • Transformative to your career, your enterprise, your market – Star maker – • good for your career, help make critical changes in the way customers are supported, major new business opportunities etc. • Pull cloudera certification #’s – Get ready for Hadoop being a big deal for your companies • Enterprise will be more agile and able capture and analyze more data to better target ads, find fraud, etc. • Agility – impacts the things that matter to you • What’s happened before the transaction – Your market – hyper personalization • 100s’s of vertical apps to be created (developers are you listening?) • Trend that crosses? Any other trend we can compare to? DBMS growth? Improvements in operations, • How detailed sources have changed • Devices, understanding how people interact with your business – retail, online entertainment, fin serv, government – Use data to interact in a more customized fashion – “It’s hard not to have a TB of data” – Mike – Operability and SLAs for a critical enterprise platform – Education and training – A new stack for analytics (CEP (flume) CDH (sqoop) dbms/BI) • Future is now – Use cases now and impact it is having and where it will be, look at Facebook, Yahoo, eBay etc. Copyright 2010 Cloudera Inc. All rights reserved
  • 27. Emerging Importance of Data Scientist • Able to impact business at many levels • New conference focused data and data related roles — O’Reilly Strata Conference Copyright 2010 Cloudera Inc. All rights reserved
  • 28. Unprecedented Data Volume, Velocity and Variety Data Growth Out Pacing Processing Power Organizations Swamped and Turning to Hadoop 61% CAGR 42% CAGR Data Transistors Copyright 2010 Cloudera Inc. All rights reserved
  • 29. Transforming Analytic Requirements • Insight into this data needs more than simple tabular analysis – More is needed for meaningful answers • You can and will do deeper and more introspective analysis – Machine learning, natural language processing, clustering, sophisticated statistical analysis, modeling and back testing • Looking for patterns – You can see patterns in lots of data that are invisible in less data. You need pattern discovery tools Copyright 2010 Cloudera Inc. All rights reserved
  • 30. Hadoop: Already a Big Deal!! Massive Adoption Vibrant & Growing Community 100’s of PB Under Management 1000’s of Implementations
  • 31. Benefitting From a Dynamic OS Community • Community around Hadoop is proliferating and expanding • > ½ Hadoop sub-projects promoted to TLPs • Dozens of related projects • 100’s of developers & growing Copyright 2010 Cloudera Inc. All rights reserved
  • 32. Interest in Hadoop Has Exploded More are looking for it Leading analysts report significant growth in inquiries Major increase in coverage Copyright 2010 Cloudera Inc. All rights reserved
  • 33. A Data Management Platform Applications Copyright 2010 Cloudera Inc. All rights reserved
  • 34. Market Impact • Hyper personalization • Extreme targeting • Expand competitive advantages • Better retention of customers • Improved risk analysis