© Cloudera, Inc. All rights reserved.
Data Driven With the Cloudera Data
Warehouse
David Dichmann | ddichmann@cloudera.com
What’s YOUR Data Strategy?
OUTCOMES
• Curated Data and Agile Discovery
with HIPAA compliance
• Accelerated new Drug
Development
NEW PRODUCT DEVELOPMENT
GLOBAL
PHARMACEUTICAL
Use Cases
Users
Fewer Silos
Diverse Data
OUTCOMES
• LoB Data Analysts access all data
• Saved $4M+ in deposit fraud
FRAUD PREVENTION
LARGE NORTH
AMERICAN BANK
Terabytes
Users
Databases
Queries / Month
OUTCOMES
• $10 M new revenue
• $30 M+ price optimization
• $100K+ weather correlation
BUSINESS OPTIMIZATION
MAJOR TELCO
MANUFACTURER
Query
Responses
New Sources
Data Sets
Users
Quickly enable business analytics by sharing petabytes of verified data
across thousands of users while surpassing demands of SLAs and costs
Massive, Diverse Data
Security, Governance
User Profiles, Use Cases
Self Service Everything
Automation, Consistency
Experiments, Time To Value
             TRADITIONAL    CHANGES        MODERN
Users        Internal       Transparency   +External
Curation     Planned ETLs   Flexibility    On-Demand ELTs
Exploration  Constrained    Self-Service   Freeform
Volume       Finite         Correlations   Virtually Infinite
TRADITIONAL DATA WAREHOUSE
Structured Data
Sources
(ERP, CRM, SCM)
Transformations
EDW
Advanced
Analytics
Dashboards
Ad Hoc
Canned
Reports
Staging
Data Marts
Several Months
Master Schema
ETL
ODS
Struggle to handle volume
and variety
Limited
Access
MODERN DATA WAREHOUSE
Advanced
Analytics
Dashboards
Ad Hoc
Canned
Reports
Data Store
Within Days
Data Marts
Ingest & Store All Data
At Scale
Self-service /
On-demand
Variety of Data
Sources/Types
MODERN DATA WAREHOUSE
Fixed
Reports
DATA SOURCES
Flexible
Reporting
Advanced
Analytics
Self-Service
BI/Ad Hoc
Dashboards/
Analytic Apps
EDW
COMPLEMENTING A TRADITIONAL EDW
CLOUD NATIVE WITH ALTUS DW
Multi-Cloud PaaS for Agile Analytics
● Quick time to value for analytics - no
software or clusters to manage
● Bring the warehouse to the data with
zero copy simplicity
● Use your security policies with your
data - no proprietary stacks
● Apply enterprise governance to
transient workloads
● Shared data experience with SDX, for
analytic workloads
● Optimized for Azure & AWS
DATA WAREHOUSE
GOVERNANCE
SECURITY
ALTUS CONTROL
PLANE
LIFECYCLE
MANAGEMENT
MULTI-CLOUD
Amazon
S3
Microsoft
ADLS
Traditional Data
Warehouse Optimization
Transform Status Quo
TRANSFORMATIONAL AREAS OF DATA WAREHOUSING
Operations & Events Data
Warehouse
Run Business Better
Research & Discovery
Data Warehouse
Change the Culture
DRIVERS FOR MODERNIZATION
Deeper Business Insights
Grow
• Customer Sentiment
• Fault Prevention
• Improve Product Quality
• New Revenue Streams
Experimentation and
collaboration at scale
Protect
• Proactive Fraud Prevention
• Keep up with Regulatory
Compliance
• Preempt Cyberthreats
Real-time response on
massive data volume and
variety
Connect
• Improve Operational
Efficiency
• Support Internet of Things
(IoT)
New analytics techniques
democratized to all users
CHALLENGES OF A MODERN DATA WAREHOUSE
Extreme Speed and Scale
More Data
• Massive amounts handled
faster at scale
• More variety from new
sources (social media, IoT)
• Insight within minutes of
new data arrival
Performance and
flexibility at scale
More Workloads
• 100s of production-grade
deployments
• Enterprise-grade
dependability
• Strict security and
governance
On-demand scale out,
discovery, collaboration
More People
• 1,000s of new users and
new user types
• 1,000s of new use cases
• All skill levels: Analytics,
Data Science, and Machine
Learning
All workloads with a
shared data experience
Optimize Core
Processes
● Versatile Solution
● Broaden Data Reach
● Reduce IT Burden or Costs
Dynamic
Consumption
● Transient, Short-lived, Long-lived
● Public, Private, Hybrid Multi-Cloud
● Adaptive Compute & Storage
Self-Service
Everything
● Resource Provisioning
● Workload Development
● Optimizing & Troubleshooting
CLOUDERA MODERN DATA WAREHOUSE
Optimize Processes, Consumption and Costs
https://www.cloudera.com/about/customers/xl-axiata.html
https://blog.cloudera.com/blog/2018/03/automated-provisioning-of-cdh-in-the-cloud-with-cloudera-director-and-ansible/
https://www.cloudera.com/about/customers/komatsu-mining.html
Financial Services Telecom Government Healthcare Manufacturing
Customer 360
Personalized Medicine
Supply Chain Analysis
Operational Efficiencies
Network Quality Analysis
Equipment Health (IoT)
Fraud
Compliance
Cyber Threat Analysis
Regulatory Reporting
TOP 10 DATA WAREHOUSE USE CASES BY INDUSTRY
GROW
CONNECT
PROTECT
A MODERN DATA WAREHOUSE FROM CLOUDERA
HYBRID
Storage
Preferred BI & ELT Tools
Hue Analytic Workbench,
Superset Dashboards, CDSW
Workload XM,
Data Analytics Studio
Navigator & Sentry,
Atlas & Ranger
Impala / Hive LLAP
Query Engine
Hive on Tez / Spark
ELT Processing
KUDU | HDFS | Druid
Local Storage
AWS S3 | ADLS
Object Storage
Shared Data Experience (SDX)
Optimized File Formats
(ORC, Parquet, Avro, JSON)
Solr
Search Analytics
Cloudera Manager,
Ambari, Altus, Data Plane
HYBRID
Controls
HYBRID
Compute
HYBRID
Storage
HYBRID
Reporting
EXTREME SPEED & SCALE
Fastest ELT at Scale
for Data Engineers
● Fast data with distributed, in-memory
processing
● Curated data, metadata instantly
available
Fastest Self-Service BI at Scale
for Analysts & Developers
● Interactive multi-user queries without rigid
modeling for exploration
● Elastic scalability for more users/data
Impala
LLAP
EXTENSIVE PARTNER ECOSYSTEM
System
Integrators
ISV IHV
Alliances
Cloud
Alliances
OEM
Alliances
Market Expansion
CLOUDERA DW - PARTING THOUGHTS
Hybrid Optimized
Shared Data Experience
Performance @Scale
Shared Data
Exponential Use Cases, Successful Outcomes
THANK YOU

Data Driven With the Cloudera Modern Data Warehouse 3.19.19
