SlideShare a Scribd company logo
1© Cloudera, Inc. All rights reserved.
How can Big Data enable
Analytics from the Cloud
Joel Roland, System Engineer, Cloudera
2© Cloudera, Inc. All rights reserved.
What’s Driving Hadoop to the Cloud?
Enterprise customers using cloud for big data analytics
Hadoop deployments in cloud are
accelerating:
● Executive mandate: minimize on-prem
datacenter footprint
● Perceived lower overall TCO
● Increased agility: end-user self-service
● Elasticity: optimize infrastructure usage
3© Cloudera, Inc. All rights reserved.
Workloads in the cloud
Only pay for what you need,
when you need it
▪ Transient clusters
▪ Elastic workload
▪ Object storage centric
▪ Cloud-native deployment
ETL/Modeling
(Data Engineering)
App Delivery
(Operational
Database)
Reduce Operating Costs New Insights, New Revenue Run Without Risk
BI/Analytics
(Analytic Database)
Explore and analyze all data,
wherever it lives
▪ Transient or Persistent clusters
▪ Sized to demand
▪ HDFS or object storage
▪ Lift-and-shift or cloud-native
deployment
Enterprise-grade to protect your
business, no matter what
▪ Fixed clusters
▪ Periodic sync
▪ All HDFS storage
▪ Lift-and-shift deployment
4© Cloudera, Inc. All rights reserved.
Embrace Transience for
Lower Costs
Decoupled Storage and
Compute for Elastic Scale
Patterns of Cloud-Native Applications
Flexibility, Self-Service Models, and New Cost Dynamics
Compartmentalize for
Greater Isolation
Object Store
COMPUTE
1hr
SPIN UP SPIN
DOWN
Object Store
5© Cloudera, Inc. All rights reserved.
Sample CDH in Cloud Architecture
Data
Sources
Real-Time
Serving
Kafka/
Flume
Spark
Streaming
HBase or
Impala/Kudu (beta)
Kafka
Application
S3
Hive/Spark/HoS
Impala
Analytics
Batch Data
Transformations
Streaming Architecture
6© Cloudera, Inc. All rights reserved.
Sample CDH in Cloud Architecture
Data
Sources
Real-Time
Serving
Kafka/
Flume
Spark
Streaming
HBase, or
Impala/Kudu (beta)
Kafka
Application
S3
Hive/Spark/HoS
Impala
Analytics
Batch Data
Transformations
Batch Analytics
7© Cloudera, Inc. All rights reserved.
Cloud Enabled by Cloudera Director
OPERATIONS
Cloudera Manager
Cloudera Director
• Cloudera Director is an integrated part of Cloudera
Enterprise; designed to enable organisations to deploy
Enterprise Grade Hadoop into the Cloud by making it
• Fast, Easy, Secure and Reliable
• Cloudera Director complements and extends the existing
capabilities of tools such as Cloudera Manager
• While enabling organisations to consume new usages
patterns as well as reduce time-to-value when deploying
Cloudera
8© Cloudera, Inc. All rights reserved.
Cloudera Director Benefits & Capabilities
Eliminate Vendor Lock-in
• Native support for multiple cloud-providers
(AWS, Google Cloud Compute & Microsoft Azure)
• Extend / Enable Hybrid cloud deployments using plugins for
VMWare
• You control what workloads run on which providers
avoiding single vendors lock in.
9© Cloudera, Inc. All rights reserved.
Cloudera Director Benefits & Capabilities
Simplify Cluster Lifecycle & Management
• Simple and Easy to use UI to manage,
spin up, scale, and spin down cluster.
• Can fully automate Cloud Deployments
at the click of a button
• Dynamic scaling for clusters
• Ability to clone clusters on-demand
• Allows you to define blueprints for
repeatable cloud deployments
10© Cloudera, Inc. All rights reserved.
Cloudera Director Benefits & Capabilities
Accelerating Time-to-Value with Enterprise-ready
security and administration
• Deploy clusters in as little as 15-30 minutes!
• Support for complex cluster topologies (eg HA)
• Deployed with compliance-ready security and
governance (eg Kerebos)
• Clusters Easily connect into Cloudera’s BDR (Backup
& Disaster Recovery Solution)
• Has contains an extensible Restful API for those
who want to script and automate cluster builds
11© Cloudera, Inc. All rights reserved.
In Summary
Cloudera Director makes your journey to the cloud:
• Fast
• Deploy clusters across multiple cloud providers in minutes vs days
• Easy
• As Cloudera Director is an integrated part of the Platform, it makes creating,
modifying and managing clusters in the cloud simple!
• Secure
• Deploy secure clusters out of the box
• Reliable
• Reduce risk or errors by using an automated deployment model
12© Cloudera, Inc. All rights reserved.
Demo
• Overview Web Interface
• Create New Cluster (via Console)
• Modify Existing Cluster (resize)
• Query Data on S3 (Customer Example)
• Execute ETL Workflow & Save data to S3
13© Cloudera, Inc. All rights reserved.
Crunching 1,000+ Business Metrics
per Customer with Sub-Second
Responses
•Enables granular targeting of
customers
•50% reduction in marketing cost
execution at one
•Stores & processes 1000s of
critical events at scale & low cost
•Provides flexibility, agility to
support customer needs with
Cloudera on Amazon Web
Services and on premises
CUSTOMER 360
Customer 360° in the
Cloud
14© Cloudera, Inc. All rights reserved.
Thanks! Questions?

More Related Content

What's hot (20)

PPTX
Self-service Big Data Analytics on Microsoft Azure
Cloudera, Inc.
 
PPTX
PaaS or Fail: Rule the Cloud with Altus
Cloudera, Inc.
 
PPTX
Get started with Cloudera's cyber solution
Cloudera, Inc.
 
PPTX
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Cloudera, Inc.
 
PPTX
Big data journey to the cloud rohit pujari 5.30.18
Cloudera, Inc.
 
PPTX
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
PPTX
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
Cloudera, Inc.
 
PPTX
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Cloudera, Inc.
 
PPTX
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
Cloudera, Inc.
 
PPTX
Data Drive Applications_Webinar
Sean Spediacci
 
PPTX
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Cloudera, Inc.
 
PPTX
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
PPTX
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Cloudera, Inc.
 
PDF
Hadoop on Cloud: Why and How?
Cloudera, Inc.
 
PPTX
Kudu Forrester Webinar
Cloudera, Inc.
 
PPTX
Unlock Hadoop Success with Cloudera Navigator Optimizer
Cloudera, Inc.
 
PPTX
Secure Data - Why Encryption and Access Control are Game Changers
Cloudera, Inc.
 
PPTX
Consolidate your data marts for fast, flexible analytics 5.24.18
Cloudera, Inc.
 
PPTX
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Cloudera, Inc.
 
PPTX
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
Cloudera, Inc.
 
Self-service Big Data Analytics on Microsoft Azure
Cloudera, Inc.
 
PaaS or Fail: Rule the Cloud with Altus
Cloudera, Inc.
 
Get started with Cloudera's cyber solution
Cloudera, Inc.
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Cloudera, Inc.
 
Big data journey to the cloud rohit pujari 5.30.18
Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
What’s New in Cloudera Enterprise 6.0: The Inside Scoop 6.14.18
Cloudera, Inc.
 
Securing the Data Hub--Protecting your Customer IP (Technical Workshop)
Cloudera, Inc.
 
Standing Up an Effective Enterprise Data Hub -- Technology and Beyond
Cloudera, Inc.
 
Data Drive Applications_Webinar
Sean Spediacci
 
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
Part 2: Cloudera’s Operational Database: Unlocking New Benefits in the Cloud
Cloudera, Inc.
 
Hadoop on Cloud: Why and How?
Cloudera, Inc.
 
Kudu Forrester Webinar
Cloudera, Inc.
 
Unlock Hadoop Success with Cloudera Navigator Optimizer
Cloudera, Inc.
 
Secure Data - Why Encryption and Access Control are Game Changers
Cloudera, Inc.
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Cloudera, Inc.
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Cloudera, Inc.
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
Cloudera, Inc.
 

Viewers also liked (17)

PPTX
The Vortex of Change - Digital Transformation (Presented by Intel)
Cloudera, Inc.
 
PPTX
Top 5 IoT Use Cases
Cloudera, Inc.
 
PPTX
Using Big Data to Transform Your Customer’s Experience - Part 1

Cloudera, Inc.
 
PPTX
Enabling the Connected Car Revolution

Cloudera, Inc.
 
PPTX
Analyzing Hadoop Data Using Sparklyr

Cloudera, Inc.
 
PPTX
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Cloudera, Inc.
 
PPTX
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
Cloudera, Inc.
 
PPTX
Part 1: Lambda Architectures: Simplified by Apache Kudu
Cloudera, Inc.
 
PPTX
Hadoop ppt
Aditya Jagtap
 
PDF
Big data processing using Hadoop with Cloudera Quickstart
IMC Institute
 
PPTX
The Impala Cookbook
Cloudera, Inc.
 
PDF
The Physical Interface
Josh Clark
 
PDF
Mobile-First SEO - The Marketers Edition #3XEDigital
Aleyda Solís
 
PDF
3 Things Every Sales Team Needs to Be Thinking About in 2017
Drift
 
PPTX
Introduction to Spark: Data Analysis and Use Cases in Big Data
Jongwook Woo
 
PDF
Apache Spark Tutorial
Farzad Nozarian
 
PPTX
Modernizing Architecture for a Complete Data Strategy
Cloudera, Inc.
 
The Vortex of Change - Digital Transformation (Presented by Intel)
Cloudera, Inc.
 
Top 5 IoT Use Cases
Cloudera, Inc.
 
Using Big Data to Transform Your Customer’s Experience - Part 1

Cloudera, Inc.
 
Enabling the Connected Car Revolution

Cloudera, Inc.
 
Analyzing Hadoop Data Using Sparklyr

Cloudera, Inc.
 
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Cloudera, Inc.
 
Data Engineering: Elastic, Low-Cost Data Processing in the Cloud
Cloudera, Inc.
 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Cloudera, Inc.
 
Hadoop ppt
Aditya Jagtap
 
Big data processing using Hadoop with Cloudera Quickstart
IMC Institute
 
The Impala Cookbook
Cloudera, Inc.
 
The Physical Interface
Josh Clark
 
Mobile-First SEO - The Marketers Edition #3XEDigital
Aleyda Solís
 
3 Things Every Sales Team Needs to Be Thinking About in 2017
Drift
 
Introduction to Spark: Data Analysis and Use Cases in Big Data
Jongwook Woo
 
Apache Spark Tutorial
Farzad Nozarian
 
Modernizing Architecture for a Complete Data Strategy
Cloudera, Inc.
 
Ad

Similar to How Big Data Can Enable Analytics from the Cloud (Technical Workshop) (20)

PPTX
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera, Inc.
 
PDF
Introducing Cloudera Director at Big Data Bash
Andrei Savu
 
PDF
One Hadoop, Multiple Clouds
Cloudera, Inc.
 
PDF
One Hadoop, Multiple Clouds - NYC Big Data Meetup
Andrei Savu
 
PDF
Cloudera GoDataFest Deploying Cloudera in the Cloud
GoDataDriven
 
PPTX
Cloudera - The Modern Platform for Analytics
Cloudera, Inc.
 
PPTX
Strata + Hadoop World 2012: Taming the Elephant - Learn how Monsanto manages ...
Cloudera, Inc.
 
PPTX
Turning Data into Business Value with a Modern Data Platform
Cloudera, Inc.
 
PPTX
Cloudera Altus: Big Data in der Cloud einfach gemacht
Cloudera, Inc.
 
PPTX
Five Tips for Running Cloudera on AWS
Cloudera, Inc.
 
PPTX
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
Cloudera, Inc.
 
PPTX
Intel and Cloudera: Accelerating Enterprise Big Data Success
Cloudera, Inc.
 
PPTX
A deep dive into running data analytic workloads in the cloud
Cloudera, Inc.
 
PPTX
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
PPTX
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB
 
PPTX
Automating Cloud Cluster Deployment: Beyond the Book
Bill Havanki
 
PPTX
Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub
Cloudera, Inc.
 
PPTX
Cloudera training: secure your Cloudera cluster
Cloudera, Inc.
 
PPTX
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Stefan Lipp
 
PPTX
Chicago Data Summit: Cloudera's Distribution including Apache Hadoop & Cloude...
Cloudera, Inc.
 
Cloudera Director: Unlock the Full Potential of Hadoop in the Cloud
Cloudera, Inc.
 
Introducing Cloudera Director at Big Data Bash
Andrei Savu
 
One Hadoop, Multiple Clouds
Cloudera, Inc.
 
One Hadoop, Multiple Clouds - NYC Big Data Meetup
Andrei Savu
 
Cloudera GoDataFest Deploying Cloudera in the Cloud
GoDataDriven
 
Cloudera - The Modern Platform for Analytics
Cloudera, Inc.
 
Strata + Hadoop World 2012: Taming the Elephant - Learn how Monsanto manages ...
Cloudera, Inc.
 
Turning Data into Business Value with a Modern Data Platform
Cloudera, Inc.
 
Cloudera Altus: Big Data in der Cloud einfach gemacht
Cloudera, Inc.
 
Five Tips for Running Cloudera on AWS
Cloudera, Inc.
 
Hadoop Essentials -- The What, Why and How to Meet Agency Objectives
Cloudera, Inc.
 
Intel and Cloudera: Accelerating Enterprise Big Data Success
Cloudera, Inc.
 
A deep dive into running data analytic workloads in the cloud
Cloudera, Inc.
 
High-Performance Analytics in the Cloud with Apache Impala
Cloudera, Inc.
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB
 
Automating Cloud Cluster Deployment: Beyond the Book
Bill Havanki
 
Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub
Cloudera, Inc.
 
Cloudera training: secure your Cloudera cluster
Cloudera, Inc.
 
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Stefan Lipp
 
Chicago Data Summit: Cloudera's Distribution including Apache Hadoop & Cloude...
Cloudera, Inc.
 
Ad

More from Cloudera, Inc. (20)

PPTX
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
 
PPTX
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
 
PPTX
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
 
PPTX
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
 
PPTX
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
 
PPTX
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
 
PPTX
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
 
PPTX
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
 
PPTX
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
 
PPTX
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
 
PPTX
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
 
PPTX
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
 
PPTX
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
 
PPTX
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
 
PPTX
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
PPTX
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
 
PPTX
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
 
PPTX
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
 
PPTX
Cloudera SDX
Cloudera, Inc.
 
PPTX
Introducing Workload XM 8.7.18
Cloudera, Inc.
 
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
 
Cloudera SDX
Cloudera, Inc.
 
Introducing Workload XM 8.7.18
Cloudera, Inc.
 

Recently uploaded (20)

PPTX
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
PPTX
Java Native Memory Leaks: The Hidden Villain Behind JVM Performance Issues
Tier1 app
 
PDF
Revenue streams of the Wazirx clone script.pdf
aaronjeffray
 
PPTX
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
PPTX
Fundamentals_of_Microservices_Architecture.pptx
MuhammadUzair504018
 
PPTX
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
PPTX
Human Resources Information System (HRIS)
Amity University, Patna
 
PPTX
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
PDF
HiHelloHR – Simplify HR Operations for Modern Workplaces
HiHelloHR
 
PPTX
Engineering the Java Web Application (MVC)
abhishekoza1981
 
PPTX
An Introduction to ZAP by Checkmarx - Official Version
Simon Bennetts
 
PDF
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
PPTX
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pptx
Varsha Nayak
 
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
PDF
Digger Solo: Semantic search and maps for your local files
seanpedersen96
 
PDF
Executive Business Intelligence Dashboards
vandeslie24
 
PPTX
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
PDF
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Imma Valls Bernaus
 
PPTX
Tally software_Introduction_Presentation
AditiBansal54083
 
PDF
Thread In Android-Mastering Concurrency for Responsive Apps.pdf
Nabin Dhakal
 
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
Java Native Memory Leaks: The Hidden Villain Behind JVM Performance Issues
Tier1 app
 
Revenue streams of the Wazirx clone script.pdf
aaronjeffray
 
Writing Better Code - Helping Developers make Decisions.pptx
Lorraine Steyn
 
Fundamentals_of_Microservices_Architecture.pptx
MuhammadUzair504018
 
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
Human Resources Information System (HRIS)
Amity University, Patna
 
Feb 2021 Cohesity first pitch presentation.pptx
enginsayin1
 
HiHelloHR – Simplify HR Operations for Modern Workplaces
HiHelloHR
 
Engineering the Java Web Application (MVC)
abhishekoza1981
 
An Introduction to ZAP by Checkmarx - Official Version
Simon Bennetts
 
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pptx
Varsha Nayak
 
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Safe Software
 
Digger Solo: Semantic search and maps for your local files
seanpedersen96
 
Executive Business Intelligence Dashboards
vandeslie24
 
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Imma Valls Bernaus
 
Tally software_Introduction_Presentation
AditiBansal54083
 
Thread In Android-Mastering Concurrency for Responsive Apps.pdf
Nabin Dhakal
 

How Big Data Can Enable Analytics from the Cloud (Technical Workshop)

  • 1. 1© Cloudera, Inc. All rights reserved. How can Big Data enable Analytics from the Cloud Joel Roland, System Engineer, Cloudera
  • 2. 2© Cloudera, Inc. All rights reserved. What’s Driving Hadoop to the Cloud? Enterprise customers using cloud for big data analytics Hadoop deployments in cloud are accelerating: ● Executive mandate: minimize on-prem datacenter footprint ● Perceived lower overall TCO ● Increased agility: end-user self-service ● Elasticity: optimize infrastructure usage
  • 3. 3© Cloudera, Inc. All rights reserved. Workloads in the cloud Only pay for what you need, when you need it ▪ Transient clusters ▪ Elastic workload ▪ Object storage centric ▪ Cloud-native deployment ETL/Modeling (Data Engineering) App Delivery (Operational Database) Reduce Operating Costs New Insights, New Revenue Run Without Risk BI/Analytics (Analytic Database) Explore and analyze all data, wherever it lives ▪ Transient or Persistent clusters ▪ Sized to demand ▪ HDFS or object storage ▪ Lift-and-shift or cloud-native deployment Enterprise-grade to protect your business, no matter what ▪ Fixed clusters ▪ Periodic sync ▪ All HDFS storage ▪ Lift-and-shift deployment
  • 4. 4© Cloudera, Inc. All rights reserved. Embrace Transience for Lower Costs Decoupled Storage and Compute for Elastic Scale Patterns of Cloud-Native Applications Flexibility, Self-Service Models, and New Cost Dynamics Compartmentalize for Greater Isolation Object Store COMPUTE 1hr SPIN UP SPIN DOWN Object Store
  • 5. 5© Cloudera, Inc. All rights reserved. Sample CDH in Cloud Architecture Data Sources Real-Time Serving Kafka/ Flume Spark Streaming HBase or Impala/Kudu (beta) Kafka Application S3 Hive/Spark/HoS Impala Analytics Batch Data Transformations Streaming Architecture
  • 6. 6© Cloudera, Inc. All rights reserved. Sample CDH in Cloud Architecture Data Sources Real-Time Serving Kafka/ Flume Spark Streaming HBase, or Impala/Kudu (beta) Kafka Application S3 Hive/Spark/HoS Impala Analytics Batch Data Transformations Batch Analytics
  • 7. 7© Cloudera, Inc. All rights reserved. Cloud Enabled by Cloudera Director OPERATIONS Cloudera Manager Cloudera Director • Cloudera Director is an integrated part of Cloudera Enterprise; designed to enable organisations to deploy Enterprise Grade Hadoop into the Cloud by making it • Fast, Easy, Secure and Reliable • Cloudera Director complements and extends the existing capabilities of tools such as Cloudera Manager • While enabling organisations to consume new usages patterns as well as reduce time-to-value when deploying Cloudera
  • 8. 8© Cloudera, Inc. All rights reserved. Cloudera Director Benefits & Capabilities Eliminate Vendor Lock-in • Native support for multiple cloud-providers (AWS, Google Cloud Compute & Microsoft Azure) • Extend / Enable Hybrid cloud deployments using plugins for VMWare • You control what workloads run on which providers avoiding single vendors lock in.
  • 9. 9© Cloudera, Inc. All rights reserved. Cloudera Director Benefits & Capabilities Simplify Cluster Lifecycle & Management • Simple and Easy to use UI to manage, spin up, scale, and spin down cluster. • Can fully automate Cloud Deployments at the click of a button • Dynamic scaling for clusters • Ability to clone clusters on-demand • Allows you to define blueprints for repeatable cloud deployments
  • 10. 10© Cloudera, Inc. All rights reserved. Cloudera Director Benefits & Capabilities Accelerating Time-to-Value with Enterprise-ready security and administration • Deploy clusters in as little as 15-30 minutes! • Support for complex cluster topologies (eg HA) • Deployed with compliance-ready security and governance (eg Kerebos) • Clusters Easily connect into Cloudera’s BDR (Backup & Disaster Recovery Solution) • Has contains an extensible Restful API for those who want to script and automate cluster builds
  • 11. 11© Cloudera, Inc. All rights reserved. In Summary Cloudera Director makes your journey to the cloud: • Fast • Deploy clusters across multiple cloud providers in minutes vs days • Easy • As Cloudera Director is an integrated part of the Platform, it makes creating, modifying and managing clusters in the cloud simple! • Secure • Deploy secure clusters out of the box • Reliable • Reduce risk or errors by using an automated deployment model
  • 12. 12© Cloudera, Inc. All rights reserved. Demo • Overview Web Interface • Create New Cluster (via Console) • Modify Existing Cluster (resize) • Query Data on S3 (Customer Example) • Execute ETL Workflow & Save data to S3
  • 13. 13© Cloudera, Inc. All rights reserved. Crunching 1,000+ Business Metrics per Customer with Sub-Second Responses •Enables granular targeting of customers •50% reduction in marketing cost execution at one •Stores & processes 1000s of critical events at scale & low cost •Provides flexibility, agility to support customer needs with Cloudera on Amazon Web Services and on premises CUSTOMER 360 Customer 360° in the Cloud
  • 14. 14© Cloudera, Inc. All rights reserved. Thanks! Questions?