IS HADOOP THE DEMISE OF DATA WAREHOUSING?
THOUGHTS ON THE IMPACT OF HADOOP ON BI SYSTEMS AND DATA WAREHOUSING
Part of our 
BI Demystified Series
Copyright 2014 Senturus, Inc. All Rights Reserved
This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to 
www.senturus.com/resources/is-hadoop-the-demise-of-data-warehousing/
Hear the Recording
Resource Library 
Senturus’ whole purpose is to make you successful with Business Analytics. Thus, we offer a series of technology-neutral webinars, training on specific software, demonstrations, and no-holds-barred reviews of new software releases. We host dozens of live webinars every year and we offer a comprehensive library of recorded webinars, demos, white papers, presentations and case studies on our website--a wealth of learning resources. Most of our content is custom created and constantly updated, so visit us often to see what’s new in the industry. 
www.senturus.com/resources/ 
John Peterson CEO & Co-Founder 
Senturus 
Today’s Presenter 
With thanks to: 
Guy Wilnai, Sujee Maniyam and Knowledge @ Senturus
AGENDA
•INTRODUCTION
•THE DATA CHALLENGE
•WHAT IS HADOOP?
•ADVANTAGES & CHALLENGES
•IMPLICATIONS, PREDICTIONS & MISC. MUSINGS
•CONCLUSIONS
•Q&A
WHO WE ARE
SENTURUS INTRODUCTION
Our Team: 
Business depth combined with technical expertise. Former CFOs, CIOs, Controllers, Directors, BI Managers 
SENTURUS: BUSINESS ANALYTICS CONSULTANTS
Business Intelligence 
Enterprise Planning 
Predictive Analytics 
Creating Clarity from Chaos
A FEW OF OUR TEAM MEMBERS (FORMER ROLES)
Deep & Pragmatic Experience
•Former Head of BI / Lead Architect – VISA
•Former Chief BI Architect – Jamba Juice
•Former Head of BI – Dole
•Former Chief BI Architect – Cisco
•Former Chief BI Architect – Central Garden & Pet
•Former Head of BI – Experian
•Former Head of BI – Robert Half International
•Former Head of Training (IBM Cognos, Southern California)
•Former Controller – The GAP
•Two former CFOs
•Former Partner – PWC ($50 million+ projects)
•Several former Vice Presidents of Marketing, Sales & Manufacturing/Supply Chain
•Several former COOs
•Several former CIOs
•Average experience = over 20 years
750+ CLIENTS, 1600+ PROJECTS, 13+ YEARS 
Outpacing our ability to harness it 
THE DATA CHALLENGE
THE CHALLENGES (AND OPPORTUNITIES)
•Data volumes & velocity increasing exponentially
•Data types proliferating
•Rapid emergence of less structured (or unstructured) data sources
•Value of data increasing
•Traditional ETL is time-consuming and costly
•Traditional storage costs skyrocketing (not $/TB)
•Business users increasingly frustrated at not being able to get access to information
THE NET RESULT
Something is bound to happen
A WARNING ABOUT TODAY’S FOCUS
IS ABOUT:
Hadoop as a potential platform or tool for Business Analytics & DW
IS NOT ABOUT:
Yet another “How Big Data will change the world” paradigm-shift prediction
ROLE OF HADOOP IN YOUR ENVIRONMENT
QUICK POLL
Under the Covers 
WHAT IS HADOOP?
WHAT IS HADOOP?
Hadoop is a stuffed elephant
WHAT IS HADOOP REALLY?
•Hadoop is an open source distributed storage and processing framework
•Hadoop vs. RDBMS

Layer       Typical RDBMS       Hadoop Stack
Storage     Database Tables     HDFS Files*
Metadata    System Tables       HCatalog & YARN
Queries     SQL Query Engine    Multiple Engines

*Raw data to highly structured
Typical RDBMS: all layers combined in a proprietary bundle
Hadoop Stack: all layers separate and independent, allowing flexible access
REFERENCE ARCHITECTURE
Source: Hortonworks
REFERENCE ARCHITECTURE (DETAILED)
Source: Hortonworks
HADOOP STACK DISTRIBUTIONS

Distribution        Open Source    Premium
Apache              Y              N
Cloudera            Y              Y
Hortonworks         Y              N
MapR                Y (?)          Y
Intel               N              Y
EMC Greenplum HD    N              Y
ADVANTAGES OF HADOOP (FOR BI)
•Dramatically lower cost
–50x to 100x (or more)
•Can store virtually any data type
•Can support multiple analytic engines
•Massively scalable
–Both size and performance
–100s of nodes, TBs of RAM, PBs of storage
•Open source leads to rapid innovation
HADOOP OFFERS COST-EFFECTIVE STORAGE
“A recent survey of large financial services firms, telecommunications carriers and retailers indicated that storing data in an RDBMS typically runs between $30,000 and $100,000 (USD) per TB per year in total costs”
--- Cloudera white paper
–Hadoop can bring down the cost to ~$1,000 / TB
BIG DATA COST COMPARISON
Source: Neustar
BIG DATA COST COMPARISON
Source: Hortonworks
COST CASE STUDY (TELECOM)
•The carrier’s previous data processing environment was costing $59 million (USD) each year to manage 1PB of data, broken down as follows: 
–$2 million (USD) per year = storage for 1PB raw archive data on network-attached storage (NAS) at $2,000 per TB per year 
–$55 million (USD) per year = management and backup of 1PB processed data on EDW at $55,000 per TB per year 
–$2 million (USD) per year = administration costs calculated at $1,000 per TB per year 
•Calculating costs for moving data processing onto Cloudera, the carrier reduced infrastructure costs to $5.1 million (USD) total 
–$5 million (USD) per year = hardware, software and infrastructure for 1PB at $5,000 per TB per year 
–$100,000 (USD) per year = administration costs calculated at $100 per TB per year
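The arithmetic in this case study can be checked with a few lines of Python. This is only a sketch that re-adds the annual figures quoted on the slide. The dictionary keys and variable names are illustrative, not from the source.

```python
# Annual costs in USD, copied from the case study figures above.
legacy_costs = {
    "nas_raw_archive": 2_000_000,          # 1PB at $2,000 per TB per year
    "edw_management_backup": 55_000_000,   # 1PB at $55,000 per TB per year
    "administration": 2_000_000,           # as stated on the slide
}
hadoop_costs = {
    "hardware_software_infrastructure": 5_000_000,  # 1PB at $5,000 per TB per year
    "administration": 100_000,                      # $100 per TB per year
}


def annual_total(costs):
    """Sum the yearly line items of one environment."""
    return sum(costs.values())


legacy_total = annual_total(legacy_costs)
hadoop_total = annual_total(hadoop_costs)
savings = legacy_total - hadoop_total
reduction_factor = legacy_total / hadoop_total

print(f"Previous environment: ${legacy_total:,} per year")
print(f"Hadoop environment:   ${hadoop_total:,} per year")
print(f"Savings:              ${savings:,} per year")
print(f"Reduction factor:     {reduction_factor:.1f}x")
```

On these numbers the totals match the slide ($59 million versus $5.1 million), which works out to a reduction of roughly 11.6x for this particular carrier.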
HADOOP CAN STORE ANY DATA TYPE
•Key-value pairs 
•Text and binary data 
•Structured 
–Database records 
•Semi-structured 
–Sensor & Machine data 
–Log files 
•Un-structured 
–Emails, tweets 
“Set structure at query time” 
Can retain atomic level data
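“Set structure at query time” can be illustrated with a small, Hadoop-free sketch. The raw lines are kept exactly as they arrived, and each reader applies its own schema while reading. The sample records, field names and the `read_with_schema` helper are all hypothetical and only show the idea.

```python
import json

# Raw records stored untouched, exactly as they arrived (hypothetical sample data).
raw_lines = [
    '{"ts": "2014-05-01T10:00:00", "user": "alice", "bytes": "512"}',
    '{"ts": "2014-05-01T10:00:05", "user": "bob", "bytes": "2048", "agent": "curl"}',
    'not valid json at all',
]


def read_with_schema(lines, schema):
    """Apply a schema (field name -> converter) while reading, not while storing."""
    for line in lines:
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip unparseable lines; the raw data itself stays intact
        if not isinstance(record, dict):
            continue
        yield {
            field: convert(record[field]) if field in record else None
            for field, convert in schema.items()
        }


# Two different "queries" impose two different structures on the same raw data.
usage_schema = {"user": str, "bytes": int}
agent_schema = {"user": str, "agent": str}

usage_rows = list(read_with_schema(raw_lines, usage_schema))
agent_rows = list(read_with_schema(raw_lines, agent_schema))

print(usage_rows)
print(agent_rows)
```

Because nothing is discarded at load time, a field that nobody asked for initially (here `agent`) is still available when a later question needs it.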
ANALYTICS IN HADOOP
•‘Batch’ or ‘offline’ analytics
–MapReduce-based tools (Java MapReduce, Streaming, Pig, Hive)
–Have been there from the start, well understood
•Fast Ad-Hoc querying
–New wave of processing, answer to MPP databases (Teradata, etc.)
–Impala (Cloudera), Stinger / Tez (Hortonworks), Shark on Spark (Apache)
•Streaming / Near-Real-Time workloads
–Storm, Spark 
–Propelled by YARN processing framework in Hadoop version 2.x
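For readers who have not seen the batch model behind the MapReduce-based tools, here is a minimal local simulation of the map, shuffle and reduce steps, using word counting as the example. It runs in a single Python process and does not call any Hadoop API. The function names are illustrative only.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: collapse all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence over an in-memory list of lines."""
    mapped = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(mapped).items())


counts = word_count(["Hadoop stores data", "Hadoop processes data"])
print(counts)
```

On a real cluster the mappers and reducers run in parallel on many nodes and the shuffle moves data across the network, which is why this style suits batch work better than interactive queries.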
ANALYTICS IN HADOOP (CONT.)
•BI Tools integration
–Rich BI tool integration
–Various levels of integration (basic, native, high-speed)
–Lots of vendors: Datameer, Pentaho, Tableau, QlikView, IBM Cognos…
•NoSQL store
–Find data very quickly (milliseconds, just like a traditional database)
–HBase
•Statistical Tools
–R
•And, of course, the old favorite
–SQL
–Example: InfiniDB (Calpont)
CHALLENGES OF HADOOP
•Everything is very NEW
•Playing field is changing DAILY
–The Wild West
•Tools still in v1.0 mode (at best)
•Does not eliminate the need for dimensional modeling
•Security TBD
•No “standard” (winners) declared yet
•Lots of rough edges still
•Simple things, like surrogate keys…
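The slide does not say why surrogate keys are hard. One plausible reading, stated here as an assumption, is that parallel tasks have no shared sequence generator to draw from. The sketch below shows one common workaround: each worker hands out keys from its own interleaved series, so no coordination is needed. The function names and the `cust-*` values are hypothetical.

```python
from itertools import count


def surrogate_keys(worker_id, num_workers):
    """Return an endless series of keys that no other worker will produce.

    Worker w of n yields w + 1, w + 1 + n, w + 1 + 2n, ...
    """
    if not 0 <= worker_id < num_workers:
        raise ValueError("worker_id must be in the range [0, num_workers)")
    return count(start=worker_id + 1, step=num_workers)


def assign_keys(natural_keys, worker_id, num_workers):
    """Map each natural key seen by one worker to a fresh surrogate key."""
    keys = surrogate_keys(worker_id, num_workers)
    return {natural: next(keys) for natural in natural_keys}


# Two workers process different slices of the data independently.
worker0 = assign_keys(["cust-A", "cust-B"], worker_id=0, num_workers=2)
worker1 = assign_keys(["cust-C", "cust-D"], worker_id=1, num_workers=2)

print(worker0)
print(worker1)
```

The keys are unique but not dense or ordered by arrival, and the scheme does not handle the same natural key showing up on two workers. That is part of what makes this "simple thing" less simple than in a single-node RDBMS.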
A DIZZYING FIELD OF PLAYERS
•Alpine Data Labs, San Mateo, CA. 
•Cloudera, Palo Alto, CA. 
•Concurrent, San Francisco, CA. 
•Continuum Analytics, Austin, TX. 
•Continuuity, Palo Alto, CA. 
•Couchbase, Mountain View, CA. 
•Datameer, San Mateo, CA. 
•DataSift, San Francisco, CA. 
•DataStax, San Francisco, CA. 
•DataXu, Boston, MA. 
•Enigma, New York, NY. 
•Factual, Los Angeles, CA. 
•GoodData, San Francisco, CA. 
•Gravity, New York, NY. 
•Guavus, San Mateo, CA. 
•Hadapt, Cambridge, MA 
•Hopper, Cambridge, MA. 
•Hortonworks, Palo Alto, CA. 
•KarmaSphere, Cupertino, CA 
•Lattice Engines, San Mateo, CA. 
•MapR Technologies, San Jose, CA.
•MemSQL, New York, NY. 
•Mortar Data, New York, NY. 
•Mu Sigma, Northbrook, IL + India. 
•Neo Technology, San Mateo, CA 
•Opera Solutions, San Diego, CA + India. 
•ParAccel, Campbell, CA. 
•Pivotal Software, Palo Alto, CA 
•Platfora, San Mateo, CA.
•RainStor, San Francisco, CA. 
•Rocket Fuel, Redwood City, CA. 
•SiSense, Redwood Shores, CA and Israel. 
•Skytree, Atlanta, GA. 
•Splice Machine, San Francisco, CA. 
•Splunk, San Francisco, CA 
•Statwing, San Francisco, CA. 
•SumAll, New York, NY. 
•Talend, Los Altos, CA. 
•WibiData, San Francisco, CA. 
•Zettaset, Mountain View, CA 
•Zoomdata, Reston, VA. 
•10gen, New York, NY 
•1010data, New York, NY. 
Partial snapshot as of May 2014
IMPLICATIONS, PREDICTIONS & MISC. MUSINGS
TSUNAMI WARNING
IMPLICATIONS, PREDICTIONS & MUSINGS
•Hadoop as a Data Staging environment
•Hadoop as an Archive
•Hadoop as the Data Warehouse
–“Enterprise Data Hub”
•Future role of RDBMSs?
–For OLTP
–For Data Warehouse
•How much Transformation and where?
TYPICAL “BEST PRACTICES” BI ARCHITECTURE
INTEGRATED BUSINESS PROCESS DIMENSIONAL MODELS WITH METADATA LAYER(S)
[Diagram labels: Source Systems of Record (ERP Data, CRM Data, Other Sources); Data Integration; Conforming; Data Warehouse; Business Process Dimensional Models; Data Abstraction Model; Information Security; Web Portal; Standard Reports; Ad hoc Querying; Planning Data; Slicing & Dicing; Dashboard Authoring; Report Authoring; Dashboards/Scorecards; Threshold Alerting; Threshold-based Alerts; Self-service Reporting & Analysis; Single Version of the Truth]
POTENTIAL BI ARCHITECTURE USING HADOOP
INTEGRATED BUSINESS PROCESS DIMENSIONAL MODELS WITH METADATA LAYER(S)
[Diagram labels: Source Systems of Record (ERP Data, CRM Data, Other Sources); Hadoop Data Staging; Data Integration; Conforming; Data Warehouse; Business Process Dimensional Models; Data Abstraction Model; Information Security; Web Portal; Standard Reports; Ad hoc Querying; Planning Data; Slicing & Dicing; Dashboard Authoring; Report Authoring; Dashboards/Scorecards; Threshold Alerting; Threshold-based Alerts; Self-service Reporting & Analysis; Single Version of the Truth]
IMPLICATIONS, PREDICTIONS & MUSINGS (CONT.)
•What have I got to learn?
–MapReduce = No
–Hand-coding = No
–Sqoop = Maybe
–SQL = YES
•Role of Existing Tools going forward
–ETL
–BI Front-ends
•Role of DW Appliances?
–HANA
–IBM PureData System (formerly Netezza), etc.
IMPLICATIONS, PREDICTIONS & MUSINGS (CONT.)
•What is the impact on end-users seeking information?
•We still need:
–Data delivered in business user-friendly state
–Rich, relevant and conforming dimensions
–Ability to account for dimension changes over time
–Good performance (transformation and aggregation)
–Ability to integrate with existing systems
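“Dimension changes over time” is usually handled in dimensional modeling by keeping a history of versions instead of overwriting a row. The slide does not name a technique, so the sketch below uses the familiar "Type 2" slowly changing dimension pattern as an illustration. The row layout, function names and sample customer are hypothetical.

```python
from datetime import date


def apply_change(history, customer_id, new_attrs, effective):
    """Type 2 style update: close the current row and append a new version."""
    for row in history:
        if row["customer_id"] == customer_id and row["valid_to"] is None:
            if row["attrs"] == new_attrs:
                return history  # nothing changed, keep the current version open
            row["valid_to"] = effective
    history.append({
        "customer_id": customer_id,
        "attrs": new_attrs,
        "valid_from": effective,
        "valid_to": None,  # None marks the current version
    })
    return history


def as_of(history, customer_id, when):
    """Return the attributes that were in effect for a customer on a given date."""
    for row in history:
        if (row["customer_id"] == customer_id
                and row["valid_from"] <= when
                and (row["valid_to"] is None or when < row["valid_to"])):
            return row["attrs"]
    return None


history = []
apply_change(history, 42, {"region": "West"}, date(2013, 1, 1))
apply_change(history, 42, {"region": "East"}, date(2014, 3, 1))

print(as_of(history, 42, date(2013, 6, 1)))
print(as_of(history, 42, date(2014, 5, 1)))
```

Closing a row is an in-place update, which a relational warehouse does routinely. On append-oriented storage the same result typically means rewriting data, which is one reason this requirement does not go away when the platform changes.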
JP’S CONCLUSION #1
Wow, this stuff is a BIG game changer
JP’S CONCLUSION #2
It’s too early to call on the specifics
JP’S CONCLUSION #3
DW Architectures & Technologies are in a huge state of flux
But…
DW Principles still apply
Resources, Upcoming Events, Q&A 
NEED MORE INFO?
ADDITIONAL RESOURCES
•Cloudera & Ralph Kimball
–Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
–http://www.cloudera.com/content/cloudera/en/resources/library/recordedwebinar/best-practices-for-the-hadoop-data-warehouse-video.html
–Building a Hadoop Data Warehouse: Hadoop 101 for EDW Professionals
–http://www.cloudera.com/content/cloudera/en/resources/library/recordedwebinar/building-a-hadoop-data-warehouse-video.html
•MapR & Jack Norris
–How (and Why) Hadoop is Changing the Data Warehousing Paradigm
–http://tdwi.org/articles/2013/08/13/hadoop-changing-dw-paradigm.aspx
•Hortonworks
–http://hortonworks.com/hadoop/
•Senturus.com
–http://senturus.com/resources/
–jpeterson@senturus.com or jfrazier@senturus.com
Contact us for help on a POC
UPCOMING EVENTS
More information on www.senturus.com
Thank You!!
www.senturus.com
888-601-6010
info@senturus.com
Copyright 2014 by Senturus, Inc.
This entire presentation is copyrighted and may not be reused or distributed without the written consent of Senturus, Inc.

More Related Content

PPTX
Building an Effective Data Warehouse Architecture
James Serra
 
PPTX
Trafodion overview
Rohit Jain
 
PPTX
Introduction To Big Data & Hadoop
Blackvard
 
PDF
Hadoop and the Future of SQL: Using BI Tools with Big Data
Senturus
 
PDF
Hadoop and the Relational Database: The Best of Both Worlds
Inside Analysis
 
PDF
Introduction to Hadoop
POSSCON
 
PDF
Web Briefing: Unlock the power of Hadoop to enable interactive analytics
Kognitio
 
PDF
Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
Senturus
 
Building an Effective Data Warehouse Architecture
James Serra
 
Trafodion overview
Rohit Jain
 
Introduction To Big Data & Hadoop
Blackvard
 
Hadoop and the Future of SQL: Using BI Tools with Big Data
Senturus
 
Hadoop and the Relational Database: The Best of Both Worlds
Inside Analysis
 
Introduction to Hadoop
POSSCON
 
Web Briefing: Unlock the power of Hadoop to enable interactive analytics
Kognitio
 
Beyond PowerPlay: Choose the Right OLAP Tool for Your BI Environment (Cognos...
Senturus
 

Similar to Is Hadoop the Demise of Data Warehousing? The Impact of Hadoop/Big Data on BI and DW (20)

PDF
Turn Data Into Actionable Insights - StampedeCon 2016
StampedeCon
 
PDF
Big Data & Analytics - Innovating at the Speed of Light
Amazon Web Services LATAM
 
PDF
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
jaxconf
 
PDF
Take Action: The New Reality of Data-Driven Business
Inside Analysis
 
PDF
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Publicis Sapient Engineering
 
PDF
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
Hortonworks
 
PDF
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
 
PDF
Scaling up with Cisco Big Data: Data + Science = Data Science
eRic Choo
 
PDF
Do You Really Need a Data Warehouse? Avoid the Downsides Typically Associated...
Senturus
 
PPTX
Hadoop Reporting and Analysis - Jaspersoft
Hortonworks
 
PPTX
OOP 2014
Emil Andreas Siemes
 
PDF
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
PDF
Big Data
Ben Duan
 
PPTX
Ledingkart Meetup #4: Data pipeline @ lk
Mukesh Singh
 
PPTX
Atlanta Data Science Meetup | Qubole slides
Qubole
 
PDF
Hadoop Master Class : A concise overview
Abhishek Roy
 
PDF
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 
PDF
Level Up – How to Achieve Hadoop Acceleration
Inside Analysis
 
PPTX
201305 hadoop jpl-v3
Eric Baldeschwieler
 
PDF
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Hortonworks
 
Turn Data Into Actionable Insights - StampedeCon 2016
StampedeCon
 
Big Data & Analytics - Innovating at the Speed of Light
Amazon Web Services LATAM
 
Apache Hadoop and its role in Big Data architecture - Himanshu Bari
jaxconf
 
Take Action: The New Reality of Data-Driven Business
Inside Analysis
 
Ask Bigger Questions with Cloudera and Apache Hadoop - Big Data Day Paris 2013
Publicis Sapient Engineering
 
C-BAG Big Data Meetup Chennai Oct.29-2014 Hortonworks and Concurrent on Casca...
Hortonworks
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks
 
Scaling up with Cisco Big Data: Data + Science = Data Science
eRic Choo
 
Do You Really Need a Data Warehouse? Avoid the Downsides Typically Associated...
Senturus
 
Hadoop Reporting and Analysis - Jaspersoft
Hortonworks
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
Big Data
Ben Duan
 
Ledingkart Meetup #4: Data pipeline @ lk
Mukesh Singh
 
Atlanta Data Science Meetup | Qubole slides
Qubole
 
Hadoop Master Class : A concise overview
Abhishek Roy
 
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
StampedeCon
 
Level Up – How to Achieve Hadoop Acceleration
Inside Analysis
 
201305 hadoop jpl-v3
Eric Baldeschwieler
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Hortonworks
 
Ad

More from Senturus (20)

PPTX
Power BI Gateway: Understanding, Installing, Configuring
Senturus
 
PPTX
Cognos Performance Tuning Tips & Tricks
Senturus
 
PPTX
Power Automate for Power BI: Getting Started
Senturus
 
PPTX
Collaborative BI: 3 Ways to Use Cognos with Power BI & Tableau
Senturus
 
PPTX
Tips for Installing Cognos Analytics 11.2.1x
Senturus
 
PDF
How to Prepare for a BI Migration
Senturus
 
PPTX
4 Common Analytics Reporting Errors to Avoid
Senturus
 
PPTX
Extending Power BI Functionality with R
Senturus
 
PPTX
Take Control of Your Cloud
Senturus
 
PPTX
Using Python with Power BI
Senturus
 
PPTX
User-Friendly Power BI Report Nav
Senturus
 
PPTX
Streamline Cognos Migrations & Consolidations
Senturus
 
PPTX
What’s New in Cognos 11.2.1
Senturus
 
PPTX
Planning for a Power BI Enterprise Deployment
Senturus
 
PPTX
Power BI Report Builder & Paginated Reports
Senturus
 
PPTX
Tableau: 6 Ways to Publish & Share Dashboards
Senturus
 
PPTX
Cognos Analytics 11.2 New Features
Senturus
 
PPTX
Azure Synapse vs. Snowflake: The Data Warehouse Dating Game
Senturus
 
PPTX
Secrets of High Performing Report Development Teams
Senturus
 
PPTX
Power BI: Data Cleansing & Power Query Editor
Senturus
 
Power BI Gateway: Understanding, Installing, Configuring
Senturus
 
Cognos Performance Tuning Tips & Tricks
Senturus
 
Power Automate for Power BI: Getting Started
Senturus
 
Collaborative BI: 3 Ways to Use Cognos with Power BI & Tableau
Senturus
 
Tips for Installing Cognos Analytics 11.2.1x
Senturus
 
How to Prepare for a BI Migration
Senturus
 
4 Common Analytics Reporting Errors to Avoid
Senturus
 
Extending Power BI Functionality with R
Senturus
 
Take Control of Your Cloud
Senturus
 
Using Python with Power BI
Senturus
 
User-Friendly Power BI Report Nav
Senturus
 
Streamline Cognos Migrations & Consolidations
Senturus
 
What’s New in Cognos 11.2.1
Senturus
 
Planning for a Power BI Enterprise Deployment
Senturus
 
Power BI Report Builder & Paginated Reports
Senturus
 
Tableau: 6 Ways to Publish & Share Dashboards
Senturus
 
Cognos Analytics 11.2 New Features
Senturus
 
Azure Synapse vs. Snowflake: The Data Warehouse Dating Game
Senturus
 
Secrets of High Performing Report Development Teams
Senturus
 
Power BI: Data Cleansing & Power Query Editor
Senturus
 
Ad

Recently uploaded (20)

PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PDF
Fundamentals and Techniques of Biophysics and Molecular Biology (Pranav Kumar...
RohitKumar868624
 
PPTX
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPTX
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PPTX
Presentation on animal welfare a good topic
kidscream385
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PPTX
INFO8116 -Big data architecture and analytics
guddipatel10
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
PDF
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
Fundamentals and Techniques of Biophysics and Molecular Biology (Pranav Kumar...
RohitKumar868624
 
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
Presentation on animal welfare a good topic
kidscream385
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
INFO8116 -Big data architecture and analytics
guddipatel10
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 

Is Hadoop the Demise of Data Warehousing? The Impact of Hadoop/Big Data on BI and DW

  • 2. questions here Copyright 2014Senturus,Inc. AllRightsReserved This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/is-hadoop-the-demise- of-data-warehousing/ Hear the Recording
  • 3. Resource Library Senturus’ whole purpose is to make you successful with Business Analytics. Thus, we offer a series of technology-neutral webinars, training on specific software, demonstrations, and no-holds-barred reviews of new software releases. We host dozens of live webinars every year and we offer a comprehensive library of recorded webinars, demos, white papers, presentations and case studies on our website--a wealth of learning resources. Most of our content is custom created and constantly updated, so visit us often to see what’s new in the industry. www.senturus.com/resources/ 3 Copyright 2014 Senturus, Inc. All Rights Reserved
  • 4. John Peterson CEO & Co-Founder Senturus Today’s Presenter 4 With thanks to: Guy Wilnai, Sujee Maniyam and Knowledge @ Senturus
  • 5. •INTRODUCTION •THEDATACHALLENGE •WHATISHADOOP? •ADVANTAGES& CHALLENGES •IMPLICATIONS, PREDICTIONS& MISC. MUSINGS •CONCLUSIONS •Q&A AGENDA 5 Copyright 2014 Senturus, Inc. All Rights Reserved
  • 7. questions here Copyright 2014Senturus,Inc. AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/is-hadoop-the-demise-of- data-warehousing/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 8. Our Team: Business depth combined with technical expertise. Former CFOs, CIOs, Controllers, Directors, BI Managers SENTURUS: BUSINESSANALYTICSCONSULTANTS 8 Copyright 2014 Senturus, Inc. All Rights Reserved Business Intelligence Enterprise Planning Predictive Analytics Creating Clarity from Chaos
  • 9. •Former Head of BI/ Lead Architect –VISA •Former Chief BI Architect –Jamba Juice •Former Head of BI –Dole •Former Chief BI Architect –Cisco •Former Chief BI Architect –Central Garden & Pet •Former Head of BI –Experian •Former Head of BI –Robert Half International •Former Head of Training (IBM Cognos, Southern California) •Former Controller –The GAP •Two former CFO’s •Former Partner -PWC ($50million+ projects) •Several former Vice Presidents of Marketing, Sales & Manufacturing/Supply Chain •Several former COO’s •Several former CIO’s •Average experience = over 20 years A FEWOFOURTEAMMEMBERS(FORMERROLES) Deep & Pragmatic Experience Copyright 2014 Senturus, Inc. All Rights Reserved. 9
  • 10. 750+ CLIENTS, 1600+ PROJECTS, 13+ YEARS Copyright 2014 Senturus, Inc. All Rights Reserved. 10
  • 11. Outpacing our ability to harness it THEDATACHALLENGE
  • 12. THECHALLENGES(ANDOPPORTUNITIES) 12Copyright 2014 Senturus, Inc. All Rights Reserved. •Data volumes & velocity increasing exponentially •Data types proliferating •Rapid emergence of less structured (or unstructured) data sources •Valueof Data increasing •Traditional ETL is time-consuming and costly •Traditional storage costs skyrocketing(not $/TB) •Business users increasinglyfrustrated at not being able to get access to information
  • 13. THENETRESULT 13 Copyright 2014 Senturus, Inc. All Rights Reserved. Something is bound to happen
  • 14. A WARNINGABOUTTODAY’SFOCUS 14 Copyright 2014 Senturus, Inc. All Rights Reserved. ISABOUT: Hadoopas a potential platform or tool for Business Analytics & DW ISNOTABOUT: Yet another “How Big Data will change the world” paradigm-shift prediction
  • 16. Under the Covers WHATISHADOOP?
  • 17. questions here Copyright 2014Senturus,Inc. AllRightsReserved Hear the Recording This slide deck is part of a recorded webinar. To view the FREE recording of the entire presentation and download the slide deck go to www.senturus.com/resources/is-hadoop-the-demise-of- data-warehousing/ Senturus’ comprehensive library of recorded webinars, demos, white papers, presentations and case studies is available on our website. www.senturus.com
  • 18. WHATISHADOOP? 18 Copyright 2014 Senturus, Inc. All Rights Reserved. Hadoopis a stuffed elephant
  • 19. WHATISHADOOPREALLY? 19 Copyright 2014 Senturus, Inc. All Rights Reserved. Database Tables •Hadoopis an open source distributed storage and processing framework •Hadoopvs. RDBMS System Tables SQL Query Engine Typical RDBMS HDFS Files* Hcatalog& YARN Multiple Engines HadoopStack Storage Metadata Queries *Raw data to highly structured All layers combined in a proprietary bundle All layers separate and independent allowing flexible access
  • 20. REFERENCEARCHITECTURE 20 Copyright 2014 Senturus, Inc. All Rights Reserved. Source: Hortonworks
  • 21. REFERENCEARCHITECTURE(DETAILED) 21 Copyright 2014 Senturus, Inc. All Rights Reserved. Source: Hortonworks
  • 22. HADOOPSTACKDISTRIBUTIONS 22 Copyright 2014 Senturus, Inc. All Rights Reserved. Distribution Open Source Premium Apache Y N Cloudera Y Y HortonWorks Y N MapR Y (?) Y Intel N Y EMC GreenplumHD N Y
  • 23. ADVANTAGESOFHADOOP(FORBI) 23 Copyright 2014 Senturus, Inc. All Rights Reserved. •Dramatically lower cost –50x to 100x (or more) •Can store virtually any data type •Can support multipleanalytic engines •Massively scalable –Both Size and Performance –100’s of nodes, TB of RAM, PB of storage •Open-source leads to rapid innovation
  • 24. HADOOPOFFERSCOSTEFFECTIVESTORAGE “A recent survey of large financial services firms, telecommunications carriers and retailers indicated that storing data in an RDBMS typically runs between $30,000 and $100,000 (USD) per TB per year in total costs” ---Clouderawhite paper -Hadoopcan bring down the cost to ~$1,000 / TB
  • 27. COSTCASESTUDY(TELECOM) •The carrier’s previous data processing environment was costing $59 million (USD) each year to manage 1PB of data, broken down as follows: –$2 million (USD) per year = storage for 1PB raw archive data on network-attached storage (NAS) at $2,000 per TB per year –$55 million (USD) per year = management and backup of 1PB processed data on EDW at $55,000 per TB per year –$2 million (USD) per year = administration costs calculated at $1,000 per TB per year •Calculating costs for moving data processing onto Cloudera, the carrier reduced infrastructure costs to $5.1 million (USD) total –$5 million (USD) per year = hardware, software and infrastructure for 1PB at $5,000 per TB per year –$100,000 (USD) per year = administration costs calculated at $100 per TB per year
  • 28. HADOOPCANSTOREANYDATATYPE •Key-value pairs •Text and binary data •Structured –Database records •Semi-structured –Sensor & Machine data –Log files •Un-structured –Emails, tweets “Set structure at query time” Can retain atomic level data
  • 29. ANALYTICSINHADOOP •‘Batch’ or ‘offline’ analytics –MapReducebased tools (java mapreduce, streaming, pig, hive) –Have been there from the start, Well understood •Fast Ad-Hoc querying –New wave of processing, answer to MPP databases (Teradata .etc) –Impala (Cloudera), stinger / Tez(Hortonworks), Shark on Spark (Apache) •Streaming / Near-RealTimeworkloads –Storm, Spark –Propelled by YARN processing framework in Hadoop version 2.x
  • 30. ANALYTICSINHADOOP(CONT.) •BI Tools integration –Rich BI tool integration –Various levels of integration (basic, native, high-speed) –Lots of vendors : Datameer, Pentaho, Tableau, QlikView, IBM Cognos… •NOSQL store –Find data very quickly (milliseconds, just like a traditional database) –Hbase •Statistical Tools –R •And, of course, the old favorite –SQL –Example: InfiniDB(Calpont)
  • 31. CHALLENGESOFHADOOP 31 Copyright 2014 Senturus, Inc. All Rights Reserved. •Everything is very NEW •Playing field is changing DAILY –The Wild West •Tools still in v1.0 mode (at best) •Does not eliminate the need for dimensional modeling •Security TBD •No “standard”(winners) declared yet •Lots of roughedges still •Simple things, like surrogate keys…
  • 32. A DIZZYING FIELD OF PLAYERS •Alpine Data Labs, San Mateo, CA. •Cloudera, Palo Alto, CA. •Concurrent, San Francisco, CA. •Continuum Analytics, Austin, TX. •Continuuity, Palo Alto, CA. •Couchbase, Mountain View, CA. •Datameer, San Mateo, CA. •DataSift, San Francisco, CA. •DataStax, San Francisco, CA. •DataXu, Boston, MA. •Enigma, New York, NY. •Factual, Los Angeles, CA. •GoodData, San Francisco, CA. •Gravity, New York, NY. •Guavus, San Mateo, CA. •Hadapt, Cambridge, MA. •Hopper, Cambridge, MA. •Hortonworks, Palo Alto, CA. •KarmaSphere, Cupertino, CA. •Lattice Engines, San Mateo, CA. •MapR Technologies, San Jose, CA. •MemSQL, New York, NY. •Mortar Data, New York, NY. •Mu Sigma, Northbrook, IL + India. •Neo Technology, San Mateo, CA. •Opera Solutions, San Diego, CA + India. •ParAccel, Campbell, CA. •Pivotal Software, Palo Alto, CA. •Platfora, San Mateo, CA. •RainStor, San Francisco, CA. •Rocket Fuel, Redwood City, CA. •SiSense, Redwood Shores, CA and Israel. •Skytree, Atlanta, GA. •Splice Machine, San Francisco, CA. •Splunk, San Francisco, CA. •Statwing, San Francisco, CA. •SumAll, New York, NY. •Talend, Los Altos, CA. •WibiData, San Francisco, CA. •Zettaset, Mountain View, CA. •Zoomdata, Reston, VA. •10gen, New York, NY. •1010data, New York, NY. Partial snapshot as of May 2014
  • 33. IMPLICATIONS, PREDICTIONS & MISC. MUSINGS TSUNAMI WARNING
  • 35. IMPLICATIONS, PREDICTIONS & MUSINGS •Hadoop as a data staging environment •Hadoop as an archive •Hadoop as the data warehouse –“Enterprise Data Hub” •Future role of RDBMSs? –For OLTP –For data warehouse •How much transformation, and where?
  • 36. TYPICAL “BEST PRACTICES” BI ARCHITECTURE: INTEGRATED BUSINESS PROCESS DIMENSIONAL MODELS WITH METADATA LAYER(S) ERP Data CRM Data Data Integration Conforming Business Process Dimensional Models Standard Reports Web Portal Other Sources Information Security Data Warehouse Data Abstraction Model Ad hoc Querying Planning Data Slicing & Dicing Dashboard Authoring Report Authoring Dashboards/Scorecards Source Systems of Record Threshold Alerting Self-service Reporting & Analysis Single Version of the Truth Threshold-based Alerts
  • 37. POTENTIAL BI ARCHITECTURE USING HADOOP: INTEGRATED BUSINESS PROCESS DIMENSIONAL MODELS WITH METADATA LAYER(S) ERP Data CRM Data Data Integration Conforming Business Process Dimensional Models Standard Reports Web Portal Other Sources Information Security Data Warehouse Data Abstraction Model Ad hoc Querying Planning Data Slicing & Dicing Dashboard Authoring Report Authoring Dashboards/Scorecards Source Systems of Record Threshold Alerting Self-service Reporting & Analysis Single Version of the Truth Threshold-based Alerts Hadoop Data Staging
  • 38. IMPLICATIONS, PREDICTIONS & MUSINGS (CONT.) •What have I got to learn? –MapReduce = No –Hand-coding = No –Sqoop = Maybe –SQL = YES •Role of existing tools going forward –ETL –BI front-ends •Role of DW appliances? –HANA –IBM PureData System (formerly Netezza), etc.
  • 39. IMPLICATIONS, PREDICTIONS & MUSINGS (CONT.) •What is the impact on end-users seeking information? •We still need: –Data delivered in business user-friendly state –Rich, relevant and conforming dimensions –Ability to account for dimension changes over time –Good performance (transformation and aggregation) –Ability to integrate with existing systems
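One common way to "account for dimension changes over time" is to keep a new row, with its own surrogate key, for every version of a dimension member. The deck does not prescribe a technique; the sketch below assumes that versioned-row approach (often called a Type 2 slowly changing dimension), and the `CustomerRow` class, its fields and the `apply_change` helper are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class CustomerRow:
    surrogate_key: int          # warehouse-generated key, one per version
    customer_id: str            # natural key from the source system
    region: str                 # the attribute whose history we track
    valid_from: date
    valid_to: Optional[date] = None  # None marks the current version


def apply_change(rows: List[CustomerRow], customer_id: str,
                 new_region: str, change_date: date) -> List[CustomerRow]:
    """Close the current version (if any) and append a new one."""
    current = next(
        (r for r in rows if r.customer_id == customer_id and r.valid_to is None),
        None,
    )
    if current is not None:
        if current.region == new_region:
            return rows  # nothing changed, keep the current version open
        current.valid_to = change_date
    next_key = max((r.surrogate_key for r in rows), default=0) + 1
    rows.append(CustomerRow(next_key, customer_id, new_region, change_date))
    return rows


dim: List[CustomerRow] = []
apply_change(dim, "C-100", "West", date(2013, 1, 1))
apply_change(dim, "C-100", "East", date(2014, 5, 1))
for row in dim:
    print(row)
```

Facts recorded before the move keep pointing at the "West" version through its surrogate key, which is why simple-sounding things like generating those keys matter on any platform, Hadoop included.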
  • 40. JP’S CONCLUSION #1 Wow, this stuff is a BIG game changer
  • 41. JP’S CONCLUSION #2 It’s too early to call on the specifics
  • 42. JP’S CONCLUSION #3 DW Architectures & Technologies are in a huge state of flux. But… DW Principles still apply
  • 43. Resources, Upcoming Events, Q&A NEED MORE INFO?
  • 44. ADDITIONAL RESOURCES •Cloudera & Ralph Kimball –Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals –http://www.cloudera.com/content/cloudera/en/resources/library/recordedwebinar/best-practices-for-the-hadoop-data-warehouse-video.html –Building a Hadoop Data Warehouse: Hadoop 101 for EDW Professionals –http://www.cloudera.com/content/cloudera/en/resources/library/recordedwebinar/building-a-hadoop-data-warehouse-video.html •MapR & Jack Norris –How (and Why) Hadoop is Changing the Data Warehousing Paradigm –http://tdwi.org/articles/2013/08/13/hadoop-changing-dw-paradigm.aspx •Hortonworks –http://hortonworks.com/hadoop/ •Senturus.com –http://senturus.com/resources/ –[email protected] [email protected] Contact us for help on a POC
  • 45. UPCOMING EVENTS www.senturus.com
  • 46. More Information on www.senturus.com
  • 48. Thank You!! www.senturus.com 888-601-6010 [email protected] Copyright 2014 by Senturus, Inc. This entire presentation is copyrighted and may not be reused or distributed without the written consent of Senturus, Inc.