Self-Service Analytics – For Enterprise
Audience
• Sreejith Madhavan
– msreejith@yahoo.com
– https://www.linkedin.com/in/msreejith
Enterprise Analytics Portfolio – Lay of
The Land
Data Analytics – Basic Concepts
• Business Intelligence
o Using the available data to make factual business decisions
o “WHAT” is happening to your business right now?
• Business Analytics
o Steps that lead up to business decision
o Data Mining - process of looking for trends, patterns, or other useful
information within dataset
o Diagnostic analytics - “WHY” something is happening right now
o Predictive analytics - “WHAT Will” happen in future
o Prescriptive analytics - “WHAT Should be Done next”
Enterprise Analytics Landscape
• Enterprises typically have Users categorized broadly as -
o Business users – most interested in current metrics, fiscal trends, dashboards
o Engineering users – most interested in diagnostics (find needle-in-haystack),
deep-analytics
o An enterprise analytics solution stack should cover self-service needs to above
broad user-base
• Existing Data-stores Have Varying Use-cases
o Representing specialized data (application specific)
o Organizational units having independent solutions (IT, Engineering, Support etc..)
o Data architecture demands (BI tool backend, Datamarts, OLTP/OLAP etc)
• Enter Hadoop Datalake…
o Answering “Why” you need Hadoop Datalake in your Analytics landscape is critical
o What short, long term goals need to be met
o Not meant to be a one-stop-shop solution to replace existing Databases and
workflows
o Enterprise has several types of Users (by broad skill level) – A self service solution
stack should cater to broad User base by having mix-of several tools
Understanding Existing Data-Stores
Structured
data of Pre-
Computed
measures
Analytical
Cubes
Currently
SQL Server
Business
Analytics
system
Structured
data as Star
schema
with Dims
and Facts
Datamart
Currently
Oracle
Decision
Support
system/
Datamart
Structured,
Semi-
structured
data per
Event
granularity
Hive, M/R,
Datameer
Big Data
system
(Datalake)
Original
data
persisted in
its incoming
form
HDFS(M/R),
NFS
(Scripts),
REST
Raw Data
Highly granular and
complete dataset
Lower granularity and
subset of source data
Good for standard
Biz Metrics of
current and fiscal
trend
Good for interactive
Adhoc reporting
Good for diagnostic
mining and general
Adhoc reports at
scale
Useful to do ELT to
feed into other data
sources
Access
Interface/Tool
Data
Characteristic
Advanced Users (Data
Engineers/Scientists)
Enhance and persist
data-model, Develop
Deep insights
workflows
Frameworks, APIs
Map-reduce, Hive, Pig,
Spark, R, Programmatic
(JDBC..)
Technical Analysts
Generate Adhoc and
canned reports
SQL and
Transformation-
workflow based Tools
Oracle, SQL-Server,
Hive, R, Vertica,
Teradata, Datameer,
Tableau, PDI
Exec-users (Non-
Technical)
Consume predefined
metrics, Dashboards,
drag-n-drop what-if
analysis
Visual, Natural
language based tools
Tableau, OBIEE, PBA,
Excel, Microstrategy,
Search UI
End User Categories and Expectations
Usage
Characteristics
Interface
Characteristics
Sample Tools In each
Vertical
User and Use-case Requirement Considerations
• Demarcate target Users – Provision right Tool to right Users/Use-cases
– Not all users can should be given a Hadoop Datalake interface in self-service model
– Not one tool can fit all Use-cases
• Get to a Consolidated view of existing Data Sources to cover most
common domain objects to target “BI” based self-service model
• Data architecture - Data-layout and Data-model for the above
“Consolidated view”
– Star-schema vs Analytic Cube vs Flat OLTP schema
– MPP Analytic Database vs OLAP Cube vs DSS
– Traversing and Finding Metadata - Search interface to find entities, attributes and data
– Documentation covering data-model and data-dictionary
• Performance considerations
– High Performance and Concurrency support backend for interfacing BI Tools
– Scalable environment for batch, mining use-cases
– Interactive programmatic platform for data engineering
• Miscellaneous Operational Considerations (slide7)
Holistic View For Building E2E Analytics
Platform
Objectives For Holistic Analytics Platform
• Establish a self-service Analytics platform to cover BI and
Analytics use-cases for Internal users
• Support 3Vs of User types and Access patterns
o Volume of data
o Variety of Users (Programmatic and Non-technical)
o Variety of Queries (Adhoc, Not pre-defined)
o Velocity (Interactive query response, Dashboarding)
• Design Principles
o Embrace ideology of “one-tool doesn’t fit all use-cases and user preferences”
o Ease of Use (Front-end interface and Backend Data-model)
o Improved Performance to query response times
Datalake Analytics Platform – Conceptual View
MPP/Analytic
Database
PUAT Datamart Hive HDFS
BI Tool Front-End
Spark
Hue UI
(Hive, Search)
DataStore
Layer
Processing
Engine
Layer
Viz.and
Data
Access
Layer
• Focus on Data Processing & Integration frameworks
• Adhoc Data mining, complex data transformations, Machine learning
• 25-50 Concurrent users
• Focus on Visualization & Metrics (not Data Processing)
• Support Adhoc and Canned Self-service Reports
• 100+ Concurrent users
Extended
Datamodel
Cloudera Search
Spark CLI,
Hive Jdbc
(Programmatic
Access)
Datameer
(Non-
Programmatic)
Engineering focused Self-serve Reporting (Analysts &
Data engineers, Data scientists)
Business focused Self-serve Reporting (Analysts, Execs,
non-technical Audience)
Search
Front-End
Datalake Analytics Platform – Technology View
HDFS
(Orig Source)
Spark Data Prep
FW
M/R Daily HDFS
Transforms
HDFS
(Transformed)
Hive/Impala
Time based
SeqFile
Layout
System based
PARQUET
Layout
Adhoc Query
Hue UI/ Edge
Node CLI
Vertica MPP
Analytic DB
(12 month window)
On-demand
Parsed content
Datam
art
Structured
Config Feed
Cloudera Search
Indexing Prep FW
SSAS
Latest System
Snapshot raw
Latest Week Raw
& Structured
Data-
Prep/Transform
(SnapLogic/Data
meer)
Cloudera
Search Hue
UI
Tableau/Penta
ho BA
Spark
CLI/MLLib
Data-Prep/Filter
& Import
(SnapLogic)
DistributedR
Flattened
Star-schema
ZoomData
Raw
Data
Export
Published
Extended schema
Text search & Search AnalyticsSelf-serve BI
Reporting
Statistical Analytics Adhoc SQL Queries On-demand Data Transformations
Other
Sources…
Existing Components
Processing Workflows
New ComponentsOther
Legend
Evolving Other Operational Requirements
Agility and Productivity for End users
Monitoring and Governance
- Monitor & recover user, system jobs/service failures
- Analytics on Analytics – user and system behaviour
- Data quality, security etc
Ease of access to Data
- Abstracting data complexities, Provisioning prep’ed data to cover standard use-cases
- Query response times, Data mobility(transfer) issues
Understanding the Dataset
- Documentation, Catalog, Data Dictionary, Data Exploration
External References
• https://www.vertica.com/2014/04/18/facebook-and-vertica-a-case-for-mpp-databases/
• https://practicalanalytics.wordpress.com/2015/06/11/databianalytics-evolution-netflix/
• http://www.thebigdatainsightgroup.com/site/sites/default/files/Teradata's%20-
%20Big%20Data%20Architecture%20-%20Putting%20all%20your%20eggs%20in%20one%20basket.pdf
• http://www.slideshare.net/Dataconomy/hp-vertica-dataconomy
• http://www.bryanbrandow.com/2014/05/microstrategy-vs-tableau.html
• http://www.experfy.com/blog/pentaho-vs-tableau-comparison-visualization-dashboards/

More Related Content

PPT
Data Architecture for Data Governance
PDF
Activate Data Governance Using the Data Catalog
PPTX
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
PPT
Data Governance
PDF
Modernizing to a Cloud Data Architecture
PPTX
Power BI Overview, Deployment and Governance
PDF
Build Real-Time Applications with Databricks Streaming
PDF
Data Governance and Metadata Management
Data Architecture for Data Governance
Activate Data Governance Using the Data Catalog
Verizon: Finance Data Lake implementation as a Self Service Discovery Big Dat...
Data Governance
Modernizing to a Cloud Data Architecture
Power BI Overview, Deployment and Governance
Build Real-Time Applications with Databricks Streaming
Data Governance and Metadata Management

What's hot (20)

PPTX
Azure SQL Database Managed Instance
PDF
Owning Your Own (Data) Lake House
PPTX
Introduction to Data Engineering
PDF
PDF
Data Lake Architecture – Modern Strategies & Approaches
PPTX
Introducing Azure SQL Data Warehouse
PDF
Moving to Databricks & Delta
PPTX
Demystifying data engineering
PDF
Architect’s Open-Source Guide for a Data Mesh Architecture
PDF
The Business Value of Metadata for Data Governance
PDF
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
PPTX
DAMA International DMBOK V2 - Comparison with V1
PDF
RWDG Slides: Building a Data Governance Roadmap
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PDF
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
PPTX
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
PPTX
Intro to Data Vault 2.0 on Snowflake
PDF
Data Warehouse or Data Lake, Which Do I Choose?
PPTX
Data product thinking-Will the Data Mesh save us from analytics history
PPT
Data Lakehouse Symposium | Day 1 | Part 2
Azure SQL Database Managed Instance
Owning Your Own (Data) Lake House
Introduction to Data Engineering
Data Lake Architecture – Modern Strategies & Approaches
Introducing Azure SQL Data Warehouse
Moving to Databricks & Delta
Demystifying data engineering
Architect’s Open-Source Guide for a Data Mesh Architecture
The Business Value of Metadata for Data Governance
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
DAMA International DMBOK V2 - Comparison with V1
RWDG Slides: Building a Data Governance Roadmap
Building a Data Strategy – Practical Steps for Aligning with Business Goals
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
Intro to Data Vault 2.0 on Snowflake
Data Warehouse or Data Lake, Which Do I Choose?
Data product thinking-Will the Data Mesh save us from analytics history
Data Lakehouse Symposium | Day 1 | Part 2
Ad

Viewers also liked (13)

PDF
The Power of Self Service Reporting
DOC
Obiee metadata dictionary
PDF
Extending the Self-Service Capabilities of SAP BI with SAP BusinessObjects Ex...
PPTX
Agile collaborative practices
PPTX
Trivial works.com introduction
PPT
Agile Development For Rte Systems
PDF
Collaborative and agile development of mobile applications
PPTX
The Business Benefits of a Data-Driven, Self-Service BI Organization
PDF
Realtime Reporting using Spark Streaming
PDF
The Complete Guide to Embedded Analytics
PPT
Agile presentation
PPTX
Tableau Server Basics
PPTX
Overview of Agile Methodology
The Power of Self Service Reporting
Obiee metadata dictionary
Extending the Self-Service Capabilities of SAP BI with SAP BusinessObjects Ex...
Agile collaborative practices
Trivial works.com introduction
Agile Development For Rte Systems
Collaborative and agile development of mobile applications
The Business Benefits of a Data-Driven, Self-Service BI Organization
Realtime Reporting using Spark Streaming
The Complete Guide to Embedded Analytics
Agile presentation
Tableau Server Basics
Overview of Agile Methodology
Ad

Similar to Self Service Reporting & Analytics For an Enterprise (20)

PPTX
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
PDF
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
PPTX
Tableau and hadoop
PDF
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
PPTX
Big Data SE vs. SE for Big Data
PDF
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
PDF
18Mar14 Find the Hidden Signal in Market Data Noise Webinar
PDF
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
PDF
Teradata - Presentation at Hortonworks Booth - Strata 2014
PPTX
Big data unit 2
PPTX
AzureDay - Introduction Big Data Analytics.
PPTX
Skillwise Big Data part 2
PDF
SFScon19 - Grazia Cazzin - KNOWAGE the open source answer to the new needs in...
PDF
Architecting Agile Data Applications for Scale
PPTX
No sql and sql - open analytics summit
PPTX
Introduction To Big Data & Hadoop
PPTX
Skilwise Big data
PDF
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
PPT
Kushal Data Warehousing PPT
PDF
Hadoop meets Agile! - An Agile Big Data Model
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Tableau and hadoop
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data SE vs. SE for Big Data
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
18Mar14 Find the Hidden Signal in Market Data Noise Webinar
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
Teradata - Presentation at Hortonworks Booth - Strata 2014
Big data unit 2
AzureDay - Introduction Big Data Analytics.
Skillwise Big Data part 2
SFScon19 - Grazia Cazzin - KNOWAGE the open source answer to the new needs in...
Architecting Agile Data Applications for Scale
No sql and sql - open analytics summit
Introduction To Big Data & Hadoop
Skilwise Big data
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Kushal Data Warehousing PPT
Hadoop meets Agile! - An Agile Big Data Model

Recently uploaded (20)

PDF
Delhi c@ll girl# cute girls in delhi with travel girls in delhi call now
PPTX
Microsoft Fabric Modernization Pathways in Action: Strategic Insights for Dat...
PPTX
ISO 9001-2015 quality management system presentation
PPTX
Evaluasi program Bhs Inggris th 2023-2024 dan prog th 2024-2025-1.pptx
PDF
Memberlist of Indian Paintrs and coarutinfjnzbdscjhz
PPTX
Transport System for Biology students in the 11th grade
PPT
DWDM unit 1 for btech 3rd year students.ppt
PDF
TenneT-Integrated-Annual-Report-2018.pdf
PPT
genetics-16bbbbbbhhbbbjjjjjjjjffggg11-.ppt
PDF
PPT nikita containers of the company use
PPTX
Fkrjrkrkekekekeekkekswkjdjdjddwkejje.pptx
PPTX
Overview_of_Computing_Presentation.pptxxx
PPTX
Bussiness Plan S Group of college 2020-23 Final
PDF
Machine Learning Final Summary Cheat Sheet
PPTX
An Introduction to Lean Six Sigma for Bilginer
PDF
American Journal of Multidisciplinary Research and Review
PPTX
UNIT-1 NOTES Data warehousing and data mining.pptx
PPT
2011 HCRP presentation-final.pptjrirrififfi
PDF
Nucleic-Acids_-Structure-Typ...-1.pdf 011
PDF
Q1-wK1-Human-and-Cultural-Variation-sy-2024-2025-Copy-1.pdf
Delhi c@ll girl# cute girls in delhi with travel girls in delhi call now
Microsoft Fabric Modernization Pathways in Action: Strategic Insights for Dat...
ISO 9001-2015 quality management system presentation
Evaluasi program Bhs Inggris th 2023-2024 dan prog th 2024-2025-1.pptx
Memberlist of Indian Paintrs and coarutinfjnzbdscjhz
Transport System for Biology students in the 11th grade
DWDM unit 1 for btech 3rd year students.ppt
TenneT-Integrated-Annual-Report-2018.pdf
genetics-16bbbbbbhhbbbjjjjjjjjffggg11-.ppt
PPT nikita containers of the company use
Fkrjrkrkekekekeekkekswkjdjdjddwkejje.pptx
Overview_of_Computing_Presentation.pptxxx
Bussiness Plan S Group of college 2020-23 Final
Machine Learning Final Summary Cheat Sheet
An Introduction to Lean Six Sigma for Bilginer
American Journal of Multidisciplinary Research and Review
UNIT-1 NOTES Data warehousing and data mining.pptx
2011 HCRP presentation-final.pptjrirrififfi
Nucleic-Acids_-Structure-Typ...-1.pdf 011
Q1-wK1-Human-and-Cultural-Variation-sy-2024-2025-Copy-1.pdf

Self Service Reporting & Analytics For an Enterprise

  • 1. Self-Service Analytics – For Enterprise Audience • Sreejith Madhavan – [email protected] – https://www.linkedin.com/in/msreejith
  • 2. Enterprise Analytics Portfolio – Lay of The Land
  • 3. Data Analytics – Basic Concepts • Business Intelligence o Using the available data to make factual business decisions o “WHAT” is happening to your business right now? • Business Analytics o Steps that lead up to business decision o Data Mining - process of looking for trends, patterns, or other useful information within dataset o Diagnostic analytics - “WHY” something is happening right now o Predictive analytics - “WHAT Will” happen in future o Prescriptive analytics - “WHAT Should be Done next”
  • 4. Enterprise Analytics Landscape • Enterprises typically have Users categorized broadly as - o Business users – most interested in current metrics, fiscal trends, dashboards o Engineering users – most interested in diagnostics (find needle-in-haystack), deep-analytics o An enterprise analytics solution stack should cover self-service needs to above broad user-base • Existing Data-stores Have Varying Use-cases o Representing specialized data (application specific) o Organizational units having independent solutions (IT, Engineering, Support etc..) o Data architecture demands (BI tool backend, Datamarts, OLTP/OLAP etc) • Enter Hadoop Datalake… o Answering “Why” you need Hadoop Datalake in your Analytics landscape is critical o What short, long term goals need to be met o Not meant to be a one-stop-shop solution to replace existing Databases and workflows o Enterprise has several types of Users (by broad skill level) – A self service solution stack should cater to broad User base by having mix-of several tools
  • 5. Understanding Existing Data-Stores Structured data of Pre- Computed measures Analytical Cubes Currently SQL Server Business Analytics system Structured data as Star schema with Dims and Facts Datamart Currently Oracle Decision Support system/ Datamart Structured, Semi- structured data per Event granularity Hive, M/R, Datameer Big Data system (Datalake) Original data persisted in its incoming form HDFS(M/R), NFS (Scripts), REST Raw Data Highly granular and complete dataset Lower granularity and subset of source data Good for standard Biz Metrics of current and fiscal trend Good for interactive Adhoc reporting Good for diagnostic mining and general Adhoc reports at scale Useful to do ELT to feed into other data sources Access Interface/Tool Data Characteristic
  • 6. Advanced Users (Data Engineers/Scientists) Enhance and persist data-model, Develop Deep insights workflows Frameworks, APIs Map-reduce, Hive, Pig, Spark, R, Programmatic (JDBC..) Technical Analysts Generate Adhoc and canned reports SQL and Transformation- workflow based Tools Oracle, SQL-Server, Hive, R, Vertica, Teradata, Datameer, Tableau, PDI Exec-users (Non- Technical) Consume predefined metrics, Dashboards, drag-n-drop what-if analysis Visual, Natural language based tools Tableau, OBIEE, PBA, Excel, Microstrategy, Search UI End User Categories and Expectations Usage Characteristics Interface Characteristics Sample Tools In each Vertical
  • 7. User and Use-case Requirement Considerations • Demarcate target Users – Provision right Tool to right Users/Use-cases – Not all users can should be given a Hadoop Datalake interface in self-service model – Not one tool can fit all Use-cases • Get to a Consolidated view of existing Data Sources to cover most common domain objects to target “BI” based self-service model • Data architecture - Data-layout and Data-model for the above “Consolidated view” – Star-schema vs Analytic Cube vs Flat OLTP schema – MPP Analytic Database vs OLAP Cube vs DSS – Traversing and Finding Metadata - Search interface to find entities, attributes and data – Documentation covering data-model and data-dictionary • Performance considerations – High Performance and Concurrency support backend for interfacing BI Tools – Scalable environment for batch, mining use-cases – Interactive programmatic platform for data engineering • Miscellaneous Operational Considerations (slide7)
  • 8. Holistic View For Building E2E Analytics Platform
  • 9. Objectives For Holistic Analytics Platform • Establish a self-service Analytics platform to cover BI and Analytics use-cases for Internal users • Support 3Vs of User types and Access patterns o Volume of data o Variety of Users (Programmatic and Non-technical) o Variety of Queries (Adhoc, Not pre-defined) o Velocity (Interactive query response, Dashboarding) • Design Principles o Embrace ideology of “one-tool doesn’t fit all use-cases and user preferences” o Ease of Use (Front-end interface and Backend Data-model) o Improved Performance to query response times
  • 10. Datalake Analytics Platform – Conceptual View MPP/Analytic Database PUAT Datamart Hive HDFS BI Tool Front-End Spark Hue UI (Hive, Search) DataStore Layer Processing Engine Layer Viz.and Data Access Layer • Focus on Data Processing & Integration frameworks • Adhoc Data mining, complex data transformations, Machine learning • 25-50 Concurrent users • Focus on Visualization & Metrics (not Data Processing) • Support Adhoc and Canned Self-service Reports • 100+ Concurrent users Extended Datamodel Cloudera Search Spark CLI, Hive Jdbc (Programmatic Access) Datameer (Non- Programmatic) Engineering focused Self-serve Reporting (Analysts & Data engineers, Data scientists) Business focused Self-serve Reporting (Analysts, Execs, non-technical Audience) Search Front-End
  • 11. Datalake Analytics Platform – Technology View HDFS (Orig Source) Spark Data Prep FW M/R Daily HDFS Transforms HDFS (Transformed) Hive/Impala Time based SeqFile Layout System based PARQUET Layout Adhoc Query Hue UI/ Edge Node CLI Vertica MPP Analytic DB (12 month window) On-demand Parsed content Datam art Structured Config Feed Cloudera Search Indexing Prep FW SSAS Latest System Snapshot raw Latest Week Raw & Structured Data- Prep/Transform (SnapLogic/Data meer) Cloudera Search Hue UI Tableau/Penta ho BA Spark CLI/MLLib Data-Prep/Filter & Import (SnapLogic) DistributedR Flattened Star-schema ZoomData Raw Data Export Published Extended schema Text search & Search AnalyticsSelf-serve BI Reporting Statistical Analytics Adhoc SQL Queries On-demand Data Transformations Other Sources… Existing Components Processing Workflows New ComponentsOther Legend
  • 12. Evolving Other Operational Requirements Agility and Productivity for End users Monitoring and Governance - Monitor & recover user, system jobs/service failures - Analytics on Analytics – user and system behaviour - Data quality, security etc Ease of access to Data - Abstracting data complexities, Provisioning prep’ed data to cover standard use-cases - Query response times, Data mobility(transfer) issues Understanding the Dataset - Documentation, Catalog, Data Dictionary, Data Exploration
  • 13. External References • https://www.vertica.com/2014/04/18/facebook-and-vertica-a-case-for-mpp-databases/ • https://practicalanalytics.wordpress.com/2015/06/11/databianalytics-evolution-netflix/ • http://www.thebigdatainsightgroup.com/site/sites/default/files/Teradata's%20- %20Big%20Data%20Architecture%20-%20Putting%20all%20your%20eggs%20in%20one%20basket.pdf • http://www.slideshare.net/Dataconomy/hp-vertica-dataconomy • http://www.bryanbrandow.com/2014/05/microstrategy-vs-tableau.html • http://www.experfy.com/blog/pentaho-vs-tableau-comparison-visualization-dashboards/

Editor's Notes

  • #5: Business users (typically from Sales, Product management, Other execs) Engineering users (Developers, QA, Technical support engineers, Analysts, Data scientists)
  • #10: User Types: - Semi/non- technical users – easy to use drag-n-drop interface - advanced users - Programmatic and SQL based interfaces Improved Performance considerations - High Performance and Concurrent platform for user interactions via BI Tools - Scalable environment for batch, mining use-cases - nteractive programmatic platform for data engineering
  • #11: Business users workflows: - Self-service - Answer “What” questions - Analytic Database – consolidate data model supporting quick Vizn, Performance and lower learning curve Engineering users workflows: - Self-service – Answer “Why” and “What next” questions
  • #12: CLI – Command-line Interface MLLib – Machine learning Lib Data Prep FW – Data Preparation framework MPP – Massive Parallel Processing BI – Business Intelligence