Be Heroic
Analytics. For Anyone.
Turn Big Data
into Action
2
My journey….
Early Life
College
Early Career
Masters
Mid Career
3
About me now….
Michele Chambers
President/COO
@mcAnalytics
mchambers@rapidminer.com
4
Analytics on big data is no longer just a
competitive advantage.
It’s a Business
Requirement.
Progressive businesses must accelerate time-to-value not only to thrive, but survive.
5
Unlike traditional analytics providers,
RapidMiner enables anyone to make the
most of all data in all environments,
creating a powerful advantage from the
wisdom of over 250,000 users.
RapidMiner is the industry's easiest-to-use
Modern Analytics Platform that
significantly accelerates productivity – from
data blending to predictive action.
Built by data scientists for data scientists, business analysts, and developers.
6
Traditional
Modern
Advanced Analytics Market Maturity
Lagging innovation
High-velocity innovation
7
Traditional
Evolving Roles for Advanced Analytics
Modern
Status Quo
• Statisticians
• Quants
• Actuaries
Next Generation
• Data Scientists
• Business Analysts
8
Traditional
Evolving Advanced Analytics Market
Modern
Limitations
• Limited handling of a variety of data sources
• Legacy compute engines
• On-premises, if not offline
Limitless
• Big Data
• New compute engines
• Cloud
9
Traditional vs. Modern Analytics Market
Magic Quadrant for
Advanced Analytics Platforms
February 2015
Challengers Leaders
Niche players Visionaries
Completeness of vision
Ability to execute
Tibco Software
Prognoz
Salford Systems
Revolution Analytics
Predixion
Angoss
FICO
SAP
Dell
Microsoft
KNIME
IBM
SAS
RapidMiner
Alteryx
Alpine Data Labs
10
Enter RapidMiner. Analytics. For Anyone.
Accelerate
Pre-Built Templates
One-Click Deployments
Connect
All Data
All Environments
Simplify
Code-Free
Wisdom of Crowds
11
Wisdom of Crowds
How do we create data science heroes?
1. Anonymously collect analytic processes from analysts across the enterprise
2. Store them in a knowledge base of analytic best practices
3. Use machine learning algorithms to recommend and empower any user at any skill level to become a data science hero
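The collect, store, and recommend steps on this slide can be sketched in a few lines. This is only an illustration under stated assumptions: the slide does not say which machine learning algorithms RapidMiner uses, so simple transition counting stands in for them here, and the function names and sample processes are hypothetical.

```python
from collections import Counter, defaultdict


def build_knowledge_base(processes):
    """Steps 1 and 2: count how often each operator follows another
    across all collected (anonymized) processes."""
    transitions = defaultdict(Counter)
    for operators in processes:
        for current, following in zip(operators, operators[1:]):
            transitions[current][following] += 1
    return transitions


def recommend_next(knowledge_base, last_operator, k=2):
    """Step 3: suggest up to k operators that most often followed last_operator.
    Frequency counting is a stand-in for a real recommendation model."""
    followers = knowledge_base.get(last_operator, Counter())
    return [operator for operator, _ in followers.most_common(k)]


# Hypothetical collected processes: each is an ordered list of operator names.
collected = [
    ["Read CSV", "Replace Missing Values", "Decision Tree", "Apply Model"],
    ["Read CSV", "Replace Missing Values", "Normalize", "k-Means"],
    ["Read CSV", "Select Attributes", "Decision Tree", "Apply Model"],
]

kb = build_knowledge_base(collected)
suggestions = recommend_next(kb, "Read CSV")
print(suggestions)
```

A user who has just placed a "Read CSV" step would be shown the steps that other analysts most often placed next, which is the sense in which the crowd's habits become guidance for a less experienced user.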
12
Self Service Modern Analytics Platform
RapidMiner Studio
Design your analytics code-free
using 1500+ operators
RapidMiner Radoop
Push down computations to
where your data lives
RapidMiner Streams
Analyze streaming data while in
motion
RapidMiner Cloud
Elastic compute environment
for high performance analytics
RapidMiner Server
Enterprise analytics
environment for integration
with business processes
Orchestrate
Design
Compute
Business Analysts Data Scientists
Consume
Machine
Business Users
Web
App
Custom
App
Biz
App
Viz BI
Studio
Code-Free GUI Engine
Engine
In-Memory In-Database In-Hadoop
Engine
Studio
Engine Engine
Streams
Engine
Radoop
Engine
Cloud
Engine
Server
Web Services API
In-Stream
Engine
13
IT
Use statistical tools to:
• create ad hoc predictive processes
Developer Data Scientists
Use programming languages and libraries to:
• build completely new algorithms
• create highly customized advanced analytic processes
Applied Data Scientists
Use advanced analytic platforms to:
• ingest and prepare data for analysis
• identify patterns in data
• build and deploy novel predictive apps
Business Analysts
Use advanced analytic platforms to:
• ingest and prepare data for analysis
• identify patterns in data
• build and deploy standard predictive apps
Business Consumers
Use embedded predictive results in frontline applications
Maximize Analytic Skills Through Collaboration
Why RapidMiner:
Fast Production Deployments
Why RapidMiner:
Share Code Across Teams
Why RapidMiner:
Efficiency & Collaboration
Why RapidMiner:
Design Predictive Analytics
Why RapidMiner:
Actions in Front Line Apps
RapidMiner Studio
RapidMiner Radoop
RapidMiner Streams
RapidMiner Cloud
RapidMiner Server
14
RapidMiner Radoop Architecture
Hadoop environment
Impala
(In-memory SQL)
Mahout
(Machine
Learning)
Pig
(Scripting)
HDFS
YARN MapReduce
Hive
(SQL)
PROGRAMMING
CODE
VISUAL
DEVELOPMENT
Radoop
Studio Server
Spark
(MLlib)
Ingestion Blending Modeling Deployment
Code-free design
in RapidMiner
with 70+ Operators
Optimized distributed
execution in Hadoop
environment
One-click push down to
Hadoop environment
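The idea of pushing computation down to where the data lives can be shown with a small sketch. It assumes nothing about how Radoop translates processes for Hive, Impala, or Spark, which the slide does not describe; an in-memory SQLite database stands in for the remote engine, and the `sales` table and its rows are made up for illustration.

```python
import sqlite3

# SQLite is only a stand-in for a remote engine such as Hive or Impala.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("east", 10.0), ("west", 5.0), ("east", 2.5)],
)

# Without push-down: every row travels to the client, which aggregates locally.
local_totals = {}
for region, amount in conn.execute("SELECT region, amount FROM sales"):
    local_totals[region] = local_totals.get(region, 0.0) + amount

# With push-down: the engine aggregates, and only one summary row per region returns.
pushed_totals = dict(
    conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region")
)

print(local_totals == pushed_totals)
```

Both paths give the same totals. The difference is how much data moves: the first transfers every row, the second only the aggregated result, which is what matters when the table sits in a large cluster.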
15
VISUAL
DEVELOPMENT
Streams
Studio Server
Apache Storm cluster
Message broker
Apache
Kafka
Amazon
SQS
or
Application
Cassandra MongoDB
Apache
Kafka
Application
push pull
pull
store
deploy process
as topology
monitor and
manage
Redis
RapidMiner Streams Architecture
Code-free design
in RapidMiner leveraging
1500+ Operators
Distributed execution in
Storm environment
One-click push down to
Storm environment
Node
Engine
Node
Node
Storm Topology
Node
Storm Topology
Node
Engine
Streams
Spout Bolt Bolt Bolt
Storm Topology
Bolt
Bolt
Ingestion Blending Modeling Deployment
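The spout and bolt chain in the diagram can be approximated with plain Python generators. This is a conceptual sketch, not the Apache Storm API and not how RapidMiner Streams builds its topologies: an in-memory list stands in for the message broker, another list for the data store, and the threshold "model" and all function names are hypothetical.

```python
def spout(messages):
    """Spout: emits raw tuples; the list stands in for a broker such as Kafka."""
    for message in messages:
        yield message


def parse_bolt(stream):
    """Bolt: turns 'sensor,value' strings into (sensor, float) tuples."""
    for line in stream:
        sensor, value = line.split(",")
        yield sensor, float(value)


def score_bolt(stream, threshold=50.0):
    """Bolt: applies a trivial stand-in model and flags values above the threshold."""
    for sensor, value in stream:
        yield {"sensor": sensor, "value": value, "alert": value > threshold}


def store_bolt(stream, store):
    """Bolt: writes each scored record to the store (a plain list here)."""
    for record in stream:
        store.append(record)


raw_messages = ["a,12.5", "b,71.0", "a,55.2"]
store = []

# Wire the topology: spout -> parse -> score -> store.
store_bolt(score_bolt(parse_bolt(spout(raw_messages))), store)

alerts = [record["sensor"] for record in store if record["alert"]]
print(alerts)
```

Each stage handles one record at a time and hands it on, which mirrors how tuples flow through bolts. In a real Storm cluster the stages would run in parallel across nodes rather than in a single process.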
16
www.rapidminer.com
Activating the data science hero
in every business analyst!
Michele Chambers
@mcAnalytics
mchambers@rapidminer.com

M Chambers and RapidMiner Overview for Babson class


Editor's Notes

  • #7, #8, #9: Secondary SkyTree Knime Pentaho Alpine Data Miner Datameer Google API RM 5.3 ? Amazon KXEN / SAP