SlideShare a Scribd company logo
TIBCO Advanced Analytics
Houston Energy Data Science
Meetup
Michael O’Connell
Chief Data Scientist
moconnell@tibco.com
@moc_tib
August 2015
• Data Science Process
• Data Analysis Pipeline
• Understand – Anticipate – Act
• Advanced Analytics
• TIBCO’s R engine
• GeoLocation Analytics
• Real-Time Analytics
• Remote Monitoring – the Digital Nervous System
• Software & APIs
• Wrap-Up / Questions
Increase
Productivity
Grow
Revenue
Value
Reduce
Risk
ROI
TIBCO Analytics – Insight to Action
© Copyright 2000-2015 TIBCO Software Inc.
“Data Science”
Engineer/Marketeer
“Address the
business issue”
Statistician
“Build the
best model”
IT / Developer
“Manage my
infrastructure”
Engineer/Marketeer:
Knows the business problem but
doesn’t know how to prepare data
or build models.
Statistician:
Knows how to develop appropriate
models to address business
problems but is in short supply and
can’t deploy IT or business
systems
IT / Developer:
Knows databases, application
provisioning and development tools
but isn’t familiar with data meaning
or analytical workflow purpose
What is a Data Scientist
© Copyright 2000-2015 TIBCO Software Inc.
Data Access
& Prep
Exploratory
Data Analysis
Features
Visual
Dashboard
Model &
Predict
Deploy
Champion
Model
Test &
Learn
Channel
Social
Loyalty
Campaign
Filter
Map
Merge
Shape
Propensity
Affinity
ImproveGuided -------- Deploy -------- In-LineExplore Data
Aggregate
Prepare DataBusiness Case
Increase
Productivity
Grow
Revenue
Ensemble
Forest
Regression
Additive
Models
Segment
Visualize
Pricing
Promotion
Challenger
Models
At Rest
In Motion
Value
Theses
Reduce
Risk
ROI
Value
Dashboard
Updates
Data a Insight a Action
© Copyright 2000-2015 TIBCO Software Inc.
Spotfire
Desktop
TIBCO Analytics Stack
Custom GUI-driven
data access via SDK
Enterprise Data Access
Siebel
eBusiness
Local data sources
AccessExcel STDF
Drag-and-drop
MySQL
SQL Server
Oracle
Information Services
(join, transform, reusable,
parameterized, dynamic query
for in-memory use)
Databases
JDBC/ODBC
Hadoop
SFDC
PostgreSQL
Teradata
Netezza
Etc.XML
RDBMS
Flat
Files
Spread-
sheets
Web
Services
Oracle
E-Business
RDBMS
RDBMS
RDBMS
SAP BWSAP R/3 D
A
T
A
F
A
B
R
I
C
Salesforce
ODBC
OLE DB
SqlClient
Direct
connection
Oracle
TeradataAsterMS SSAS
Teradata
Direct Query
(dynamically query and retrieve data
for visualization and analysis)
Databases
MySQL
Etc.
OBIEE
Netezza
Hadoop
© Copyright 2000-2015 TIBCO Software Inc.
Immediate
Long-Term
Competitive AdvantageValue to the Organization
TIBCO is the only analytics platform that provides business
value across the Analytics Spectrum
Self-service
Dashboards
Event Processing
Predictive and
Prescriptive Analytics
Measure Diagnose Predict Optimize Operationalize Automate
Analytics Maturity
Analytics Spectrum
Immediate
Long-Term
Competitive AdvantageValue to the Organization
TIBCO is the only analytics platform that provides business
value across the Analytics Spectrum
Self-service
Dashboards
Measure Diagnose Predict Optimize Operationalize Automate
Analytics Maturity
Analytics Spectrum
Predictive and
Prescriptive Analytics
Event Processing
Immediate
Long-Term
Competitive AdvantageValue to the Organization
TIBCO is the only analytics platform that provides business
value across the Analytics Spectrum
Self-service
Dashboards
Predictive and
Prescriptive Analytics
Measure Diagnose Predict Optimize Operationalize Automate
Analytics Maturity
Analytics Spectrum
Event Processing
© Copyright 2000-2015 TIBCO Software Inc. 10
Visual Analytics – Spotfire
Visual Analytics – Spotfire
3D rotate SurfacePolar
Contour Network Funnel
Spotfire Extensions – d3 and JS
© Copyright 2000-2015 TIBCO Software Inc.
Sankey
Venn
ChordDonut
Dials
Gantt
Visual Analytics – Dashboards
Visual Analytics – Dashboards
Visual Analytics – Dashboards
Visual Analytics – Dashboards
Visual Analytics – Dashboards
Dashboards and Themes
Dashboards and Themes
Dashboards and Themes
Jaspersoft Pixel-Perfect Embedded Reports
© Copyright 2000-2015 TIBCO Software Inc.
Analytic Workspaces & Analytic Fabric
APIs
Search,Sharingetc.
Business Analysts Report Developers
Analytic
Workspaces
Analytic
Fabric
Data Discovery Analytics Dashboards Reports
© Copyright 2000-2015 TIBCO Software Inc.
Spotfire is Super Simple to Use
US Homeless Analysis
Step-by-Step
YouTube Playlist
• Dashboards
• Predictive
• GeoLocation
© Copyright 2000-2015 TIBCO Software Inc.
Immediate
Long-Term
Competitive AdvantageValue to the Organization
TIBCO is the only analytics platform that provides business
value across the Analytics Spectrum
Self-service
Dashboards
Measure Diagnose Predict Optimize Operationalize Automate
Analytics Maturity
Analytics Spectrum
Predictive and
Prescriptive Analytics
Event Processing
Advanced Analytics Ecosystem
© Copyright 2000-2015 TIBCO Software Inc.
TIBCO Enterprise Runtime for R (TERR)
© Copyright 2000-2015 TIBCO Software Inc.
• TIBCO has rewritten R as a Commercial Compute Engine
• Latest statistics scripting engine: S a S-PLUS® a R a TERR
• Runs R code including CRAN packages
• Engine internals rebuilt from scratch at low-level
• Redesigned data objects, memory management
• High performance + Big Data
• TERR is licensed from TIBCO
• TERR Installs (free) with Spotfire Analyst / Desktop and other TIBCO products (CEP, Stats)
• Spotfire Server can manage all TERR / R scripts, artifacts for reuse
• Standalone Developer Edition: www.TIBCOmmunity.com
• Supported by TIBCO
Model Fitting: 5 Million Rows Model Scoring: 20 Million Rows
TERR 7X faster 84X
TERR Performance
© Copyright 2000-2015 TIBCO Software Inc.
Spotfire and TERR local TERR on server
Spotfire-TERR – Local and Server
• Build models on data using local
TERR engine embedded in
Spotfire
• Build models on big data directly in TERR on
server and display results in Spotfire
• Run TERR as parallel sessions on Hadoop cluster,
controlled and visualized in Spotfire
Data Source TERR
TSSS
Spotfire
Results
ODBC
JDBC
SDC
File
Data
Function
Larger Data
Modeling
Spotfire
Local
TERR
ODBC
JDBC
SDC
File
Data
Data Source
Both Spotfire and TERR can load data from any ODBC or JDBC compliant source or from
Spotfire Data Connections (SDC) or Spotfire Information Links stored in the Spotfire library.
© Copyright 2000-2015 TIBCO Software Inc.
© Copyright 2000-2015 TIBCO Software Inc.
Simple Predictive Analytics – Forecasting & Modeling
Contextual Analytics
- Forecasting
Contextual Analytics
- Machine Learning
Extensible Predictive Analytics – Analysis Workflows
Interactive Spotfire Analytics with R
- Data Function
- Robust Cluster Analysis
- Any Analysis in R / CRAN
Variables driving segments
- Random Forest
Revenue by product
- Color by segment
Free Scripts - GeoCluster [kmeans(x,y)]
Free Scripts - Contours [contourLines(x,y,z)]
Spotfire-TERR : Data Types, Analyses
Spotfire data functions support any
type of data as input and output
parameters to and from TERR.
TERR data functions used for data
prep, integration, predictive &
prescriptive analytics, …
TERR data functions can output
content metadata to Spotfire
• formatting of fields
• handling of binary data including
images and geospatial objects.
Rows
Columns
Values
Tables
Metadata
Blobs
Geometries
Images
Spotfire TERR
Data
Function
© Copyright 2000-2015 TIBCO Software Inc.
Trade Areas
Smart Routing
Smart Routing
Smart Routing
Production Forecasting
Forecast Production – Set Expected Production for Wells• Resource Play
• Repeatable distribution for EUR
• Offset not reliable predictor
• Continuous hydrocarbon system
• Free hydrocarbon not held in place by
hydrodynamics
• Geologic Subset
• Analogous Wells
• Geology, completion, spacing, vintage
• Analysis and Data
• Production forecasting (EUR)
• Probability of production
• Proven (P90), Probable (P50), Possible (P10)
• Cluster and Regression Analysis
© Copyright 2000-2015 TIBCO Software Inc.
Proven, Probable and Possible Production• Resource Play
• Repeatable distribution for EUR
• Offset not reliable predictor
• Continuous hydrocarbon system
• Free hydrocarbon not held in place by
hydrodynamics
• Geologic Subset
• Analogous Wells
• Geology, completion, spacing, vintage
• Analysis and Data
• Production forecasting (EUR)
• Probability of production
• Proven (P90), Probable (P50), Possible (P10)
• Cluster and Regression Analysis
Probability: Proven & Probable Production
© Copyright 2000-2015 TIBCO Software Inc.
Completions Optimization
• Business Opportunities
• Completions optimization by well
• Production prediction for new wells
• Identify factors driving production vs
expected production e.g. operator
• Analysis and Data
• Subsurface (e.g. Spectra)
• Location
• Completions
• Production
• Value and Financial Impact
• Optimal completions
• Operations management
• Asset valuation & “where to drill”
Optimize Completions – Location, Subsurface
© Copyright 2000-2015 TIBCO Software Inc.
41
© Copyright 2000-2014 TIBCO Software Inc.
• Business Opportunities
• Maintenance optimization
• Analysis and Data
• Failure times and locations
• Maintenance and failure costs
• Root cause analysis
• Value and Financial Impact
• Visibility into maintenance
expenses and root causes
• Optimal maintenance scheduling
Maintenance Optimization
Equipment Reliability - Refining
Winner of 2014 Strata Cloudera Award
For Best Advanced Analytics Application
Big Data Analytics with Spotfire and TERR
© Copyright 2000-2015 TIBCO Software Inc.
Big Data Analytics with TERR
TERR on the nodes of Hadoop Cluster
TERR in Action
• Hadoop cluster compute
• TIBCO Cloud Compute Grid
• TIBCO Streambase
• TIBCO Business Events
• KNIME
• Lavastorm
• Rstudio
• Teradata
• TIBCO Statistics Services
• TIBCO Spotfire
© Copyright 2000-2015 TIBCO Software Inc.
© Copyright 2000-2015 TIBCO Software Inc.
Predictive & Collaborative Analytics
Library of Data Functions – everyone Shares
• Analysts use functions – no code
• Coders develop new functions – R
Data Function Samples
• Ship with Spotfire Server
• Geospatial
• Computations with polygons on a map
• Computing optimal routes in logistics
• Machine Learning
• Fitting models and making predictions
• Applications
• Customers, Finance, Machines, …
IT View - GovernanceUser View - Functions
Immediate
Long-Term
Competitive AdvantageValue to the Organization
TIBCO is the only analytics platform that provides business
value across the Analytics Spectrum
Self-service
Dashboards
Predictive and
Prescriptive Analytics
Measure Diagnose Predict Optimize Operationalize Automate
Analytics Maturity
Analytics Spectrum
Event Processing
BIG DATA
AT REST
FAST DATA
IN MOTION
Insight to Action
© Copyright 2000-2015 TIBCO Software Inc.
Analyze And Act On “Critical Business Moments”
Optimize
pricing Check for
fraud
Make offer
to customer
Restock
inventory
Reroute
transport
Give customer
service
Proactively
maintain machines
© Copyright 2000-2015 TIBCO Software Inc.
Big Data
– Analysis of production
– Analysis of contracts and product
inventory
Fast Data
– Location data from ships and
trains, weather and tides
– Manage product supply
– Optimize fuel use
Benefits
– Optimize product contracts
– Maximize product shipped
– Minimize logistics cost
Managing Supply Chain
Managing Supply Chain
Managing Industrial Equipment
Big Data
– Analysis of production
– Failure analytics
Fast Data
– Real-time sensor data
– Leading indicator for shutdowns
– Drilling: kick detection
– Flow monitoring
Benefits
– Reduced NPT: Big $$s
– System reliability
– Efficient drilling
Data Monitoring
• Motor temperature
• Motor vibration
• Current
• Intake pressure
• Intake temperature
 Flow
Electrical power cable
Pump
Intake
Protector
ESP motor
Pump monitoring unit
Pump Components
Equipment Monitoring & Management
Video: https://youtu.be/vIVepQRl5SY
• Business Opportunities
• Pump health & performance surveillance
• Condition-based maintenance
• Analysis and Data
• Effects of operating conditions on performance
• Effects of suppliers on reliability
• Component faults and failure analysis
• Value and Financial Impact
• Prioritization of engineering and retrofit
• Supplier involvement in system reliability
• ID systems for Engineering focus
• Warranty cost recovery
Equipment Monitoring & Management
Video: https://youtu.be/vIVepQRl5SY
Equipment Monitoring & Management
Video: https://youtu.be/vIVepQRl5SY
Trend Analysis
Combination of Rules
CUSUM Analysis
Statistical Analysis
Statistical Process Control
Machine Learning
Location Change
– Variable moves up or down
Slope Change
– Variable changes trend
Variance Change
– Variable becomes more/less volatile
Process Threshold
– Shewhart control chart
Failure Model
y (0/1) = f (X, b) + e; f = logistic regression, trees, svm, nnet, ...
Sensor Analytics
1. Analytics models
2. Data streams
3. Calculations on live data
4. Analysis notifications
Fast Data Analytics
Video: https://youtu.be/vIVepQRl5SY
Live Data
Video: https://youtu.be/vIVepQRl5SY
Alerting In The Field
Crowdsourcing Solutions
Industrial Equipment Management Improves Operations
IT & Governance
© Copyright 2000-2015 TIBCO Software Inc.© Copyright 2000-2014 TIBCO Software Inc.
• Library Services
• Centralized management of Spotfire analysis files,
metadata, information links, TERR scripts, …
• User Services
• User authentication, role-based authorization
• Audit Services
• Content access, modification, deletion
• User authentication, data access, library operations
• Usage Log Analytics
• Sessions, Users, Admin, Local Files
• Library, Information Links, Admin, Detailed Logs
• Analysis Profiler
• Automate every analysis file during upgrade / migration
© Copyright 2000-2015 TIBCO Software Inc.
Tibco’s Fast Data Platform Architecture
Learn how some of the major players in
the energy industry are using Spotfire to
revolutionize their business:
• How to minimize risks by better
understanding exposure to asset
integrity issues
• Using analytics to control margins
and conduct customer profiling
• Leveraging forensics to reduce NPT
and monitor production
• Production optimization techniques
http://energyforum.tibco.com/
Energy Forum
September 1st – 2nd | Norris Conference Center | Houston, TX
spotfire.tibco.com/demos
spotfire.tibco.com/tips/
tibco.com/blog/tag/trends-and-outliers/
www.tibcommunity.com
Resources spotfire.tibco.com
Monthly Knowledge Share
Hosted by Quintus
Linked In hosted by Syntelli
LinkedIn
Webcasts
Insight and Action - Analyzing Your OSIsoft
PI System Data
Tuesday, July 7, 2015 1 PM EST
Presenter: Michael O'Connell & Dave Leigh
Predictive Analytics in the Energy Sector:
Asset Valuation
Tuesday, July 28, 2015 1PM EST
Presenter: Michael O'Connell & Peter Shaw with
Haas Engineering and R Lacy
Seeing Stars: the Gartner BI Bakeoff
Recording, May 27, 2015
Presenter: Anna Nowakowska & Michael
O'Connell
Events spotfire.tibco.com/about-us/events
66
© Copyright 2000-2014 TIBCO Software Inc.
Spotfire Ecosystem
Thank you!
Michael O’Connell, PhD
Chief Data Scientist
TIBCO
moconnell@tibco.com
@moc_tib
http://about.me/moconnell
+1-919-7401560
First to Insight, First to Action
© Copyright 2000-2015 TIBCO Software Inc.

More Related Content

PPTX
TIBCO Advanced Analytics Meetup (TAAM) November 2015
Bipin Singh
 
PDF
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
Bipin Singh
 
PDF
Extending the Reach of R to the Enterprise with TERR and Spotfire
Lou Bajuk
 
PPT
Getting the most out of Tibco Spotfire
Herwig Van Marck
 
PPTX
TIBCO Spotfire deck
syncsite1
 
PPTX
Spotfire
Sudarsan Desikan
 
PDF
HiTech Manufacturing Use Cases/Examples
TIBCO Spotfire
 
PDF
TIBCO Spotfire: Data Science in the Enterprise
TIBCO Spotfire
 
TIBCO Advanced Analytics Meetup (TAAM) November 2015
Bipin Singh
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
Bipin Singh
 
Extending the Reach of R to the Enterprise with TERR and Spotfire
Lou Bajuk
 
Getting the most out of Tibco Spotfire
Herwig Van Marck
 
TIBCO Spotfire deck
syncsite1
 
HiTech Manufacturing Use Cases/Examples
TIBCO Spotfire
 
TIBCO Spotfire: Data Science in the Enterprise
TIBCO Spotfire
 

What's hot (19)

PDF
Journey to Creating a 360 View of the Customer: Implementing Big Data Strateg...
Databricks
 
PDF
The case of vehicle networking financial services accomplished by China Mobile
DataWorks Summit
 
PDF
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
Databricks
 
PPTX
Democratizing data science Using spark, hive and druid
DataWorks Summit
 
PDF
Life is but a Stream
Databricks
 
PDF
Stream Scaling in Pravega
DataWorks Summit
 
PDF
Managing R&D Data on Parallel Compute Infrastructure
Databricks
 
PDF
Phar Data Platform: From the Lakehouse Paradigm to the Reality
Databricks
 
PDF
Snowflakes in the Cloud Real world experience on a new approach for Big Data
DevFest DC
 
PDF
Pivotal Real Time Data Stream Analytics
kgshukla
 
PDF
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Databricks
 
PDF
The Keys to Digital Transformation
MapR Technologies
 
PDF
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
Big Data Spain
 
PPTX
Airline reservations and routing: a graph use case
DataWorks Summit
 
PDF
Building the Autodesk Design Graph-(Yotto Koga, Autodesk)
Spark Summit
 
PDF
IBM Cloud Native Day April 2021: Serverless Data Lake
Torsten Steinbach
 
PDF
Introducing Databricks Delta
Databricks
 
PPTX
IBM THINK 2020 - Cloud Data Lake with IBM Cloud Data Services
Torsten Steinbach
 
PPTX
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
DataWorks Summit
 
Journey to Creating a 360 View of the Customer: Implementing Big Data Strateg...
Databricks
 
The case of vehicle networking financial services accomplished by China Mobile
DataWorks Summit
 
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
Databricks
 
Democratizing data science Using spark, hive and druid
DataWorks Summit
 
Life is but a Stream
Databricks
 
Stream Scaling in Pravega
DataWorks Summit
 
Managing R&D Data on Parallel Compute Infrastructure
Databricks
 
Phar Data Platform: From the Lakehouse Paradigm to the Reality
Databricks
 
Snowflakes in the Cloud Real world experience on a new approach for Big Data
DevFest DC
 
Pivotal Real Time Data Stream Analytics
kgshukla
 
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Databricks
 
The Keys to Digital Transformation
MapR Technologies
 
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
Big Data Spain
 
Airline reservations and routing: a graph use case
DataWorks Summit
 
Building the Autodesk Design Graph-(Yotto Koga, Autodesk)
Spark Summit
 
IBM Cloud Native Day April 2021: Serverless Data Lake
Torsten Steinbach
 
Introducing Databricks Delta
Databricks
 
IBM THINK 2020 - Cloud Data Lake with IBM Cloud Data Services
Torsten Steinbach
 
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
DataWorks Summit
 
Ad

Viewers also liked (20)

PPTX
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
Grid Dynamics
 
PPTX
Vert.x for Microservices Architecture
Idan Fridman
 
PDF
Generalized B2B Machine Learning by Andrew Waage
Data Con LA
 
PPTX
NTT SIC marketplace slide deck at Tokyo Summit
Toshikazu Ichikawa
 
PPTX
BMC Engage 2015: IT Asset Management - An essential pillar for the digital en...
Jon Stevens-Hall
 
PDF
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
DATAVERSITY
 
PPTX
KD2017_System Center in the "cloud first" era
Tomica Kaniski
 
PDF
Status Quo on the automation support in SOA Suite OGhTech17
Jon Petter Hjulstad
 
PPT
Chapter 3 Computer Crimes
Mar Soriano
 
PPTX
Cloud Camp: Infrastructure as a service advance workloads
Asaf Nakash
 
PPT
Water resources
Emily Kissner
 
PPTX
I1 - Securing Office 365 and Microsoft Azure like a rockstar (or like a group...
SPS Paris
 
PPTX
De Persgroep Big Data Expo
BigDataExpo
 
PPTX
Oracle cloud, private, public and hybrid
Johan Louwers
 
PDF
Oracle Cloud Café IoT 12-APR-2016
Jean-Marc Hui Bon Hoa
 
PPTX
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
BigDataExpo
 
PPTX
Gastles PXL Hogeschool 2017
Bart Van Den Brande
 
PDF
SRE Study Notes - CH2,3,4
Rick Hwang
 
PDF
Bol.com
BigDataExpo
 
PDF
Info qiy foundation digital me - dappre-eng-aug17
BigDataExpo
 
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
Grid Dynamics
 
Vert.x for Microservices Architecture
Idan Fridman
 
Generalized B2B Machine Learning by Andrew Waage
Data Con LA
 
NTT SIC marketplace slide deck at Tokyo Summit
Toshikazu Ichikawa
 
BMC Engage 2015: IT Asset Management - An essential pillar for the digital en...
Jon Stevens-Hall
 
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
DATAVERSITY
 
KD2017_System Center in the "cloud first" era
Tomica Kaniski
 
Status Quo on the automation support in SOA Suite OGhTech17
Jon Petter Hjulstad
 
Chapter 3 Computer Crimes
Mar Soriano
 
Cloud Camp: Infrastructure as a service advance workloads
Asaf Nakash
 
Water resources
Emily Kissner
 
I1 - Securing Office 365 and Microsoft Azure like a rockstar (or like a group...
SPS Paris
 
De Persgroep Big Data Expo
BigDataExpo
 
Oracle cloud, private, public and hybrid
Johan Louwers
 
Oracle Cloud Café IoT 12-APR-2016
Jean-Marc Hui Bon Hoa
 
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
BigDataExpo
 
Gastles PXL Hogeschool 2017
Bart Van Den Brande
 
SRE Study Notes - CH2,3,4
Rick Hwang
 
Bol.com
BigDataExpo
 
Info qiy foundation digital me - dappre-eng-aug17
BigDataExpo
 
Ad

Similar to Houston Energy Data Science Meet up_TIBCO Slides (20)

PDF
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Nicola Sandoli
 
PDF
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
Chief Analytics Officer Forum
 
PDF
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Kai Wähner
 
PDF
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
Matt Stubbs
 
PDF
TERR in BI and Real Time applications
Lou Bajuk
 
PDF
Applying R in BI and Real Time applications EARL London 2015
Lou Bajuk
 
PPTX
Applying the R Language to BI and Real Time Applications
Lou Bajuk
 
PDF
Oracle Analytics Cloud
Joseph Alaimo Jr
 
PDF
Deploying R in BI and Real time Applications
Lou Bajuk
 
PDF
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
Codemotion
 
PDF
Sensor Data Management & Analytics: Advanced Process Control
TIBCO_Software
 
PPTX
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
Avkash Chauhan
 
PDF
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Big Data Spain
 
PDF
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Kai Wähner
 
PDF
AI Foundations: Simpler Technologies, Smarter Business
TIBCO_Software
 
PDF
Big data for Telco: opportunity or threat?
Swiss Big Data User Group
 
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo
 
PPTX
StreamCentral for the IT Professional
Raheel Retiwalla
 
PPTX
IT and OT Convergence
OpsRamp
 
Tibco Augmented Intelligence - Analytics, IoT, Big Data, Streaming 20161025
Nicola Sandoli
 
TIBCO presentation at the Chief Analytics Officer Forum East Coast 2016 (#CAO...
Chief Analytics Officer Forum
 
Data Preparation vs. Inline Data Wrangling in Data Science and Machine Learning
Kai Wähner
 
Big Data LDN 2017: How Big Data Insights Become Easily Accessible With Workfl...
Matt Stubbs
 
TERR in BI and Real Time applications
Lou Bajuk
 
Applying R in BI and Real Time applications EARL London 2015
Lou Bajuk
 
Applying the R Language to BI and Real Time Applications
Lou Bajuk
 
Oracle Analytics Cloud
Joseph Alaimo Jr
 
Deploying R in BI and Real time Applications
Lou Bajuk
 
How to Apply Big Data Analytics and Machine Learning to Real Time Processing ...
Codemotion
 
Sensor Data Management & Analytics: Advanced Process Control
TIBCO_Software
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
Avkash Chauhan
 
Stream Processing as Game Changer for Big Data and Internet of Things by Kai ...
Big Data Spain
 
Streaming Analytics Comparison of Open Source Frameworks, Products, Cloud Ser...
Kai Wähner
 
AI Foundations: Simpler Technologies, Smarter Business
TIBCO_Software
 
Big data for Telco: opportunity or threat?
Swiss Big Data User Group
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Denodo
 
StreamCentral for the IT Professional
Raheel Retiwalla
 
IT and OT Convergence
OpsRamp
 

Recently uploaded (20)

PPT
Activate_Methodology_Summary presentatio
annapureddyn
 
PDF
49785682629390197565_LRN3014_Migrating_the_Beast.pdf
Abilash868456
 
PDF
Enhancing Healthcare RPM Platforms with Contextual AI Integration
Cadabra Studio
 
PPTX
AI-Ready Handoff: Auto-Summaries & Draft Emails from MQL to Slack in One Flow
bbedford2
 
PDF
49784907924775488180_LRN2959_Data_Pump_23ai.pdf
Abilash868456
 
PDF
Key Features to Look for in Arizona App Development Services
Net-Craft.com
 
PPTX
Visualising Data with Scatterplots in IBM SPSS Statistics.pptx
Version 1 Analytics
 
PPTX
Presentation about variables and constant.pptx
kr2589474
 
PDF
Generating Union types w/ Static Analysis
K. Matthew Dupree
 
PPTX
Can You Build Dashboards Using Open Source Visualization Tool.pptx
Varsha Nayak
 
PDF
Salesforce Implementation Services Provider.pdf
VALiNTRY360
 
PDF
ShowUs: Pharo Stream Deck (ESUG 2025, Gdansk)
ESUG
 
PPTX
PFAS Reporting Requirements 2026 Are You Submission Ready Certivo.pptx
Certivo Inc
 
PDF
vAdobe Premiere Pro 2025 (v25.2.3.004) Crack Pre-Activated Latest
imang66g
 
PDF
WatchTraderHub - Watch Dealer software with inventory management and multi-ch...
WatchDealer Pavel
 
PPTX
classification of computer and basic part of digital computer
ravisinghrajpurohit3
 
PDF
Applitools Platform Pulse: What's New and What's Coming - July 2025
Applitools
 
PDF
MiniTool Power Data Recovery Crack New Pre Activated Version Latest 2025
imang66g
 
PPTX
Contractor Management Platform and Software Solution for Compliance
SHEQ Network Limited
 
PDF
Bandai Playdia The Book - David Glotz
BluePanther6
 
Activate_Methodology_Summary presentatio
annapureddyn
 
49785682629390197565_LRN3014_Migrating_the_Beast.pdf
Abilash868456
 
Enhancing Healthcare RPM Platforms with Contextual AI Integration
Cadabra Studio
 
AI-Ready Handoff: Auto-Summaries & Draft Emails from MQL to Slack in One Flow
bbedford2
 
49784907924775488180_LRN2959_Data_Pump_23ai.pdf
Abilash868456
 
Key Features to Look for in Arizona App Development Services
Net-Craft.com
 
Visualising Data with Scatterplots in IBM SPSS Statistics.pptx
Version 1 Analytics
 
Presentation about variables and constant.pptx
kr2589474
 
Generating Union types w/ Static Analysis
K. Matthew Dupree
 
Can You Build Dashboards Using Open Source Visualization Tool.pptx
Varsha Nayak
 
Salesforce Implementation Services Provider.pdf
VALiNTRY360
 
ShowUs: Pharo Stream Deck (ESUG 2025, Gdansk)
ESUG
 
PFAS Reporting Requirements 2026 Are You Submission Ready Certivo.pptx
Certivo Inc
 
vAdobe Premiere Pro 2025 (v25.2.3.004) Crack Pre-Activated Latest
imang66g
 
WatchTraderHub - Watch Dealer software with inventory management and multi-ch...
WatchDealer Pavel
 
classification of computer and basic part of digital computer
ravisinghrajpurohit3
 
Applitools Platform Pulse: What's New and What's Coming - July 2025
Applitools
 
MiniTool Power Data Recovery Crack New Pre Activated Version Latest 2025
imang66g
 
Contractor Management Platform and Software Solution for Compliance
SHEQ Network Limited
 
Bandai Playdia The Book - David Glotz
BluePanther6
 

Houston Energy Data Science Meet up_TIBCO Slides

  • 1. TIBCO Advanced Analytics Houston Energy Data Science Meetup Michael O’Connell Chief Data Scientist [email protected] @moc_tib August 2015
  • 2. • Data Science Process • Data Analysis Pipeline • Understand – Anticipate – Act • Advanced Analytics • TIBCO’s R engine • GeoLocation Analytics • Real-Time Analytics • Remote Monitoring – the Digital Nervous System • Software & APIs • Wrap-Up / Questions Increase Productivity Grow Revenue Value Reduce Risk ROI TIBCO Analytics – Insight to Action © Copyright 2000-2015 TIBCO Software Inc.
  • 3. “Data Science” Engineer/Marketeer “Address the business issue” Statistician “Build the best model” IT / Developer “Manage my infrastructure” Engineer/Marketeer: Knows the business problem but doesn’t know how to prepare data or build models. Statistician: Knows how to develop appropriate models to address business problems but is in short supply and can’t deploy IT or business systems IT / Developer: Knows databases, application provisioning and development tools but isn’t familiar with data meaning or analytical workflow purpose What is a Data Scientist © Copyright 2000-2015 TIBCO Software Inc.
  • 4. Data Access & Prep Exploratory Data Analysis Features Visual Dashboard Model & Predict Deploy Champion Model Test & Learn Channel Social Loyalty Campaign Filter Map Merge Shape Propensity Affinity ImproveGuided -------- Deploy -------- In-LineExplore Data Aggregate Prepare DataBusiness Case Increase Productivity Grow Revenue Ensemble Forest Regression Additive Models Segment Visualize Pricing Promotion Challenger Models At Rest In Motion Value Theses Reduce Risk ROI Value Dashboard Updates Data a Insight a Action © Copyright 2000-2015 TIBCO Software Inc.
  • 6. Custom GUI-driven data access via SDK Enterprise Data Access Siebel eBusiness Local data sources AccessExcel STDF Drag-and-drop MySQL SQL Server Oracle Information Services (join, transform, reusable, parameterized, dynamic query for in-memory use) Databases JDBC/ODBC Hadoop SFDC PostgreSQL Teradata Netezza Etc.XML RDBMS Flat Files Spread- sheets Web Services Oracle E-Business RDBMS RDBMS RDBMS SAP BWSAP R/3 D A T A F A B R I C Salesforce ODBC OLE DB SqlClient Direct connection Oracle TeradataAsterMS SSAS Teradata Direct Query (dynamically query and retrieve data for visualization and analysis) Databases MySQL Etc. OBIEE Netezza Hadoop © Copyright 2000-2015 TIBCO Software Inc.
  • 7. Immediate Long-Term Competitive AdvantageValue to the Organization TIBCO is the only analytics platform that provides business value across the Analytics Spectrum Self-service Dashboards Event Processing Predictive and Prescriptive Analytics Measure Diagnose Predict Optimize Operationalize Automate Analytics Maturity Analytics Spectrum
  • 8. Immediate Long-Term Competitive AdvantageValue to the Organization TIBCO is the only analytics platform that provides business value across the Analytics Spectrum Self-service Dashboards Measure Diagnose Predict Optimize Operationalize Automate Analytics Maturity Analytics Spectrum Predictive and Prescriptive Analytics Event Processing
  • 9. Immediate Long-Term Competitive AdvantageValue to the Organization TIBCO is the only analytics platform that provides business value across the Analytics Spectrum Self-service Dashboards Predictive and Prescriptive Analytics Measure Diagnose Predict Optimize Operationalize Automate Analytics Maturity Analytics Spectrum Event Processing
  • 10. © Copyright 2000-2015 TIBCO Software Inc. 10 Visual Analytics – Spotfire
  • 11. Visual Analytics – Spotfire 3D rotate SurfacePolar Contour Network Funnel
  • 12. Spotfire Extensions – d3 and JS © Copyright 2000-2015 TIBCO Software Inc. Sankey Venn ChordDonut Dials Gantt
  • 13. Visual Analytics – Dashboards
  • 14. Visual Analytics – Dashboards
  • 15. Visual Analytics – Dashboards
  • 16. Visual Analytics – Dashboards
  • 17. Visual Analytics – Dashboards
  • 21. Jaspersoft Pixel-Perfect Embedded Reports © Copyright 2000-2015 TIBCO Software Inc.
  • 22. Analytic Workspaces & Analytic Fabric APIs Search,Sharingetc. Business Analysts Report Developers Analytic Workspaces Analytic Fabric Data Discovery Analytics Dashboards Reports © Copyright 2000-2015 TIBCO Software Inc.
  • 23. Spotfire is Super Simple to Use US Homeless Analysis Step-by-Step YouTube Playlist • Dashboards • Predictive • GeoLocation © Copyright 2000-2015 TIBCO Software Inc.
  • 24. Immediate Long-Term Competitive AdvantageValue to the Organization TIBCO is the only analytics platform that provides business value across the Analytics Spectrum Self-service Dashboards Measure Diagnose Predict Optimize Operationalize Automate Analytics Maturity Analytics Spectrum Predictive and Prescriptive Analytics Event Processing
  • 25. Advanced Analytics Ecosystem © Copyright 2000-2015 TIBCO Software Inc.
  • 26. TIBCO Enterprise Runtime for R (TERR) © Copyright 2000-2015 TIBCO Software Inc. • TIBCO has rewritten R as a Commercial Compute Engine • Latest statistics scripting engine: S a S-PLUS® a R a TERR • Runs R code including CRAN packages • Engine internals rebuilt from scratch at low-level • Redesigned data objects, memory management • High performance + Big Data • TERR is licensed from TIBCO • TERR Installs (free) with Spotfire Analyst / Desktop and other TIBCO products (CEP, Stats) • Spotfire Server can manage all TERR / R scripts, artifacts for reuse • Standalone Developer Edition: www.TIBCOmmunity.com • Supported by TIBCO
  • 27. Model Fitting: 5 Million Rows Model Scoring: 20 Million Rows TERR 7X faster 84X TERR Performance © Copyright 2000-2015 TIBCO Software Inc.
  • 28. Spotfire and TERR local TERR on server Spotfire-TERR – Local and Server • Build models on data using local TERR engine embedded in Spotfire • Build models on big data directly in TERR on server and display results in Spotfire • Run TERR as parallel sessions on Hadoop cluster, controlled and visualized in Spotfire Data Source TERR TSSS Spotfire Results ODBC JDBC SDC File Data Function Larger Data Modeling Spotfire Local TERR ODBC JDBC SDC File Data Data Source Both Spotfire and TERR can load data from any ODBC or JDBC compliant source or from Spotfire Data Connections (SDC) or Spotfire Information Links stored in the Spotfire library. © Copyright 2000-2015 TIBCO Software Inc.
  • 29. © Copyright 2000-2015 TIBCO Software Inc. Simple Predictive Analytics – Forecasting & Modeling Contextual Analytics - Forecasting Contextual Analytics - Machine Learning
  • 30. Extensible Predictive Analytics – Analysis Workflows Interactive Spotfire Analytics with R - Data Function - Robust Cluster Analysis - Any Analysis in R / CRAN Variables driving segments - Random Forest Revenue by product - Color by segment
  • 31. Free Scripts - GeoCluster [kmeans(x,y)]
  • 32. Free Scripts - Contours [contourLines(x,y,z)]
  • 33. Spotfire-TERR : Data Types, Analyses Spotfire data functions support any type of data as input and output parameters to and from TERR. TERR data functions used for data prep, integration, predictive & prescriptive analytics, … TERR data functions can output content metadata to Spotfire • formatting of fields • handling of binary data including images and geospatial objects. Rows Columns Values Tables Metadata Blobs Geometries Images Spotfire TERR Data Function © Copyright 2000-2015 TIBCO Software Inc.
  • 38. Production Forecasting Forecast Production – Set Expected Production for Wells• Resource Play • Repeatable distribution for EUR • Offset not reliable predictor • Continuous hydrocarbon system • Free hydrocarbon not held in place by hydrodynamics • Geologic Subset • Analogous Wells • Geology, completion, spacing, vintage • Analysis and Data • Production forecasting (EUR) • Probability of production • Proven (P90), Probable (P50), Possible (P10) • Cluster and Regression Analysis © Copyright 2000-2015 TIBCO Software Inc.
  • 39. Proven, Probable and Possible Production• Resource Play • Repeatable distribution for EUR • Offset not reliable predictor • Continuous hydrocarbon system • Free hydrocarbon not held in place by hydrodynamics • Geologic Subset • Analogous Wells • Geology, completion, spacing, vintage • Analysis and Data • Production forecasting (EUR) • Probability of production • Proven (P90), Probable (P50), Possible (P10) • Cluster and Regression Analysis Probability: Proven & Probable Production © Copyright 2000-2015 TIBCO Software Inc.
  • 40. Completions Optimization • Business Opportunities • Completions optimization by well • Production prediction for new wells • Identify factors driving production vs expected production e.g. operator • Analysis and Data • Subsurface (e.g. Spectra) • Location • Completions • Production • Value and Financial Impact • Optimal completions • Operations management • Asset valuation & “where to drill” Optimize Completions – Location, Subsurface © Copyright 2000-2015 TIBCO Software Inc.
  • 41. 41 © Copyright 2000-2014 TIBCO Software Inc. • Business Opportunities • Maintenance optimization • Analysis and Data • Failure times and locations • Maintenance and failure costs • Root cause analysis • Value and Financial Impact • Visibility into maintenance expenses and root causes • Optimal maintenance scheduling Maintenance Optimization Equipment Reliability - Refining
  • 42. Winner of 2014 Strata Cloudera Award For Best Advanced Analytics Application Big Data Analytics with Spotfire and TERR © Copyright 2000-2015 TIBCO Software Inc.
  • 43. Big Data Analytics with TERR TERR on the nodes of Hadoop Cluster TERR in Action • Hadoop cluster compute • TIBCO Cloud Compute Grid • TIBCO Streambase • TIBCO Business Events • KNIME • Lavastorm • Rstudio • Teradata • TIBCO Statistics Services • TIBCO Spotfire © Copyright 2000-2015 TIBCO Software Inc.
  • 44. © Copyright 2000-2015 TIBCO Software Inc. Predictive & Collaborative Analytics Library of Data Functions – everyone Shares • Analysts use functions – no code • Coders develop new functions – R Data Function Samples • Ship with Spotfire Server • Geospatial • Computations with polygons on a map • Computing optimal routes in logistics • Machine Learning • Fitting models and making predictions • Applications • Customers, Finance, Machines, … IT View - GovernanceUser View - Functions
  • 45. Immediate Long-Term Competitive AdvantageValue to the Organization TIBCO is the only analytics platform that provides business value across the Analytics Spectrum Self-service Dashboards Predictive and Prescriptive Analytics Measure Diagnose Predict Optimize Operationalize Automate Analytics Maturity Analytics Spectrum Event Processing
  • 46. BIG DATA AT REST FAST DATA IN MOTION Insight to Action © Copyright 2000-2015 TIBCO Software Inc.
  • 47. Analyze And Act On “Critical Business Moments” Optimize pricing Check for fraud Make offer to customer Restock inventory Reroute transport Give customer service Proactively maintain machines © Copyright 2000-2015 TIBCO Software Inc.
  • 48. Big Data – Analysis of production – Analysis of contracts and product inventory Fast Data – Location data from ships and trains, weather and tides – Manage product supply – Optimize fuel use Benefits – Optimize product contracts – Maximize product shipped – Minimize logistics cost Managing Supply Chain
  • 50. Managing Industrial Equipment Big Data – Analysis of production – Failure analytics Fast Data – Real-time sensor data – Leading indicator for shutdowns – Drilling: kick detection – Flow monitoring Benefits – Reduced NPT: Big $$s – System reliability – Efficient drilling
  • 51. Data Monitoring • Motor temperature • Motor vibration • Current • Intake pressure • Intake temperature  Flow Electrical power cable Pump Intake Protector ESP motor Pump monitoring unit Pump Components Equipment Monitoring & Management Video: https://youtu.be/vIVepQRl5SY
  • 52. • Business Opportunities • Pump health & performance surveillance • Condition-based maintenance • Analysis and Data • Effects of operating conditions on performance • Effects of suppliers on reliability • Component faults and failure analysis • Value and Financial Impact • Prioritization of engineering and retrofit • Supplier involvement in system reliability • ID systems for Engineering focus • Warranty cost recovery Equipment Monitoring & Management Video: https://youtu.be/vIVepQRl5SY
  • 53. Equipment Monitoring & Management Video: https://youtu.be/vIVepQRl5SY
  • 54. Trend Analysis Combination of Rules CUSUM Analysis Statistical Analysis Statistical Process Control Machine Learning Location Change – Variable moves up or down Slope Change – Variable changes trend Variance Change – Variable becomes more/less volatile Process Threshold – Shewhart control chart Failure Model y (0/1) = f (X, b) + e; f = logistic regression, trees, svm, nnet, ... Sensor Analytics
  • 55. 1. Analytics models 2. Data streams 3. Calculations on live data 4. Analysis notifications Fast Data Analytics Video: https://youtu.be/vIVepQRl5SY
  • 59. Industrial Equipment Management Improves Operations
  • 60. IT & Governance © Copyright 2000-2015 TIBCO Software Inc.© Copyright 2000-2014 TIBCO Software Inc. • Library Services • Centralized management of Spotfire analysis files, metadata, information links, TERR scripts, … • User Services • User authentication, role-based authorization • Audit Services • Content access, modification, deletion • User authentication, data access, library operations • Usage Log Analytics • Sessions, Users, Admin, Local Files • Library, Information Links, Admin, Detailed Logs • Analysis Profiler • Automate every analysis file during upgrade / migration
  • 61. © Copyright 2000-2015 TIBCO Software Inc. Tibco’s Fast Data Platform Architecture
  • 62. Learn how some of the major players in the energy industry are using Spotfire to revolutionize their business: • How to minimize risks by better understanding exposure to asset integrity issues • Using analytics to control margins and conduct customer profiling • Leveraging forensics to reduce NPT and monitor production • Production optimization techniques http://energyforum.tibco.com/ Energy Forum September 1st – 2nd | Norris Conference Center | Houston, TX
  • 64. Monthly Knowledge Share Hosted by Quintus Linked In hosted by Syntelli LinkedIn
  • 65. Webcasts Insight and Action - Analyzing Your OSIsoft PI System Data Tuesday, July 7, 2015 1 PM EST Presenter: Michael O'Connell & Dave Leigh Predictive Analytics in the Energy Sector: Asset Valuation Tuesday, July 28, 2015 1PM EST Presenter: Michael O'Connell & Peter Shaw with Haas Engineering and R Lacy Seeing Stars: the Gartner BI Bakeoff Recording, May 27, 2015 Presenter: Anna Nowakowska & Michael O'Connell Events spotfire.tibco.com/about-us/events
  • 66. 66 © Copyright 2000-2014 TIBCO Software Inc. Spotfire Ecosystem
  • 67. Thank you! Michael O’Connell, PhD Chief Data Scientist TIBCO [email protected] @moc_tib http://about.me/moconnell +1-919-7401560 First to Insight, First to Action © Copyright 2000-2015 TIBCO Software Inc.

Editor's Notes

  • #11: Visual Analytics For exploratory analysis And publication reporting
  • #48: Finally, one of the most valuable initiatives, which builds on the previous one, is the ability to sense, respond and influence business moments. Business moments are situations of interest, opportunities for the business to marry insights from big data with the understanding of the context in real-time, to take an action. Example: predictive maintenance. Machine is close to maintenance period but not there yet. The production forecast is low right now but will become intense. Propose to operations team to execute maintenance operations ASAP as it’s the scenario with least impact on the forecast.
  • #49: Managing ships, trains, vehicles Taking into account: Weather Business metrics Pit to Port Long train tracks, has the longest trains in the world Ships waiting to be loaded Need to manage tide while complying with SLAs Big Data provides
  • #51: Michael O’Connell
  • #55: Thresholds can include a change in location, slope or variance e.g. motor temperature jumping 20 degrees in an hour; anomalies exceeding process control limits; or an empirical machine-learning model.
  • #56: The Event Server calculates the models on live data and provides notifications – including emails to engineers and/or logging to operational data stores or BPM systems.