© 2014 Guavus, Inc. All rights reserved.
Nicolas Hohn
Director of Analytics
Guavus
LARGE SCALE PREDICTIVE ANALYTICS
FOR ANOMALY DETECTION
2015
© 2015 Guavus, Inc. All rights reserved. 2
Guavus Applications Focus
Planning
Engineering
•  Network Analytics
•  Capacity Management
•  Trending & Forecasting
•  Value-based
Network Planning
•  Quality of Experience
•  Complaints Mitigation
•  Proactive Care
•  Revenue Assurance
•  Self Care Usage Portal
Network
Operations
MarketingCare
•  Service Management
•  QoS Management
•  Performance Monitoring
•  Proactive Service
Assurance
•  Subscriber Profiling
•  Personalization &
Targeting
•  CSP Data
Monetization
Service Assurance Customer Experience
© 2015 Guavus, Inc. All rights reserved. 3
Anomaly Detection
•  Anomaly: something that is unusual or unexpected
•  Detection: extraction of particular information from a larger stream of information
without specific cooperation from or synchronization with the sender
Implementation
•  Rule based: manual thresholds
•  Automated: thresholds set by machine learning
Operational Intelligence
Service
Degrading
Problem!
Service
Anomalies
Identification!
Root-Cause
Analysis!
Problem
Resolution!
Quantify how ‘unusual’ a signal value is
Unsupervised learning to send trigger
when signal is ‘unexpected enough’
DOES NOT SCALE
spike step slope
time time time
KPI
© 2015 Guavus, Inc. All rights reserved. 4
Anomaly detection
Event Arrival Times
2014-09-16 00:00:06
2014-09-16 00:00:09
2014-09-16 00:00:40
2014-09-16 00:00:42
2014-09-16 00:00:45
2014-09-16 00:01:00
2014-09-16 00:01:09
2014-09-16 00:01:11
2014-09-16 00:01:20
2014-09-16 00:02:09
……
5
4
Define KPI and time scale
Predict conditional baseline (black line) and
probability density given historical data
KPI value (green line)
Trigger alert (red dot) if data point
significantly above baseline, i.e. outside
confidence interval (gray bands)
1 2 3
4
•  4 step process
#events
Time
© 2015 Guavus, Inc. All rights reserved. 5
KPI
time
EASIER
Challenges
•  Predict distribution of current value based on past values
–  Uni/Multi variate time series analysis
•  Unify uncertainty metric across all types of input signals to build a global ranking of alarms
•  Scale on limited hardware footprint. Real time monitoring of potentially millions of time series
•  Keep customer happy (no alarm flooding, limit false positives, rank alarms by severity)
KPI
time
HARDER
HARD
HARD
HARD
© 2015 Guavus, Inc. All rights reserved. 6
Solutions
time
Anomalyindicator
#events
•  Data Science:
–  Robust to past anomalies
•  No guarantee that ‘training’ data is anomaly free
–  Adapt to changes
•  Retrain model
–  Cannot rely on labeled data:
•  Understand customer ‘utility function’, business impact of anomalies
•  Set thresholds automatically
•  Quantify cost of false positives and false negatives
•  Engineering:
–  Intelligent caching
–  Compression
–  Scalable system
© 2015 Guavus, Inc. All rights reserved. 7
•  Monitor KPIs, such as dropped call rate on each base station in a 4G network
•  Detect anomalies
•  Infer root cause by analyzing 1000s of other KPIs available on each cell of the network
Use case: Networks analytics
© 2015 Guavus, Inc. All rights reserved. 8
Architecture of the solution
Data	
  fusion	
  	
  
aggrega/on	
  
Compute Cluster Analytics Cluster
Intelligent	
  Cache	
  
Collector
Adapter	
  1	
  
Custom	
  
Adapter	
  2	
  
Columnar	
  
Storage	
  
Anomaly	
  
Detec/on	
  
UserInterface
Time	
  series	
  
analy/cs	
  
Rules/	
  alerts	
  
frame-­‐work	
  
M2MInterface
withcustomer
system
Data	
  streams	
  
© 2015 Guavus, Inc. All rights reserved. 9
Conclusion
Lessons learned
•  No silver bullet, but multiple methods each with their own pros/cons
•  Simple and scalable solution
•  Adapt to:
–  data changes
–  customer needs
•  API design:
–  black-box approach: hide complexity from developers
© 2015 Guavus, Inc. All rights reserved.
QA
Nicolas Hohn, Director of Analytics
nicolas.hohn@guavus.com
2015

More Related Content

PDF
PCI DSS ASV Scanning from Nettitude
PDF
Sensitel TrackAware for Asset, Shipment and Package Tracking
PDF
Logic Monitoring Service
PDF
CTG Logic monitor
PDF
Dynatrace FreeTrial Test Drive
PDF
Operational Analytics at Credit Suisse from ThousandEyes Connect
PDF
Unified Monitoring Webinar with Dustin Whittle
PPTX
How to deploy AppInternals in azure
PCI DSS ASV Scanning from Nettitude
Sensitel TrackAware for Asset, Shipment and Package Tracking
Logic Monitoring Service
CTG Logic monitor
Dynatrace FreeTrial Test Drive
Operational Analytics at Credit Suisse from ThousandEyes Connect
Unified Monitoring Webinar with Dustin Whittle
How to deploy AppInternals in azure

What's hot (10)

PDF
The Value of Options Analytics "as-a-Service"
PPTX
Top 5 IT challenges for 2017
PDF
DCMS AKCP Product Presentation
PDF
[CLASS 2014] Palestra Técnica - Leonardo Scudere
PPTX
Open Technology Solutions For Healthcare Startups
PDF
Service Assurance for Modern Apps - BigPanda NA SNO - April 2015 - Dan Turchin
PDF
LogicMonitor: An Overview
PPTX
Critical online success factors with dynatrace
PDF
Backup manager
PPTX
MachinePulse Products
The Value of Options Analytics "as-a-Service"
Top 5 IT challenges for 2017
DCMS AKCP Product Presentation
[CLASS 2014] Palestra Técnica - Leonardo Scudere
Open Technology Solutions For Healthcare Startups
Service Assurance for Modern Apps - BigPanda NA SNO - April 2015 - Dan Turchin
LogicMonitor: An Overview
Critical online success factors with dynatrace
Backup manager
MachinePulse Products
Ad

Similar to Large scale predictive analytics for anomaly detection - Nicolas Hohn (20)

PDF
Intelligent Digital Mesh Testing
PDF
Algolytics company Overview 2015
PDF
Algolytics company Overview 2015
PDF
Prism presentation
PPTX
Myths of validation
PDF
Platforming the Major Analytic Use Cases for Modern Engineering
PPTX
DISCUSSION ON DIGITAL OILFIELD FULL-FIELD OPTIMIZATION
PPTX
Network Monitoring Software Ensuring Secure and Reliable IT Operations.pptx
PDF
Savvius_Introduction to workshop
PPTX
Industrial asset optimization overview slideshare
PPTX
Performance and penetration_testing_with_a_partner_how_to_start!
PPTX
Performance and penetration_testing_with_a_partner_how_to_start!
PPTX
API Days Paris - When RESTful may be considered harmful
PPTX
Hyper-connected apps: Hyper-Connected Apps: Testing Peripherals and Mobile Ap...
PPT
CBN Supply Vending Presentation
PPTX
AI Based CI/CD Pipelines -- Chapter 8.pptx
PDF
Managing End User Expectations -- The A La Carte Strategy
PDF
Trends in Quality Assurance area
PPTX
The Big Picture: Learned Behaviors in Churn
PDF
Trends in the quality assurance area
Intelligent Digital Mesh Testing
Algolytics company Overview 2015
Algolytics company Overview 2015
Prism presentation
Myths of validation
Platforming the Major Analytic Use Cases for Modern Engineering
DISCUSSION ON DIGITAL OILFIELD FULL-FIELD OPTIMIZATION
Network Monitoring Software Ensuring Secure and Reliable IT Operations.pptx
Savvius_Introduction to workshop
Industrial asset optimization overview slideshare
Performance and penetration_testing_with_a_partner_how_to_start!
Performance and penetration_testing_with_a_partner_how_to_start!
API Days Paris - When RESTful may be considered harmful
Hyper-connected apps: Hyper-Connected Apps: Testing Peripherals and Mobile Ap...
CBN Supply Vending Presentation
AI Based CI/CD Pipelines -- Chapter 8.pptx
Managing End User Expectations -- The A La Carte Strategy
Trends in Quality Assurance area
The Big Picture: Learned Behaviors in Churn
Trends in the quality assurance area
Ad

More from PAPIs.io (20)

PDF
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
PDF
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
PDF
Extracting information from images using deep learning and transfer learning ...
PDF
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
PDF
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
PDF
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
PDF
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
PDF
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
PDF
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
PDF
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
PDF
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
PDF
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
PDF
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
PDF
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
PDF
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
PDF
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
PDF
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
PDF
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
PDF
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
PDF
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...
Shortening the time from analysis to deployment with ml as-a-service — Luiz A...
Feature engineering — HJ Van Veen (Nubank) @@PAPIs Connect — São Paulo 2017
Extracting information from images using deep learning and transfer learning ...
Discovering the hidden treasure of data using graph analytic — Ana Paula Appe...
Deep learning for sentiment analysis — André Barbosa (elo7) @PAPIs Connect — ...
Building machine learning service in your business — Eric Chen (Uber) @PAPIs ...
Building machine learning applications locally with Spark — Joel Pinho Lucas ...
Battery log data mining — Ramon Oliveira (Datart) @PAPIs Connect — São Paulo ...
A tensorflow recommending system for news — Fabrício Vargas Matos (Hearst tv)...
Scaling machine learning as a service at Uber — Li Erran Li at #papis2016
Real-world applications of AI - Daniel Hulme @ PAPIs Connect
Past, Present and Future of AI: a Fascinating Journey - Ramon Lopez de Mantar...
Revolutionizing Offline Retail Pricing & Promotions with ML - Daniel Guhl @ P...
Demystifying Deep Learning - Roberto Paredes Palacios @ PAPIs Connect
Predictive APIs: What about Banking? - Natalino Busa @ PAPIs Connect
Microdecision making in financial services - Greg Lamp @ PAPIs Connect
Engineering the Future of Our Choice with General AI - JoEllen Lukavec Koeste...
Distributed deep learning with spark on AWS - Vincent Van Steenbergen @ PAPIs...
How to predict the future of shopping - Ulrich Kerzel @ PAPIs Connect
The emergent opportunity of Big Data for Social Good - Nuria Oliver @ PAPIs C...

Recently uploaded (20)

PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PPTX
inbound6529290805104538764.pptxmmmmmmmmm
PDF
technical specifications solar ear 2025.
PPTX
AI AND ML PROPOSAL PRESENTATION MUST.pptx
PPTX
transformers as a tool for understanding advance algorithms in deep learning
PPTX
indiraparyavaranbhavan-240418134200-31d840b3.pptx
PPTX
Hushh Hackathon for IIT Bombay: Create your very own Agents
PPTX
GPS sensor used agriculture land for automation
PPTX
Stats annual compiled ipd opd ot br 2024
PDF
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
PPTX
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
PPTX
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
PPT
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
PPTX
PPT for Diseases.pptx, there are 3 types of diseases
PDF
Session 11 - Data Visualization Storytelling (2).pdf
PPTX
MBA JAPAN: 2025 the University of Waseda
PPTX
ch20 Database System Architecture by Rizvee
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PPTX
cp-and-safeguarding-training-2018-2019-mmfv2-230818062456-767bc1a7.pptx
PDF
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
inbound6529290805104538764.pptxmmmmmmmmm
technical specifications solar ear 2025.
AI AND ML PROPOSAL PRESENTATION MUST.pptx
transformers as a tool for understanding advance algorithms in deep learning
indiraparyavaranbhavan-240418134200-31d840b3.pptx
Hushh Hackathon for IIT Bombay: Create your very own Agents
GPS sensor used agriculture land for automation
Stats annual compiled ipd opd ot br 2024
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
865628565-Pertemuan-2-chapter-03-NUMERICAL-MEASURES.pptx
dsa Lec-1 Introduction FOR THE STUDENTS OF bscs
PPT for Diseases.pptx, there are 3 types of diseases
Session 11 - Data Visualization Storytelling (2).pdf
MBA JAPAN: 2025 the University of Waseda
ch20 Database System Architecture by Rizvee
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
cp-and-safeguarding-training-2018-2019-mmfv2-230818062456-767bc1a7.pptx
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.

Large scale predictive analytics for anomaly detection - Nicolas Hohn

  • 1. © 2014 Guavus, Inc. All rights reserved. Nicolas Hohn Director of Analytics Guavus LARGE SCALE PREDICTIVE ANALYTICS FOR ANOMALY DETECTION 2015
  • 2. © 2015 Guavus, Inc. All rights reserved. 2 Guavus Applications Focus Planning Engineering •  Network Analytics •  Capacity Management •  Trending & Forecasting •  Value-based Network Planning •  Quality of Experience •  Complaints Mitigation •  Proactive Care •  Revenue Assurance •  Self Care Usage Portal Network Operations MarketingCare •  Service Management •  QoS Management •  Performance Monitoring •  Proactive Service Assurance •  Subscriber Profiling •  Personalization & Targeting •  CSP Data Monetization Service Assurance Customer Experience
  • 3. © 2015 Guavus, Inc. All rights reserved. 3 Anomaly Detection •  Anomaly: something that is unusual or unexpected •  Detection: extraction of particular information from a larger stream of information without specific cooperation from or synchronization with the sender Implementation •  Rule based: manual thresholds •  Automated: thresholds set by machine learning Operational Intelligence Service Degrading Problem! Service Anomalies Identification! Root-Cause Analysis! Problem Resolution! Quantify how ‘unusual’ a signal value is Unsupervised learning to send trigger when signal is ‘unexpected enough’ DOES NOT SCALE spike step slope time time time KPI
  • 4. © 2015 Guavus, Inc. All rights reserved. 4 Anomaly detection Event Arrival Times 2014-09-16 00:00:06 2014-09-16 00:00:09 2014-09-16 00:00:40 2014-09-16 00:00:42 2014-09-16 00:00:45 2014-09-16 00:01:00 2014-09-16 00:01:09 2014-09-16 00:01:11 2014-09-16 00:01:20 2014-09-16 00:02:09 …… 5 4 Define KPI and time scale Predict conditional baseline (black line) and probability density given historical data KPI value (green line) Trigger alert (red dot) if data point significantly above baseline, i.e. outside confidence interval (gray bands) 1 2 3 4 •  4 step process #events Time
  • 5. © 2015 Guavus, Inc. All rights reserved. 5 KPI time EASIER Challenges •  Predict distribution of current value based on past values –  Uni/Multi variate time series analysis •  Unify uncertainty metric across all types of input signals to build a global ranking of alarms •  Scale on limited hardware footprint. Real time monitoring of potentially millions of time series •  Keep customer happy (no alarm flooding, limit false positives, rank alarms by severity) KPI time HARDER HARD HARD HARD
  • 6. © 2015 Guavus, Inc. All rights reserved. 6 Solutions time Anomalyindicator #events •  Data Science: –  Robust to past anomalies •  No guarantee that ‘training’ data is anomaly free –  Adapt to changes •  Retrain model –  Cannot rely on labeled data: •  Understand customer ‘utility function’, business impact of anomalies •  Set thresholds automatically •  Quantify cost of false positives and false negatives •  Engineering: –  Intelligent caching –  Compression –  Scalable system
  • 7. © 2015 Guavus, Inc. All rights reserved. 7 •  Monitor KPIs, such as dropped call rate on each base station in a 4G network •  Detect anomalies •  Infer root cause by analyzing 1000s of other KPIs available on each cell of the network Use case: Networks analytics
  • 8. © 2015 Guavus, Inc. All rights reserved. 8 Architecture of the solution Data  fusion     aggrega/on   Compute Cluster Analytics Cluster Intelligent  Cache   Collector Adapter  1   Custom   Adapter  2   Columnar   Storage   Anomaly   Detec/on   UserInterface Time  series   analy/cs   Rules/  alerts   frame-­‐work   M2MInterface withcustomer system Data  streams  
  • 9. © 2015 Guavus, Inc. All rights reserved. 9 Conclusion Lessons learned •  No silver bullet, but multiple methods each with their own pros/cons •  Simple and scalable solution •  Adapt to: –  data changes –  customer needs •  API design: –  black-box approach: hide complexity from developers
  • 10. © 2015 Guavus, Inc. All rights reserved. QA Nicolas Hohn, Director of Analytics [email protected] 2015