Intelligently Monitor And Rapidly Troubleshoot Streaming Fast Data Applications With OpsClarity
Agenda
1. Fast Data & Streaming Applications
2. The Challenges of Monitoring Fast Data Applications	
3. What To Look For In a Fast Data Application Monitoring Solution
4. Intelligent End-To-End Monitoring from Lightbend	
5. Live Demo
6. Questions
Intelligently Monitor And Rapidly Troubleshoot Streaming Fast Data Applications With OpsClarity
Intelligently Monitor And Rapidly Troubleshoot Streaming Fast Data Applications With OpsClarity
reactivemanifesto.org
Reactive Underpinnings: Fast Data and streaming applications
often incorporate, or are based on, Reactive principles
• Rapidly Evolving Ecosystem
• Understanding the Data Pipeline
• Dynamic Architectures
• Intricately Interconnected
• Distributed And Clustered
The Challenges of Monitoring Fast Data Applications
Apache Spark, As An Illustrative Example
The Challenges of Monitoring Fast Data Applications
Concern Questions	To	Ask
Data	Health	(for	a	
given	application)
• Throughput:	is	data	processing	occurring	at	the	expected	rate?		
• Latency:	is	data	processing	occurring	within	the	expected	timeframe?		
• Error/quality:	are	there	problems	with	the	data	being	produced?		
• Input	data:	are	input	data	streams	flowing	into	Spark	behaving	normally?	For	instance,	what	are	the	
throughput	rates	for	Kafka	topics	feeding	into	the	Spark	job?
Dependency	Health • Are	the	systems	feeding	input	into	the	spark	job	(such	as	Kafka)	healthy?		
• Are	the	systems	that	the	application	is	dependent	on,	such	as	Memcache	or	other	API	endpoints,	
healthy?	
Service	Health • Is	the	Spark	master	operating	normally?	If	not,	engineering	will	be	unable	to	re-balance	workloads	or	
restart	jobs.	
Application	Health • Are	the	application	KPIs	within	normal	operating	parameters?	
Topology	Health • Are	there	resources	assigned	to	the	given	Spark	topology?		
• •	Are	the	Spark	tasks	and	executors	well-distributed	amongst	the	Spark	cluster?		
• •	Are	the	performance	counters	(emitted,	failed,	latency,	etc.)	for	the	given	Spark	topology	normal?	
Node	System	Health • Are	the	key	system	metrics	(load,	CPU,	memory,	net-i/o,	disk-i/o,	disk	free)	operating	normally?
Modern monitoring for modern applications
Why traditional monitoring tools won’t help you
• Built	to	monitor	monolithic	
applications	
• Can	only	be	used	to	extract	
metrics	and	trace	information	
based	on	a	synchronous	flow	
• Not	built	for	asynchronous	
flows	(i.e.	in	Fast	Data	and	
streaming	applications)	
• Cannot	easily	handle	streaming	
systems	running	on	distributed	
clusters
• Automatic Telemetry
• Visual Maps
• Intelligent, Rapid
Troubleshooting
What users need to effectively monitor Fast Data and
streaming applications
• Deep Telemetry	
• Domain Expertise
• Intelligent Anomaly
Detection
• Fine-Grained
Visibility, with Drill-
Down Capabilities
Data-Science Driven Anomaly Detection
• Automated Topology
Discovery
• Automatic Metric
Collection
• Real-Time Topology
Visualization
Automated Discovery, Configuration & Topology
Visualization
• Single Pane of Glass
Visibility
• Rapid Root Cause
Analysis
• Reduced Mean-Time-
To-Repair (MTTR)
Intelligent, Rapid Troubleshooting
• Dramatically reduce the time and cost to identify and remediate issues across
application life-cycle.

• Create happier, more satisfied customers – and lower churn	
• Lower HW/infrastructure costs and reduce concerns about chargebacks & SLA
penalties
• Deliver rapid time to value because everything you need for monitoring is packaged
into an easy-to-use solution
Benefits for your business
On to the demo…
lightbend.com/learn
If you’re serious about having end-to-end monitoring
for your Fast Data and streaming applications, 

let’s chat!
SET UP A 20-MIN DEMO

More Related Content

PDF
Insights Without Tradeoffs: Using Structured Streaming
PPTX
Case Study: Elasticsearch Ingest Using StreamSets at Cisco Intercloud
PPTX
Blind spots in big data erez koren @ forter
PPTX
Anomaly Detection using Spark MLlib and Spark Streaming
PPTX
Databus - LinkedIn's Change Data Capture Pipeline
PDF
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
PPTX
Data Infrastructure at LinkedIn
PDF
Using Spark Mllib Models in a Production Training and Serving Platform: Exper...
Insights Without Tradeoffs: Using Structured Streaming
Case Study: Elasticsearch Ingest Using StreamSets at Cisco Intercloud
Blind spots in big data erez koren @ forter
Anomaly Detection using Spark MLlib and Spark Streaming
Databus - LinkedIn's Change Data Capture Pipeline
Apache ® Spark™ MLlib 2.x: How to Productionize your Machine Learning Models
Data Infrastructure at LinkedIn
Using Spark Mllib Models in a Production Training and Serving Platform: Exper...

What's hot (20)

PDF
Accelerating Machine Learning on Databricks Runtime
PDF
KFServing, Model Monitoring with Apache Spark and a Feature Store
PDF
Moving a Fraud-Fighting Random Forest from scikit-learn to Spark with MLlib, ...
PPTX
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
PPTX
Insights Without Tradeoffs Using Structured Streaming keynote by Michael Armb...
PPTX
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
PDF
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
PPTX
R at Microsoft
PDF
Challenges of Operationalising Data Science in Production
PDF
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
PPTX
Streaming Data Ingest and Processing with Apache Kafka
PDF
Fighting Fraud with Apache Spark
PPTX
Spark ML Pipeline serving
PDF
How to Productionize Your Machine Learning Models Using Apache Spark MLlib 2....
PPTX
Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
PDF
Using PySpark to Process Boat Loads of Data
PDF
MLeap: Productionize Data Science Workflows Using Spark
PPTX
Innovation in the Enterprise Rent-A-Car Data Warehouse
PDF
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
PDF
1200x630 1
Accelerating Machine Learning on Databricks Runtime
KFServing, Model Monitoring with Apache Spark and a Feature Store
Moving a Fraud-Fighting Random Forest from scikit-learn to Spark with MLlib, ...
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
Insights Without Tradeoffs Using Structured Streaming keynote by Michael Armb...
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
Revolution R Enterprise 7.4 - Presentation by Bill Jacobs 11Jun15
R at Microsoft
Challenges of Operationalising Data Science in Production
Accelerating Deep Learning Training with BigDL and Drizzle on Apache Spark wi...
Streaming Data Ingest and Processing with Apache Kafka
Fighting Fraud with Apache Spark
Spark ML Pipeline serving
How to Productionize Your Machine Learning Models Using Apache Spark MLlib 2....
Reactive Fast Data & the Data Lake with Akka, Kafka, Spark
Using PySpark to Process Boat Loads of Data
MLeap: Productionize Data Science Workflows Using Spark
Innovation in the Enterprise Rent-A-Car Data Warehouse
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
1200x630 1

Similar to Intelligently Monitor And Rapidly Troubleshoot Streaming Fast Data Applications With OpsClarity (20)

PPTX
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
PDF
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
PPTX
I Heart Log: Real-time Data and Apache Kafka
PDF
Data Streaming For Big Data
PPTX
Monitoring microservices lightning ddd north 20171014
PPTX
Hands-on Machine Learning Using Healthcare
PPTX
Observability in real time at scale
PDF
Data Engineering.pdf
PPTX
Automating the process of continuously prioritising data, updating and deploy...
PDF
Medidata AMUG Meeting / Presentation 2013
PPT
Lecture 01 Evolution of Decision Support Systems
PPTX
Apache kafka
PPTX
Data munging and analysis
PPT
The UK National Chemical Database Service – an integration of commercial and ...
PPTX
Performance Testing
PDF
Building Streaming And Fast Data Applications With Spark, Mesos, Akka, Cassan...
PPTX
Providence: rapid vulnerability prevention
PPTX
Nicola Pagni - Anomaly Detection in Elasticsearch
PDF
Developing high frequency indicators using real time tick data on apache supe...
PPTX
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...
February 2017 HUG: Slow, Stuck, or Runaway Apps? Learn How to Quickly Fix Pro...
Using Machine Learning to Understand Kafka Runtime Behavior (Shivanath Babu, ...
I Heart Log: Real-time Data and Apache Kafka
Data Streaming For Big Data
Monitoring microservices lightning ddd north 20171014
Hands-on Machine Learning Using Healthcare
Observability in real time at scale
Data Engineering.pdf
Automating the process of continuously prioritising data, updating and deploy...
Medidata AMUG Meeting / Presentation 2013
Lecture 01 Evolution of Decision Support Systems
Apache kafka
Data munging and analysis
The UK National Chemical Database Service – an integration of commercial and ...
Performance Testing
Building Streaming And Fast Data Applications With Spark, Mesos, Akka, Cassan...
Providence: rapid vulnerability prevention
Nicola Pagni - Anomaly Detection in Elasticsearch
Developing high frequency indicators using real time tick data on apache supe...
Observing Intraday Indicators Using Real-Time Tick Data on Apache Superset an...

More from Lightbend (20)

PDF
IoT 'Megaservices' - High Throughput Microservices with Akka
PDF
How Akka Cluster Works: Actors Living in a Cluster
PDF
The Reactive Principles: Eight Tenets For Building Cloud Native Applications
PDF
Putting the 'I' in IoT - Building Digital Twins with Akka Microservices
PDF
Akka at Enterprise Scale: Performance Tuning Distributed Applications
PDF
Digital Transformation with Kubernetes, Containers, and Microservices
PDF
Detecting Real-Time Financial Fraud with Cloudflow on Kubernetes
PDF
Cloudstate - Towards Stateful Serverless
PDF
Digital Transformation from Monoliths to Microservices to Serverless and Beyond
PDF
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
PDF
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
PDF
Microservices, Kubernetes, and Application Modernization Done Right
PDF
Full Stack Reactive In Practice
PDF
Akka and Kubernetes: A Symbiotic Love Story
PPTX
Scala 3 Is Coming: Martin Odersky Shares What To Know
PDF
Migrating From Java EE To Cloud-Native Reactive Systems
PDF
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
PDF
Designing Events-First Microservices For A Cloud Native World
PDF
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala
PDF
How To Build, Integrate, and Deploy Real-Time Streaming Pipelines On Kubernetes
IoT 'Megaservices' - High Throughput Microservices with Akka
How Akka Cluster Works: Actors Living in a Cluster
The Reactive Principles: Eight Tenets For Building Cloud Native Applications
Putting the 'I' in IoT - Building Digital Twins with Akka Microservices
Akka at Enterprise Scale: Performance Tuning Distributed Applications
Digital Transformation with Kubernetes, Containers, and Microservices
Detecting Real-Time Financial Fraud with Cloudflow on Kubernetes
Cloudstate - Towards Stateful Serverless
Digital Transformation from Monoliths to Microservices to Serverless and Beyond
Akka Anti-Patterns, Goodbye: Six Features of Akka 2.6
How to build streaming data pipelines with Akka Streams, Flink, and Spark usi...
Microservices, Kubernetes, and Application Modernization Done Right
Full Stack Reactive In Practice
Akka and Kubernetes: A Symbiotic Love Story
Scala 3 Is Coming: Martin Odersky Shares What To Know
Migrating From Java EE To Cloud-Native Reactive Systems
Running Kafka On Kubernetes With Strimzi For Real-Time Streaming Applications
Designing Events-First Microservices For A Cloud Native World
Scala Security: Eliminate 200+ Code-Level Threats With Fortify SCA For Scala
How To Build, Integrate, and Deploy Real-Time Streaming Pipelines On Kubernetes

Recently uploaded (20)

PDF
Understanding the Need for Systemic Change in Open Source Through Intersectio...
PPTX
Independent Consultants’ Biggest Challenges in ERP Projects – and How Apagen ...
PDF
WhatsApp Chatbots The Key to Scalable Customer Support.pdf
PDF
Building an Inclusive Web Accessibility Made Simple with Accessibility Analyzer
PDF
Mobile App Backend Development with WordPress REST API: The Complete eBook
PDF
Odoo Construction Management System by CandidRoot
PPTX
Comprehensive Guide to Digital Image Processing Concepts and Applications
PDF
Coding with GPT-5- What’s New in GPT 5 That Benefits Developers.pdf
PPTX
UNIT II: Software design, software .pptx
PPTX
Presentation - Summer Internship at Samatrix.io_template_2.pptx
PPTX
FLIGHT TICKET API | API INTEGRATION PLATFORM
PPTX
Swiggy API Scraping A Comprehensive Guide on Data Sets and Applications.pptx
PDF
Mobile App for Guard Tour and Reporting.pdf
PDF
Ragic Data Security Overview: Certifications, Compliance, and Network Safegua...
PDF
Module 1 - Introduction to Generative AI.pdf
PPTX
WJQSJXNAZJVCVSAXJHBZKSJXKJKXJSBHJBJEHHJB
PDF
Multiverse AI Review 2025_ The Ultimate All-in-One AI Platform.pdf
PPTX
StacksandQueuesCLASS 12 COMPUTER SCIENCE.pptx
PDF
Top 10 Project Management Software for Small Teams in 2025.pdf
PPTX
MCP empowers AI Agents from Zero to Production
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Independent Consultants’ Biggest Challenges in ERP Projects – and How Apagen ...
WhatsApp Chatbots The Key to Scalable Customer Support.pdf
Building an Inclusive Web Accessibility Made Simple with Accessibility Analyzer
Mobile App Backend Development with WordPress REST API: The Complete eBook
Odoo Construction Management System by CandidRoot
Comprehensive Guide to Digital Image Processing Concepts and Applications
Coding with GPT-5- What’s New in GPT 5 That Benefits Developers.pdf
UNIT II: Software design, software .pptx
Presentation - Summer Internship at Samatrix.io_template_2.pptx
FLIGHT TICKET API | API INTEGRATION PLATFORM
Swiggy API Scraping A Comprehensive Guide on Data Sets and Applications.pptx
Mobile App for Guard Tour and Reporting.pdf
Ragic Data Security Overview: Certifications, Compliance, and Network Safegua...
Module 1 - Introduction to Generative AI.pdf
WJQSJXNAZJVCVSAXJHBZKSJXKJKXJSBHJBJEHHJB
Multiverse AI Review 2025_ The Ultimate All-in-One AI Platform.pdf
StacksandQueuesCLASS 12 COMPUTER SCIENCE.pptx
Top 10 Project Management Software for Small Teams in 2025.pdf
MCP empowers AI Agents from Zero to Production

Intelligently Monitor And Rapidly Troubleshoot Streaming Fast Data Applications With OpsClarity

  • 2. Agenda 1. Fast Data & Streaming Applications 2. The Challenges of Monitoring Fast Data Applications 3. What To Look For In a Fast Data Application Monitoring Solution 4. Intelligent End-To-End Monitoring from Lightbend 5. Live Demo 6. Questions
  • 5. reactivemanifesto.org Reactive Underpinnings: Fast Data and streaming applications often incorporate, or are based on, Reactive principles
  • 6. • Rapidly Evolving Ecosystem • Understanding the Data Pipeline • Dynamic Architectures • Intricately Interconnected • Distributed And Clustered The Challenges of Monitoring Fast Data Applications
  • 7. Apache Spark, As An Illustrative Example The Challenges of Monitoring Fast Data Applications Concern Questions To Ask Data Health (for a given application) • Throughput: is data processing occurring at the expected rate? • Latency: is data processing occurring within the expected timeframe? • Error/quality: are there problems with the data being produced? • Input data: are input data streams flowing into Spark behaving normally? For instance, what are the throughput rates for Kafka topics feeding into the Spark job? Dependency Health • Are the systems feeding input into the spark job (such as Kafka) healthy? • Are the systems that the application is dependent on, such as Memcache or other API endpoints, healthy? Service Health • Is the Spark master operating normally? If not, engineering will be unable to re-balance workloads or restart jobs. Application Health • Are the application KPIs within normal operating parameters? Topology Health • Are there resources assigned to the given Spark topology? • • Are the Spark tasks and executors well-distributed amongst the Spark cluster? • • Are the performance counters (emitted, failed, latency, etc.) for the given Spark topology normal? Node System Health • Are the key system metrics (load, CPU, memory, net-i/o, disk-i/o, disk free) operating normally?
  • 8. Modern monitoring for modern applications
  • 9. Why traditional monitoring tools won’t help you • Built to monitor monolithic applications • Can only be used to extract metrics and trace information based on a synchronous flow • Not built for asynchronous flows (i.e. in Fast Data and streaming applications) • Cannot easily handle streaming systems running on distributed clusters
  • 10. • Automatic Telemetry • Visual Maps • Intelligent, Rapid Troubleshooting What users need to effectively monitor Fast Data and streaming applications
  • 11. • Deep Telemetry • Domain Expertise • Intelligent Anomaly Detection • Fine-Grained Visibility, with Drill- Down Capabilities Data-Science Driven Anomaly Detection
  • 12. • Automated Topology Discovery • Automatic Metric Collection • Real-Time Topology Visualization Automated Discovery, Configuration & Topology Visualization
  • 13. • Single Pane of Glass Visibility • Rapid Root Cause Analysis • Reduced Mean-Time- To-Repair (MTTR) Intelligent, Rapid Troubleshooting
  • 14. • Dramatically reduce the time and cost to identify and remediate issues across application life-cycle.
 • Create happier, more satisfied customers – and lower churn • Lower HW/infrastructure costs and reduce concerns about chargebacks & SLA penalties • Deliver rapid time to value because everything you need for monitoring is packaged into an easy-to-use solution Benefits for your business
  • 15. On to the demo…
  • 17. If you’re serious about having end-to-end monitoring for your Fast Data and streaming applications, 
 let’s chat! SET UP A 20-MIN DEMO