SlideShare a Scribd company logo
Container native
Monitoring Rohit Jnagal
Anushree Narasimha
Native container monitoring
Overview
● Monitoring for containers
● Monitoring in a distributed system
● cAdvisor
Application Metrics
● In cAdvisor
● Plumbing through a cluster
Future Work
● Heapster/Kubedash
● Templates
Outline
Monitoring in three parts
Collection
● collecting data and metrics
● making it discoverable
● plumbing it to higher-level systems
Processing
● Ingesting, aggregation
● Analytics
Managing
● Actions based on signals
● Alerts. Pagers!
Monitoring in three parts
Collection
● collecting data and metrics
● making it discoverable
● plumbing it to higher-level systems
Processing
● Ingesting, aggregation
● Analytics
Managing
● Actions based on signals
● Alerts. Pagers!
Collection in servers/VMs
● Node agent
● Knows the binary to monitor
● Monitoring logic plugged into agent
Moving to containers
● Same node agent - understands multiple
applications.
● Applications can push data to agent.
● Sidecars
● Applications publish data. Pulled off by off-
host agents.
Near-instant data
● schedulers in cluster management tools
● load balancers
● Alerting systems
Slower feedback loops
● Autoscaling / Autoupdaters
● CPI2
Historical Data
● Offline analysis.
consumers
Monitoring in distributed
environment
• Hosts are invisible
• Containers can hop around
• Monitoring needs to track
and correlate multiple
containers
In Action!
cAdvisor
Analyzes resource usage and performance
characteristics of running containers
Native Docker support, work with LXC and any
other container format
Knows containers deeply and monitors their
performance
github.com/google/cadvisor
Running cAdvisor
docker run
--volume=/:/rootfs:ro
--volume=/var/run:/var/run:rw
--volume=/sys:/sys:ro
--volume=/var/lib/docker/:/var/lib/docker:ro
--publish=8080:8080
--detach=true
--name=cadvisor
google/cadvisor:latest
API
Node and container spec
http://host:8080/api/v2.0/machine
http://host:8080/api/v2.0/spec/redis?type=docker
Hierarchical Container stats:
http://host:8080/api/v2.0/stats/nginx?type=docker
http://host:8080/api/v2.0/stats?recursive=true
Others:
/summary, /events, /storage, /attributes, /ps
Storage
Backends
Coming soon ...
Application Metrics
Every container is packaged with its monitoring data.
Monitoring moves with the container.
Use container composability to stack up metrics from all layers.
Use metadata in image or runtime (Docker labels!) to configure
monitoring
Configuring containers
FROM redis
ADD redis_config.json /var/cadvisor/redis_config.json
LABEL io.cadvisor.metric.redis="/var/cadvisor/redis_config.json"
In cAdvisor
read Labels “io.cadvisor.metric.*”
read /rootfs/proc/<pid>/root/<config path>
Monitoring Configuration
Holds metadata about metrics
● Endpoint (Location to collect metrics from)
● Name of metric
● Type (Counter, Gauge, ...)
● Data Type (int, float)
● Units (kbps, seconds, count)
● Polling Frequency
● Regexps (Regular expressions to be used to collect a metric)
Sample Configurations
{
"endpoint" : "http://localhost:8000/nginx_status",
"metrics_config" : [
{ "name" : "activeConnections",
"metric_type" : "gauge",
"units" : "number of active connections",
"data_type" : "int",
"polling_frequency" : 10,
"regex" : "Active connections: ([0-9]+)"
},
{ "name" : "reading",
"metric_type" : "gauge",
"units" : "number of reading connections",
"data_type" : "int",
"polling_frequency" : 10,
"regex" : "Reading: ([0-9]+) .*"
}
]
}
Get all prometheus metrics:
{
"endpoint" : "http://localhost:9100/metrics",
}
Get selected prometheus metrics:
{
"endpoint" : "http://localhost:8000/metrics",
"metrics_config" : [
{ "scheduler_binding_latency",
"scheduler_e2e_scheduling_latency",
"scheduling_algorithm_latency"
}
]
}
App Metrics in action
Endpoint for custom metrics:
http://localhost:8080/api/v2.0/appmetrics/containerName
Application metrics being collected can be discovered from the spec:
http://localhost:8080/api/v2.0/spec/containerName
Regular stats api also reports application metrics:
http://localhost:8080/api/v2.0/stats/containerName
API
Kubernetes
Open-source Container orchestration from Google
Declarative pattern for managing containers
Physical hosts abstracted out as resources
Inspired and informed by Borg
Kubernetes UI powered by cAdvisor
github.com/kubernetes/kubernetes
App Metrics for Kubernetes
All kubernetes components run in containers
All system services on node runs in containers
All kubernetes nodes run cAdvisor (built into kubelet)
All kubernetes components expose prometheus metrics
App-metrics for kubernetes components!
Heapster
github.com/kubernetes/heapster
Heapster
Kubedash
github.com/kubernetes/kubedash
Templates
Add templates for applications that have stable stats API
LABEL io.cadvisor.metric.type=redis
Infer monitoring information
● Lookup ports through docker inspection
● hit known endpoints (e.g.: /metrics for prometheus)
● overrides through config
Tags
Adding tags to specific metrics
Convey metric intent to processors
eg. Autoscalers
Ongoing work
Endpoints
Storage drivers
Standard config syntax
Automagic
Plumbing through heapster/kubedash/kubernetes/...
Thank you!
Rohit Jnagal jnagal@google
Anushree Narasimha anushree.bnp@gmail
cAdvisor github.com/google/cadvisor
kubernetes kubernetes.io
irc #google-containers

More Related Content

What's hot (19)

PPTX
Lessons Learned From PayPal: Implementing Back-Pressure With Akka Streams And...
Lightbend
 
PPTX
Zoo keeper in the wild
datamantra
 
PDF
Python and cassandra
Jon Haddad
 
PPTX
Streaming and Messaging
Xin Wang
 
PPTX
Real-Time Distributed and Reactive Systems with Apache Kafka and Apache Accumulo
Joe Stein
 
PPTX
Deploying and managing SolrCloud in the cloud using the Solr Scale Toolkit
thelabdude
 
PDF
Kafka Summit NYC 2017 - Running Hundreds of Kafka Clusters with 5 People
confluent
 
PDF
Monitoring Docker at Scale - Docker San Francisco Meetup - August 11, 2015
Datadog
 
PPTX
How to manage large amounts of data with akka streams
Igor Mielientiev
 
PPTX
Back-Pressure in Action: Handling High-Burst Workloads with Akka Streams & Kafka
Akara Sucharitakul
 
PDF
Kafka Summit SF 2017 - Exactly-once Stream Processing with Kafka Streams
confluent
 
PDF
KSQL: Streaming SQL for Kafka
confluent
 
PPTX
Service Stampede: Surviving a Thousand Services
Anil Gursel
 
PPTX
Scylla Summit 2018: Introducing ValuStor, A Memcached Alternative Made to Run...
ScyllaDB
 
PDF
Fact-Based Monitoring - PuppetConf 2014
Puppet
 
PPTX
Apache zookeeper seminar_trinh_viet_dung_03_2016
Viet-Dung TRINH
 
PPTX
Asynchronous Orchestration DSL on squbs
Anil Gursel
 
PDF
Advanced Operations
DataStax Academy
 
PDF
Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
Redis Labs
 
Lessons Learned From PayPal: Implementing Back-Pressure With Akka Streams And...
Lightbend
 
Zoo keeper in the wild
datamantra
 
Python and cassandra
Jon Haddad
 
Streaming and Messaging
Xin Wang
 
Real-Time Distributed and Reactive Systems with Apache Kafka and Apache Accumulo
Joe Stein
 
Deploying and managing SolrCloud in the cloud using the Solr Scale Toolkit
thelabdude
 
Kafka Summit NYC 2017 - Running Hundreds of Kafka Clusters with 5 People
confluent
 
Monitoring Docker at Scale - Docker San Francisco Meetup - August 11, 2015
Datadog
 
How to manage large amounts of data with akka streams
Igor Mielientiev
 
Back-Pressure in Action: Handling High-Burst Workloads with Akka Streams & Kafka
Akara Sucharitakul
 
Kafka Summit SF 2017 - Exactly-once Stream Processing with Kafka Streams
confluent
 
KSQL: Streaming SQL for Kafka
confluent
 
Service Stampede: Surviving a Thousand Services
Anil Gursel
 
Scylla Summit 2018: Introducing ValuStor, A Memcached Alternative Made to Run...
ScyllaDB
 
Fact-Based Monitoring - PuppetConf 2014
Puppet
 
Apache zookeeper seminar_trinh_viet_dung_03_2016
Viet-Dung TRINH
 
Asynchronous Orchestration DSL on squbs
Anil Gursel
 
Advanced Operations
DataStax Academy
 
Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDog
Redis Labs
 

Viewers also liked (20)

PDF
Performance Monitoring: Understanding Your Scylla Cluster
ScyllaDB
 
PDF
Monitoring Kubernetes with Prometheus
Tobias Schmidt
 
PDF
CloudCamp Chicago - Big Data & Cloud May 2015 - All Slides
CloudCamp Chicago
 
PPTX
Monitoring in a scalable world
TechExeter
 
PDF
Skynet project: Monitor, analyze, scale, and maintain a system in the Cloud
Sylvain Kalache
 
PPTX
Lifting the Blinds: Monitoring Windows Server 2012
Datadog
 
PDF
Monitoring kubernetes across data center and cloud
Datadog
 
PDF
Data Logging and Telemetry
Francesco Meschia
 
PDF
Prometheus and Docker (Docker Galway, November 2015)
Brian Brazil
 
PPTX
Deep-Dive to Application Insights
Gunnar Peipman
 
PDF
Intro to open source telemetry linux con 2016
Matthew Broberg
 
PPTX
How to Monitor Application Performance in a Container-Based World
Ken Owens
 
PPTX
Sysdig Monitorama Slides
Loris Degioanni
 
PPTX
Volta: Logging, Metrics, and Monitoring as a Service
LN Renganarayana
 
PDF
Kubernetes and Prometheus
Weaveworks
 
PDF
Container orchestration
Timo Derstappen
 
PDF
Netflix: From Clouds to Roots
Brendan Gregg
 
PPTX
AWS Re:Invent - High Availability Architecture at Netflix
Adrian Cockcroft
 
PDF
Container Orchestration Wars
Karl Isenberg
 
PDF
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Brian Brazil
 
Performance Monitoring: Understanding Your Scylla Cluster
ScyllaDB
 
Monitoring Kubernetes with Prometheus
Tobias Schmidt
 
CloudCamp Chicago - Big Data & Cloud May 2015 - All Slides
CloudCamp Chicago
 
Monitoring in a scalable world
TechExeter
 
Skynet project: Monitor, analyze, scale, and maintain a system in the Cloud
Sylvain Kalache
 
Lifting the Blinds: Monitoring Windows Server 2012
Datadog
 
Monitoring kubernetes across data center and cloud
Datadog
 
Data Logging and Telemetry
Francesco Meschia
 
Prometheus and Docker (Docker Galway, November 2015)
Brian Brazil
 
Deep-Dive to Application Insights
Gunnar Peipman
 
Intro to open source telemetry linux con 2016
Matthew Broberg
 
How to Monitor Application Performance in a Container-Based World
Ken Owens
 
Sysdig Monitorama Slides
Loris Degioanni
 
Volta: Logging, Metrics, and Monitoring as a Service
LN Renganarayana
 
Kubernetes and Prometheus
Weaveworks
 
Container orchestration
Timo Derstappen
 
Netflix: From Clouds to Roots
Brendan Gregg
 
AWS Re:Invent - High Availability Architecture at Netflix
Adrian Cockcroft
 
Container Orchestration Wars
Karl Isenberg
 
Monitoring Kubernetes with Prometheus (Kubernetes Ireland, 2016)
Brian Brazil
 
Ad

Similar to Native container monitoring (20)

PDF
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
Guglielmo Iozzia
 
PDF
Webinar Monitoring in era of cloud computing
CREATE-NET
 
PPTX
Fabric - Realtime stream processing framework
Shashank Gautam
 
PDF
Intelligent Monitoring
Intelie
 
PDF
TimeSeries Machine Learning - PyData London 2025
Suyash Joshi
 
PPTX
MongoDB for Time Series Data
MongoDB
 
PDF
Analytics&IoT
Selvaraj Kesavan
 
PDF
DNA: an overview
Cisco DevNet
 
PPTX
[DSC Europe 23] Pramod Immaneni - Real-time analytics at IoT scale
DataScienceConferenc1
 
PPTX
Microsoft Azure Big Data Analytics
Mark Kromer
 
PPTX
MongoDB for Time Series Data: Setting the Stage for Sensor Management
MongoDB
 
PDF
[WSO2Con EU 2018] Patterns for Building Streaming Apps
WSO2
 
PDF
Apache Big Data EU 2016: Next Gen Big Data Analytics with Apache Apex
Apache Apex
 
PDF
Analytics in Your Enterprise
WSO2
 
PDF
Streaming Analytics and Internet of Things - Geesara Prathap
WithTheBest
 
ODP
Log aggregation and analysis
Dhaval Mehta
 
PPTX
WSO2Con USA 2015: Patterns for Deploying Analytics in the Real World
WSO2
 
PDF
Azure Monitoring Overview
gjuljo
 
PPTX
Real time streaming analytics
Anirudh
 
PDF
Understanding Business APIs through statistics
WSO2
 
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
Guglielmo Iozzia
 
Webinar Monitoring in era of cloud computing
CREATE-NET
 
Fabric - Realtime stream processing framework
Shashank Gautam
 
Intelligent Monitoring
Intelie
 
TimeSeries Machine Learning - PyData London 2025
Suyash Joshi
 
MongoDB for Time Series Data
MongoDB
 
Analytics&IoT
Selvaraj Kesavan
 
DNA: an overview
Cisco DevNet
 
[DSC Europe 23] Pramod Immaneni - Real-time analytics at IoT scale
DataScienceConferenc1
 
Microsoft Azure Big Data Analytics
Mark Kromer
 
MongoDB for Time Series Data: Setting the Stage for Sensor Management
MongoDB
 
[WSO2Con EU 2018] Patterns for Building Streaming Apps
WSO2
 
Apache Big Data EU 2016: Next Gen Big Data Analytics with Apache Apex
Apache Apex
 
Analytics in Your Enterprise
WSO2
 
Streaming Analytics and Internet of Things - Geesara Prathap
WithTheBest
 
Log aggregation and analysis
Dhaval Mehta
 
WSO2Con USA 2015: Patterns for Deploying Analytics in the Real World
WSO2
 
Azure Monitoring Overview
gjuljo
 
Real time streaming analytics
Anirudh
 
Understanding Business APIs through statistics
WSO2
 
Ad

More from Rohit Jnagal (7)

PDF
Task migration using CRIU
Rohit Jnagal
 
PDF
Memory Bandwidth QoS
Rohit Jnagal
 
PDF
Cat @ scale
Rohit Jnagal
 
PDF
Kubernetes intro public - kubernetes meetup 4-21-2015
Rohit Jnagal
 
PDF
Docker n co
Rohit Jnagal
 
PDF
Docker Overview
Rohit Jnagal
 
PDF
Docker internals
Rohit Jnagal
 
Task migration using CRIU
Rohit Jnagal
 
Memory Bandwidth QoS
Rohit Jnagal
 
Cat @ scale
Rohit Jnagal
 
Kubernetes intro public - kubernetes meetup 4-21-2015
Rohit Jnagal
 
Docker n co
Rohit Jnagal
 
Docker Overview
Rohit Jnagal
 
Docker internals
Rohit Jnagal
 

Recently uploaded (20)

PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PDF
Per Axbom: The spectacular lies of maps
Nexer Digital
 
PDF
Market Insight : ETH Dominance Returns
CIFDAQ
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
TrustArc Webinar - Navigating Data Privacy in LATAM: Laws, Trends, and Compli...
TrustArc
 
PDF
Researching The Best Chat SDK Providers in 2025
Ray Fields
 
PPTX
Simple and concise overview about Quantum computing..pptx
mughal641
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PPTX
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
Per Axbom: The spectacular lies of maps
Nexer Digital
 
Market Insight : ETH Dominance Returns
CIFDAQ
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
TrustArc Webinar - Navigating Data Privacy in LATAM: Laws, Trends, and Compli...
TrustArc
 
Researching The Best Chat SDK Providers in 2025
Ray Fields
 
Simple and concise overview about Quantum computing..pptx
mughal641
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
What-is-the-World-Wide-Web -- Introduction
tonifi9488
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 

Native container monitoring

  • 1. Container native Monitoring Rohit Jnagal Anushree Narasimha
  • 3. Overview ● Monitoring for containers ● Monitoring in a distributed system ● cAdvisor Application Metrics ● In cAdvisor ● Plumbing through a cluster Future Work ● Heapster/Kubedash ● Templates Outline
  • 4. Monitoring in three parts Collection ● collecting data and metrics ● making it discoverable ● plumbing it to higher-level systems Processing ● Ingesting, aggregation ● Analytics Managing ● Actions based on signals ● Alerts. Pagers!
  • 5. Monitoring in three parts Collection ● collecting data and metrics ● making it discoverable ● plumbing it to higher-level systems Processing ● Ingesting, aggregation ● Analytics Managing ● Actions based on signals ● Alerts. Pagers!
  • 6. Collection in servers/VMs ● Node agent ● Knows the binary to monitor ● Monitoring logic plugged into agent Moving to containers ● Same node agent - understands multiple applications. ● Applications can push data to agent. ● Sidecars ● Applications publish data. Pulled off by off- host agents.
  • 7. Near-instant data ● schedulers in cluster management tools ● load balancers ● Alerting systems Slower feedback loops ● Autoscaling / Autoupdaters ● CPI2 Historical Data ● Offline analysis. consumers
  • 8. Monitoring in distributed environment • Hosts are invisible • Containers can hop around • Monitoring needs to track and correlate multiple containers
  • 10. cAdvisor Analyzes resource usage and performance characteristics of running containers Native Docker support, work with LXC and any other container format Knows containers deeply and monitors their performance github.com/google/cadvisor
  • 12. API Node and container spec http://host:8080/api/v2.0/machine http://host:8080/api/v2.0/spec/redis?type=docker Hierarchical Container stats: http://host:8080/api/v2.0/stats/nginx?type=docker http://host:8080/api/v2.0/stats?recursive=true Others: /summary, /events, /storage, /attributes, /ps
  • 14. Application Metrics Every container is packaged with its monitoring data. Monitoring moves with the container. Use container composability to stack up metrics from all layers. Use metadata in image or runtime (Docker labels!) to configure monitoring
  • 15. Configuring containers FROM redis ADD redis_config.json /var/cadvisor/redis_config.json LABEL io.cadvisor.metric.redis="/var/cadvisor/redis_config.json" In cAdvisor read Labels “io.cadvisor.metric.*” read /rootfs/proc/<pid>/root/<config path>
  • 16. Monitoring Configuration Holds metadata about metrics ● Endpoint (Location to collect metrics from) ● Name of metric ● Type (Counter, Gauge, ...) ● Data Type (int, float) ● Units (kbps, seconds, count) ● Polling Frequency ● Regexps (Regular expressions to be used to collect a metric)
  • 17. Sample Configurations { "endpoint" : "http://localhost:8000/nginx_status", "metrics_config" : [ { "name" : "activeConnections", "metric_type" : "gauge", "units" : "number of active connections", "data_type" : "int", "polling_frequency" : 10, "regex" : "Active connections: ([0-9]+)" }, { "name" : "reading", "metric_type" : "gauge", "units" : "number of reading connections", "data_type" : "int", "polling_frequency" : 10, "regex" : "Reading: ([0-9]+) .*" } ] } Get all prometheus metrics: { "endpoint" : "http://localhost:9100/metrics", } Get selected prometheus metrics: { "endpoint" : "http://localhost:8000/metrics", "metrics_config" : [ { "scheduler_binding_latency", "scheduler_e2e_scheduling_latency", "scheduling_algorithm_latency" } ] }
  • 18. App Metrics in action
  • 19. Endpoint for custom metrics: http://localhost:8080/api/v2.0/appmetrics/containerName Application metrics being collected can be discovered from the spec: http://localhost:8080/api/v2.0/spec/containerName Regular stats api also reports application metrics: http://localhost:8080/api/v2.0/stats/containerName API
  • 20. Kubernetes Open-source Container orchestration from Google Declarative pattern for managing containers Physical hosts abstracted out as resources Inspired and informed by Borg Kubernetes UI powered by cAdvisor github.com/kubernetes/kubernetes
  • 21. App Metrics for Kubernetes All kubernetes components run in containers All system services on node runs in containers All kubernetes nodes run cAdvisor (built into kubelet) All kubernetes components expose prometheus metrics App-metrics for kubernetes components!
  • 24. Templates Add templates for applications that have stable stats API LABEL io.cadvisor.metric.type=redis Infer monitoring information ● Lookup ports through docker inspection ● hit known endpoints (e.g.: /metrics for prometheus) ● overrides through config
  • 25. Tags Adding tags to specific metrics Convey metric intent to processors eg. Autoscalers
  • 26. Ongoing work Endpoints Storage drivers Standard config syntax Automagic Plumbing through heapster/kubedash/kubernetes/...
  • 27. Thank you! Rohit Jnagal jnagal@google Anushree Narasimha anushree.bnp@gmail cAdvisor github.com/google/cadvisor kubernetes kubernetes.io irc #google-containers