SlideShare a Scribd company logo
Application Monitoring and Tracing in
Kubernetes: Avoiding Microservice Hell!
David vonThenen
@dvonthenen
http://dvonthenen.com
github.com/dvonthenen
Agenda
• Why do we care?
• Introduction to Metrics
• Introduction to Tracing
• Demo
• Q&A
• Free give away! Slim Power Bank/Chargers!
Why do we care?
Microservices Are Awesome!
• Discrete Set of Functionality
• Resilient / Tolerates Failure
• Distributed / Highly Scalable
• Technology Freedom
• Autonomy of Dev Teams
• Enables Continuous Delivery
Can Be Your Worst Nightmare!
• Complex to Build
• Decentralized Nature
• Interface / Docs Required
• Operational Complexity
• Transaction Management
• Visibility is Difficult
Microservices at Scale (Excuse the pun)
https://www.slideshare.net/aspyker/reinvent-2016-container-scheduling-execution-and-aws-integration
Simple Failures
Complex Failures
Who is Talking to Who?
One Bad Apple…
Logs Aren’t Enough
Gain Visibility Now!
The Answer is…
• Metrics/Instrumentation
– Measure properties of a
given system
– Alarms and Notifications
• Tracing
– Observe interactions at a
request level
– Measure work in time
Introduction to Metrics
What are Metrics?
• Metrics are a quantifiable set of measurements of a
property for a given system, process, or component.
– Performance counters
– Instrumentation
• Observe behavior
• React to changes
Prometheus
• Open-source systems monitoring and
alerting project
• Cloud Native Compute Foundation
(CNCF) hosted project
• Originally built by SoundCloud
• Data model with time series data
• https://github.com/prometheus/prometheus
Let’s Deploy Prometheus
+
Architecture
Types of Metrics
• Counter – only increases in value
• Gauge – value goes up or down over
time
• Histogram – samples observations
and counts them over buckets
• Summary – histogram plus a
summation of value
Alerts
• Create rules based on observed metrics
• Alerts trigger actions to be taken
– Email
– Slack
– Webhooks
• Why do we care?
– Enables dynamic scale up and down
Prometheus Language Bindings
• 15 official and community supported libraries
– Go, Java, Python, Ruby, C++, etc
• https://prometheus.io/docs/instrumenting/clientlibs/
Introduction to Tracing
What is Tracing?
• Enables observability of a given
transaction as it moves through a
(distributed) system
• Allows visualization of which
microservice instances are involved
• Tracks the path through the software
stack + time metrics
Jaeger
• Open-source distributed tracing system
• CNCF hosted project
• Originally built by Uber
• OpenTracing compatible
• Root cause and observe performance
• https://github.com/jaegertracing/jaeger
Let’s Deploy Jaeger
+
Architecture
Traces and Spans
Jaeger Language Bindings
• 5 official and bunch of community supported libraries
– Go, Java, Python, node, C++
– http://jaeger.readthedocs.io/en/latest/client_libraries/
Metrics vs Tracing
• Metrics
– Gives a singular per node,
instance, or component
view of the world
– Health checks,
performance monitoring,
etc
– Alerts and reaction to
change
• Tracing
– Follows a single
transaction, API call, etc
through a given system or
application
– Think what a stack trace
provides except tracing is
doing it in a distributed
fashion
Demo
Demo Time!
Demo Configuration
• Kubernetes 1.7
• Prometheus 2.1
• Jaeger 1.0
• How-to:
https://github.com/dvonthenen/proposals/tree/master/2018_SCALE16
David vonThenen
{code} – Dell Technologies
@dvonthenen
http://dvonthenen.com
github.com/dvonthenen
Thank You

More Related Content

Similar to SCaLE 16x - Application Monitoring And Tracing In Kubernetes (20)

PDF
Bol.com Tech lab September 2017 - Microservices in action at the Dutch Nation...
Bert Jan Schrijver
 
PPTX
DevOpsGuys - DevOps Automation - The Good, The Bad and The Ugly
DevOpsGroup
 
PPTX
Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit Europe
AppDynamics
 
PPTX
Monitoring Containerized Micro-Services In Azure
Alex Bulankou
 
PDF
Streaming to a new Jakarta EE / JOTB19
Markus Eisele
 
PPTX
DEVNET-1169 CI/CT/CD on a Micro Services Applications using Docker, Salt & Ni...
Cisco DevNet
 
PPTX
Iot cloud service v2.0
Vinod Wilson
 
PPTX
.Net Microservices with Event Sourcing, CQRS, Docker and... Windows Server 20...
Javier García Magna
 
PPTX
Microservices: Yes or not?
Eduard Tomàs
 
PDF
Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
PDF
JavaZone 2017 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
PDF
OpenValue meetup October 2017 - Microservices in action at the Dutch National...
Bert Jan Schrijver
 
PPTX
Kubernetes Infra 2.0
Deepak Sood
 
PPTX
All about that reactive ui
Paul van Zyl
 
PDF
GeekOut 2017 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
PDF
Devoxx PL 2018 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
PPTX
Cerberus : Framework for Manual and Automated Testing (Web Application)
CIVEL Benoit
 
PPTX
Cerberus_Presentation1
CIVEL Benoit
 
PDF
Continuous Delivery Amsterdam - Microservices in action at the Dutch National...
Bert Jan Schrijver
 
PDF
J-Spring 2017 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
Bol.com Tech lab September 2017 - Microservices in action at the Dutch Nation...
Bert Jan Schrijver
 
DevOpsGuys - DevOps Automation - The Good, The Bad and The Ugly
DevOpsGroup
 
Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit Europe
AppDynamics
 
Monitoring Containerized Micro-Services In Azure
Alex Bulankou
 
Streaming to a new Jakarta EE / JOTB19
Markus Eisele
 
DEVNET-1169 CI/CT/CD on a Micro Services Applications using Docker, Salt & Ni...
Cisco DevNet
 
Iot cloud service v2.0
Vinod Wilson
 
.Net Microservices with Event Sourcing, CQRS, Docker and... Windows Server 20...
Javier García Magna
 
Microservices: Yes or not?
Eduard Tomàs
 
Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
JavaZone 2017 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
OpenValue meetup October 2017 - Microservices in action at the Dutch National...
Bert Jan Schrijver
 
Kubernetes Infra 2.0
Deepak Sood
 
All about that reactive ui
Paul van Zyl
 
GeekOut 2017 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
Devoxx PL 2018 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 
Cerberus : Framework for Manual and Automated Testing (Web Application)
CIVEL Benoit
 
Cerberus_Presentation1
CIVEL Benoit
 
Continuous Delivery Amsterdam - Microservices in action at the Dutch National...
Bert Jan Schrijver
 
J-Spring 2017 - Microservices in action at the Dutch National Police
Bert Jan Schrijver
 

More from David vonThenen (20)

PDF
The Future of UI/UX AI Generated Interfaces Tailored Just in Time
David vonThenen
 
PDF
Adaptive RAG Systems with Knowledge Graphs Building Reinforcement Learning Dr...
David vonThenen
 
PDF
The Rise of Agentic AI Harnessing Open Source for Dynamic Decision Making
David vonThenen
 
PDF
Explaining the Unexplainable Python Tools for AI Transparency using Captum
David vonThenen
 
PDF
2025 All Things Open AI - Leveraging Knowledge Graphs for RAG - A Smarter App...
David vonThenen
 
PDF
2025 NVIDIA GTC: Crack the AI Black Box: Practical Techniques for Explainable AI
David vonThenen
 
PDF
2025 SCaLE 22x - Training Multi-Modal ML Classification Models for Real Time ...
David vonThenen
 
PDF
2025 SCaLE 22x - Demystifying Building Natural Language Processing ML Models ...
David vonThenen
 
PDF
2025 DeveloperWeek - The Sound of Innovation: Why Voice Cloning Will Redefine...
David vonThenen
 
PDF
2025 Developer Week - Navigating the Edge-Cloud Bridge_ Building Resource Opt...
David vonThenen
 
PDF
2024 RTC Conference - Training Machine Learning Classification Models for Cre...
David vonThenen
 
PDF
2024 RTC CONF - Building Multiple Natural Language Processing Models to Work ...
David vonThenen
 
PDF
AI Dev Summit 2024 - Empower Your AI Journey_ Hands-on Machine Learning with ...
David vonThenen
 
PDF
SCaLE 21x - Voice-Activated AI Collaborators: A Hands-On Guide Using LLMs in ...
David vonThenen
 
PDF
RTC Conference 2023 - Edge Devices as Interactive Personal Assistants_ Unleas...
David vonThenen
 
PDF
RTC Conference 2023 - Enhancing Real-Time WebRTC Conversation Understanding U...
David vonThenen
 
PDF
Cloud Native Rejekts Europe 2022 - Learnings From Creating CI/CD Pipelines
David vonThenen
 
PDF
KubeCon EU 2020 - Provider vSphere All Things vSphere Working Group
David vonThenen
 
PDF
KubeCon Europe 2019 - VMware SIG - Intro to the CSI driver
David vonThenen
 
PPTX
How Container Schedulers and Software-based Storage will Change the Cloud
David vonThenen
 
The Future of UI/UX AI Generated Interfaces Tailored Just in Time
David vonThenen
 
Adaptive RAG Systems with Knowledge Graphs Building Reinforcement Learning Dr...
David vonThenen
 
The Rise of Agentic AI Harnessing Open Source for Dynamic Decision Making
David vonThenen
 
Explaining the Unexplainable Python Tools for AI Transparency using Captum
David vonThenen
 
2025 All Things Open AI - Leveraging Knowledge Graphs for RAG - A Smarter App...
David vonThenen
 
2025 NVIDIA GTC: Crack the AI Black Box: Practical Techniques for Explainable AI
David vonThenen
 
2025 SCaLE 22x - Training Multi-Modal ML Classification Models for Real Time ...
David vonThenen
 
2025 SCaLE 22x - Demystifying Building Natural Language Processing ML Models ...
David vonThenen
 
2025 DeveloperWeek - The Sound of Innovation: Why Voice Cloning Will Redefine...
David vonThenen
 
2025 Developer Week - Navigating the Edge-Cloud Bridge_ Building Resource Opt...
David vonThenen
 
2024 RTC Conference - Training Machine Learning Classification Models for Cre...
David vonThenen
 
2024 RTC CONF - Building Multiple Natural Language Processing Models to Work ...
David vonThenen
 
AI Dev Summit 2024 - Empower Your AI Journey_ Hands-on Machine Learning with ...
David vonThenen
 
SCaLE 21x - Voice-Activated AI Collaborators: A Hands-On Guide Using LLMs in ...
David vonThenen
 
RTC Conference 2023 - Edge Devices as Interactive Personal Assistants_ Unleas...
David vonThenen
 
RTC Conference 2023 - Enhancing Real-Time WebRTC Conversation Understanding U...
David vonThenen
 
Cloud Native Rejekts Europe 2022 - Learnings From Creating CI/CD Pipelines
David vonThenen
 
KubeCon EU 2020 - Provider vSphere All Things vSphere Working Group
David vonThenen
 
KubeCon Europe 2019 - VMware SIG - Intro to the CSI driver
David vonThenen
 
How Container Schedulers and Software-based Storage will Change the Cloud
David vonThenen
 
Ad

Recently uploaded (20)

PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
PDF
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
Biography of Daniel Podor.pdf
Daniel Podor
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
Advancing WebDriver BiDi support in WebKit
Igalia
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
July Patch Tuesday
Ivanti
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
“NPU IP Hardware Shaped Through Software and Use-case Analysis,” a Presentati...
Edge AI and Vision Alliance
 
Go Concurrency Real-World Patterns, Pitfalls, and Playground Battles.pdf
Emily Achieng
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Biography of Daniel Podor.pdf
Daniel Podor
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
Advancing WebDriver BiDi support in WebKit
Igalia
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
July Patch Tuesday
Ivanti
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Ad

SCaLE 16x - Application Monitoring And Tracing In Kubernetes

  • 1. Application Monitoring and Tracing in Kubernetes: Avoiding Microservice Hell! David vonThenen @dvonthenen http://dvonthenen.com github.com/dvonthenen
  • 2. Agenda • Why do we care? • Introduction to Metrics • Introduction to Tracing • Demo • Q&A • Free give away! Slim Power Bank/Chargers!
  • 3. Why do we care?
  • 4. Microservices Are Awesome! • Discrete Set of Functionality • Resilient / Tolerates Failure • Distributed / Highly Scalable • Technology Freedom • Autonomy of Dev Teams • Enables Continuous Delivery
  • 5. Can Be Your Worst Nightmare! • Complex to Build • Decentralized Nature • Interface / Docs Required • Operational Complexity • Transaction Management • Visibility is Difficult
  • 6. Microservices at Scale (Excuse the pun) https://www.slideshare.net/aspyker/reinvent-2016-container-scheduling-execution-and-aws-integration
  • 9. Who is Talking to Who?
  • 13. The Answer is… • Metrics/Instrumentation – Measure properties of a given system – Alarms and Notifications • Tracing – Observe interactions at a request level – Measure work in time
  • 15. What are Metrics? • Metrics are a quantifiable set of measurements of a property for a given system, process, or component. – Performance counters – Instrumentation • Observe behavior • React to changes
  • 16. Prometheus • Open-source systems monitoring and alerting project • Cloud Native Compute Foundation (CNCF) hosted project • Originally built by SoundCloud • Data model with time series data • https://github.com/prometheus/prometheus
  • 19. Types of Metrics • Counter – only increases in value • Gauge – value goes up or down over time • Histogram – samples observations and counts them over buckets • Summary – histogram plus a summation of value
  • 20. Alerts • Create rules based on observed metrics • Alerts trigger actions to be taken – Email – Slack – Webhooks • Why do we care? – Enables dynamic scale up and down
  • 21. Prometheus Language Bindings • 15 official and community supported libraries – Go, Java, Python, Ruby, C++, etc • https://prometheus.io/docs/instrumenting/clientlibs/
  • 23. What is Tracing? • Enables observability of a given transaction as it moves through a (distributed) system • Allows visualization of which microservice instances are involved • Tracks the path through the software stack + time metrics
  • 24. Jaeger • Open-source distributed tracing system • CNCF hosted project • Originally built by Uber • OpenTracing compatible • Root cause and observe performance • https://github.com/jaegertracing/jaeger
  • 28. Jaeger Language Bindings • 5 official and bunch of community supported libraries – Go, Java, Python, node, C++ – http://jaeger.readthedocs.io/en/latest/client_libraries/
  • 29. Metrics vs Tracing • Metrics – Gives a singular per node, instance, or component view of the world – Health checks, performance monitoring, etc – Alerts and reaction to change • Tracing – Follows a single transaction, API call, etc through a given system or application – Think what a stack trace provides except tracing is doing it in a distributed fashion
  • 30. Demo
  • 32. Demo Configuration • Kubernetes 1.7 • Prometheus 2.1 • Jaeger 1.0 • How-to: https://github.com/dvonthenen/proposals/tree/master/2018_SCALE16
  • 33. David vonThenen {code} – Dell Technologies @dvonthenen http://dvonthenen.com github.com/dvonthenen Thank You