chronosphere.io
Choose Your Own Adventure
Eric D. Schabell
Director Evangelism
@ericschabell{@fosstodon.org}
Cloud Native Observability Pitfalls
Cloud Native Observability
Cloud Native
Data volume
Experiment:
- Hello World app on a 4-node Kubernetes cluster with Tracing, End User Metrics (EUM), Logs, Metrics (containers / nodes)
- 30 days == +450 GB
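To put the experiment's figure in perspective, here is a minimal sketch of the arithmetic. It treats 450 GB as the 30-day total and assumes the volume grows linearly over time, which the slide does not state; the function names are illustrative only.

```python
# Figures taken from the slide: roughly 450 GB of telemetry in 30 days on 4 nodes.
TOTAL_GB = 450
DAYS = 30
NODES = 4


def daily_rate_gb(total_gb: float, days: int) -> float:
    """Average telemetry volume per day."""
    return total_gb / days


def projected_gb(total_gb: float, days: int, horizon_days: int) -> float:
    """Linear extrapolation to a longer horizon (an assumption, not a measurement)."""
    return daily_rate_gb(total_gb, days) * horizon_days


if __name__ == "__main__":
    per_day = daily_rate_gb(TOTAL_GB, DAYS)
    print(f"{per_day:.1f} GB/day")                                # 15.0 GB/day
    print(f"{per_day / NODES:.2f} GB/day per node")               # 3.75 GB/day per node
    print(f"{projected_gb(TOTAL_GB, DAYS, 365):.0f} GB/year")     # 5475 GB/year
```

Even a Hello World app on a small cluster lands at about 15 GB per day under these assumptions, before any real workload is added.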
Cloud Native at Scale
Observability…
Cloud Native Observability at Scale
O11y at Scale (need)
Picking Your Pitfalls
1. Ignoring existing landscape
2. Focusing on The Pillars
3. Sneaky sprawling mess
4. Controlling costs
5. The protocol jungle
6. Underestimating cardinality
(Click on a pitfall to jump to that section)
1. Ignoring existing landscape
If they can’t see me… they can’t hurt me...
Prometheus for metrics, alerting, queries
Prometheus auto discovery
Manual instrumentation (java client lib)
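The slide refers to the Prometheus Java client library. As a language-neutral illustration of what manual instrumentation amounts to, here is a minimal standard-library Python sketch: the application increments a counter in its own code path and renders it in the Prometheus text exposition format for scraping. The `Counter` class, the `handle_request` function, and the metric name are hypothetical stand-ins written for this example; this is not the API of any official client library.

```python
import threading


class Counter:
    """Hypothetical stand-in for a client-library counter (a value that only goes up)."""

    def __init__(self, name: str, help_text: str):
        self.name = name
        self.help_text = help_text
        self._values = {}  # sorted label pairs -> running total
        self._lock = threading.Lock()

    def inc(self, amount: float = 1.0, **labels) -> None:
        if amount < 0:
            raise ValueError("counters can only go up")
        key = tuple(sorted((k, str(v)) for k, v in labels.items()))
        with self._lock:
            self._values[key] = self._values.get(key, 0.0) + amount

    def render(self) -> str:
        """Render in the Prometheus text format (simplified: no label escaping)."""
        lines = [
            f"# HELP {self.name} {self.help_text}",
            f"# TYPE {self.name} counter",
        ]
        with self._lock:
            for key, value in sorted(self._values.items()):
                label_str = ",".join(f'{k}="{v}"' for k, v in key)
                series = f"{self.name}{{{label_str}}}" if label_str else self.name
                lines.append(f"{series} {value}")
        return "\n".join(lines) + "\n"


# Manual instrumentation: the developer adds the inc() call to the code path.
requests_total = Counter("http_requests_total", "Total HTTP requests handled.")


def handle_request(path: str) -> str:
    requests_total.inc(path=path, status="200")
    return "ok"


if __name__ == "__main__":
    handle_request("/hello")
    handle_request("/hello")
    print(requests_total.render())
```

In a real service the rendered text would be served on an HTTP endpoint that Prometheus scrapes; a real client library handles that, along with escaping and the other metric types.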
Short link: bit.ly/prom-workshop
OpenTelemetry (Auto) instrumentation
[Diagram labels: Applications (Java); OTel Auto Instrumentation (libraries); OTel API; OTel SDK; OTel Collector; connections labeled OTLP]
OpenTelemetry Collector (agent)
[Diagram labels: Host; Applications; OTel Auto Instrumentation; OTel API; OTel SDK; OTel Collector Agent; Observability Backend (Prometheus, Jaeger, Fluent Bit, etc.); connections labeled OTLP]
OTel Collector Gateway
[Diagram labels: three Hosts; Applications; OTel Auto Instrumentation; OTel API; OTel SDK; OTel Collector Agent; Collector (gateway); Observability Backend (Prometheus, Jaeger, Fluent Bit, etc.); connections labeled OTLP]
Short link: bit.ly/opentelemetry-workshop
2. Focusing on The Pillars
Pillars Phases
Developer
Technology
Bottom up
Pillar problems…
Car is on fire…
Better outcomes…
Faster remediation…
Easier detection…
Happier customers…
Phase 1
Know something is happening as fast as possible…
Phase 2
Triage with specific information…
Phase 3
Understand to ensure it never happens again…
3. Sneaky sprawling mess
Over 66% of organizations use more than 10 different observability tools
– ESG report on exploding data volumes
Know
Triage
Understand
4. Controlling costs
“It’s remarkable how common this situation is, where an organization is paying more for their observability data, than they do for their production infrastructure.”
O11y data storage costs are broken.
Keeping everything model?
Know the cost of observability metrics data?
[Diagram: Customer Environment and Chronosphere SaaS Platform; METRICS | LOGS | TRACES | EVENTS]
Components: DATA COLLECTION; CONTROL PLANE; PURPOSE-BUILT DATA STORES PER TELEMETRY TYPE; CHRONOSPHERE LENS
Captions: Ingest all your data from any source; Align cost to value; Single Tenanted Architecture w/ 99.99% Reliability; Turns raw data into generated insights for each user
5. The protocol jungle
Without open standards, you’ll not find a way back…
OpenTelemetry Collector (agent)
[Diagram labels: Host; Applications; OTel Auto Instrumentation; OTel API; OTel SDK; OTel Collector Agent; Observability Backend (Prometheus, Jaeger, Fluent Bit, etc.); connections labeled OTLP]
Prometheus for metrics, alerting, queries
6. Underestimating cardinality
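The deck names cardinality as a pitfall without giving numbers. As a rough illustration of why it is easy to underestimate, the sketch below computes an upper bound on the number of time series one metric can produce: the product of the distinct values of each label. Every label name and count here is hypothetical and chosen only for the example.

```python
from math import prod


def series_cardinality(label_values: dict) -> int:
    """Upper bound on time series for one metric: product of distinct values per label."""
    return prod(label_values.values())


# Hypothetical label counts for a single request-count metric.
base = {"service": 20, "endpoint": 15, "status_code": 5}
print(series_cardinality(base))        # 1500

# Adding one unbounded label, such as a user ID, multiplies everything.
with_user = {**base, "user_id": 10_000}
print(series_cardinality(with_user))   # 15000000
```

The bound is only reached if every label combination actually occurs, but it shows how a single high-variety label dominates the total.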
The struggle is real
“I don't yet collect spans/traces because I can hardly get our devs to care about basic metrics, let alone traces.”
“This is a large enterprise with approx. 1000 developers. Cultivating a culture of engineering that cares about availability is a challenge that we need to solve alongside any technical implementations.”
10 hours on average, per week, trying to triage and understand incidents: a quarter of a 40-hour work week
33% said those issues disrupted their personal life
39% admitted they are frequently stressed out
Cloud Native Observability at Scale
[Diagram: Customer Environment and Chronosphere SaaS Platform; METRICS | LOGS | TRACES | EVENTS]
Components: DATA COLLECTION; CONTROL PLANE; PURPOSE-BUILT DATA STORES PER TELEMETRY TYPE; CHRONOSPHERE LENS
Captions: Ingest all your data from any source; Align cost to value; Single Tenanted Architecture w/ 99.99% Reliability; Turns raw data into generated insights for each user
What should be the #1 item on your cloud native observability wishlist?
Questions?
Eric D. Schabell
Director Evangelism
@ericschabell{@fosstodon.org}
