SlideShare a Scribd company logo
Monitoring – The Open Source Way
MONITORING
when should you start?

Working in IT since 1999

Icinga team member since 2011

Nagios certified

Dog Lover, Crazy cyclist

Whisky Connoisseur – www.maltgeeks.com

My Site: www.aikilinux.com
Who Am I ?
Why do you want monitoring?
Know the state of the server.
Know the state of the service/product you provide.
Be able to anticipate issues before clients are
aware of them (proactive).
When Do Most Companies Start Monitoring?
• When a Client calls about a problem.
• Staging / Pre-Production
• Production.
• When the management is
asking for reports about SLA.
When should you start monitoring?
QA
Load Testing
E2E Testing
Staging
Monitoring = Day 2
Monitoring starts when your code is first
tested!!
Why Do You NEED Monitoring?
Know the state of the server.
Know the state of the service/product you provide.
Be able to anticipate issues before clients are aware of them (proactive).
Gather behavioural data about the system.
Learn about bottlenecks in the system and plan to avoid them.
Tracking the changes that happen on the system.
Know your code/system
Monitoring - When To start (or Metrics led development)
Monitoring - When To start (or Metrics led development)
H
Things that Happen when we hit production.
How do you go about setting up monitoring ?
Market research to select which tool to use.
And many more Tools...
What do we want to monitor ?
Laptops ? Servers? VoIP Phones?
Printers? Switches? PBX ?
Storage? FireWalls?
Routers? AP's ?
Before we select the tool, we need to define the
scope of what to include.
Monitoring - When To start (or Metrics led development)
Capabilities
What capabilities does the tool support ?
SNMP Added custom checks
Active/passive results Database integration
H/A Scaling
Distributed monitoring
Support ?
Community
Professional
Dedicated support personnel.
Provides SLA.
Known entity to communicate with.
Not depended on the “kindness of others”
• Use an external service.
• Hire external consultants.
• Build a full NOC.
How do you go about setting up
monitoring? – cont.
Use the tool you know
For Against
 Fast implementation “waste time for DEV system”
 Easy to set up “will not be the real product”
 Easy to hand over
 You don't care – it is for data
gathering
It is easy to start
Monitoring Hooks
Who knows what to look for ?
• The Person that wrote the code (most often)
How involved are they in the process ?
• Mostly – non at all – must change!!
Conclusion
* Start monitoring early.
* Involve Dev in the monitoring process.
* Make it part of CI (same as Unit testing).
* Gather as much data as possible, as early as
possible.
* Expose the data to the management and
architects.
* Always highlight any abnormal occurrence.
m
Q/A
Email :assaf@aikilinux.com
Twitter: @aikilinux

More Related Content

What's hot (19)

PPTX
What does "monitoring" mean? (FOSDEM 2017)
Brian Brazil
 
PDF
Just enough web ops for web developers
Datadog
 
PDF
Go Observability (in practice)
Eran Levy
 
PDF
Everything You wanted to Know About Distributed Tracing
Amuhinda Hungai
 
PDF
Winston - Netflix's event driven auto remediation and diagnostics tool
Vinay Shah
 
PDF
End to-end monitoring with the prometheus operator - Max Inden
Paris Container Day
 
PPTX
Evolution of the Prometheus TSDB (Percona Live Europe 2017)
Brian Brazil
 
PPTX
Evolution of Monitoring and Prometheus (Dublin 2018)
Brian Brazil
 
PPTX
Sensu Monitoring
Mohanasundaram Ponnusamy
 
PPTX
Sql azure cluster dashboard public.ppt
Qingsong Yao
 
PPTX
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
Nagios
 
PDF
Real time data driven applications (SQL vs NoSQL databases)
GoDataDriven
 
PPTX
Counting with Prometheus (CloudNativeCon+Kubecon Europe 2017)
Brian Brazil
 
PDF
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert
NETWAYS
 
PDF
Observability in Java: Getting Started with OpenTelemetry
DevOps.com
 
PPTX
Anatomy of a Prometheus Client Library (PromCon 2018)
Brian Brazil
 
PPTX
OpenTelemetry For Architects
Kevin Brockhoff
 
PDF
Hacklu2011 tricaud
stricaud
 
PDF
Software cracking and patching
Mayank Gavri
 
What does "monitoring" mean? (FOSDEM 2017)
Brian Brazil
 
Just enough web ops for web developers
Datadog
 
Go Observability (in practice)
Eran Levy
 
Everything You wanted to Know About Distributed Tracing
Amuhinda Hungai
 
Winston - Netflix's event driven auto remediation and diagnostics tool
Vinay Shah
 
End to-end monitoring with the prometheus operator - Max Inden
Paris Container Day
 
Evolution of the Prometheus TSDB (Percona Live Europe 2017)
Brian Brazil
 
Evolution of Monitoring and Prometheus (Dublin 2018)
Brian Brazil
 
Sensu Monitoring
Mohanasundaram Ponnusamy
 
Sql azure cluster dashboard public.ppt
Qingsong Yao
 
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
Nagios
 
Real time data driven applications (SQL vs NoSQL databases)
GoDataDriven
 
Counting with Prometheus (CloudNativeCon+Kubecon Europe 2017)
Brian Brazil
 
OSMC 2014: From monitoringsucks to monitoringlove (and back) | Kris Buytaert
NETWAYS
 
Observability in Java: Getting Started with OpenTelemetry
DevOps.com
 
Anatomy of a Prometheus Client Library (PromCon 2018)
Brian Brazil
 
OpenTelemetry For Architects
Kevin Brockhoff
 
Hacklu2011 tricaud
stricaud
 
Software cracking and patching
Mayank Gavri
 

Similar to Monitoring - When To start (or Metrics led development) (20)

PPTX
Icinga Camp Amsterdam - Monitoring – When to start
Icinga
 
PPTX
Icinga camp ams 2016 icinga2
Assaf Flatto
 
PPTX
Webinar: Keep Calm and Scale Out - A proactive guide to Monitoring MongoDB
MongoDB
 
PPTX
SpiceWorks Webinar: Whose logs, what logs, why logs
AlienVault
 
PPTX
Document Control in FDA Regulated Environments - When and how to automate
Jeff Thomas
 
PPTX
DockerCon SF 2019 - TDD is Dead
Kevin Crawley
 
PPT
Troubleshooting: A High-Value Asset For The Service-Provider Discipline
Sagi Brody
 
ODP
Building an Open Source AppSec Pipeline - 2015 Texas Linux Fest
Matt Tesauro
 
PDF
Its Not You Its Me MSSP Couples Counseling
Atif Ghauri
 
PPT
Security Outsourcing - Couples Counseling - Atif Ghauri
Atif Ghauri
 
ODP
Building an Open Source AppSec Pipeline
Matt Tesauro
 
PPTX
Incident Response Fails
Michael Gough
 
PPTX
Nagios Conference 2014 - Scott Wilkerson - Getting Started with Nagios Networ...
Nagios
 
PPTX
Information Security: Advanced SIEM Techniques
ReliaQuest
 
PPTX
So you-want-to-go-faster
Ooblioob
 
PDF
The Seven Deadly Sins of Incident Response
Lancope, Inc.
 
PPTX
Jason Kent - AppSec Without Additional Tools
centralohioissa
 
PPTX
The Unlikely Couple, DevOps and Security. Can it work?
Todd Benson (I.T. SPECIALIST and I.T. SECURITY)
 
PDF
Itsummit2015 blizzard
kevin_donovan
 
PPTX
Vendors, and Risk, and Tigers, and Bears, Oh My: How to Create a Vendor Revie...
Wendy Knox Everette
 
Icinga Camp Amsterdam - Monitoring – When to start
Icinga
 
Icinga camp ams 2016 icinga2
Assaf Flatto
 
Webinar: Keep Calm and Scale Out - A proactive guide to Monitoring MongoDB
MongoDB
 
SpiceWorks Webinar: Whose logs, what logs, why logs
AlienVault
 
Document Control in FDA Regulated Environments - When and how to automate
Jeff Thomas
 
DockerCon SF 2019 - TDD is Dead
Kevin Crawley
 
Troubleshooting: A High-Value Asset For The Service-Provider Discipline
Sagi Brody
 
Building an Open Source AppSec Pipeline - 2015 Texas Linux Fest
Matt Tesauro
 
Its Not You Its Me MSSP Couples Counseling
Atif Ghauri
 
Security Outsourcing - Couples Counseling - Atif Ghauri
Atif Ghauri
 
Building an Open Source AppSec Pipeline
Matt Tesauro
 
Incident Response Fails
Michael Gough
 
Nagios Conference 2014 - Scott Wilkerson - Getting Started with Nagios Networ...
Nagios
 
Information Security: Advanced SIEM Techniques
ReliaQuest
 
So you-want-to-go-faster
Ooblioob
 
The Seven Deadly Sins of Incident Response
Lancope, Inc.
 
Jason Kent - AppSec Without Additional Tools
centralohioissa
 
The Unlikely Couple, DevOps and Security. Can it work?
Todd Benson (I.T. SPECIALIST and I.T. SECURITY)
 
Itsummit2015 blizzard
kevin_donovan
 
Vendors, and Risk, and Tigers, and Bears, Oh My: How to Create a Vendor Revie...
Wendy Knox Everette
 
Ad

Recently uploaded (20)

PDF
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
PDF
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PPTX
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PDF
Staying Human in a Machine- Accelerated World
Catalin Jora
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
IoT-Powered Industrial Transformation – Smart Manufacturing to Connected Heal...
Rejig Digital
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Transforming Utility Networks: Large-scale Data Migrations with FME
Safe Software
 
LOOPS in C Programming Language - Technology
RishabhDwivedi43
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
Future Tech Innovations 2025 – A TechLists Insight
TechLists
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
Staying Human in a Machine- Accelerated World
Catalin Jora
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Ad

Monitoring - When To start (or Metrics led development)

  • 1. Monitoring – The Open Source Way MONITORING when should you start?
  • 2.  Working in IT since 1999  Icinga team member since 2011  Nagios certified  Dog Lover, Crazy cyclist  Whisky Connoisseur – www.maltgeeks.com  My Site: www.aikilinux.com Who Am I ?
  • 3. Why do you want monitoring? Know the state of the server. Know the state of the service/product you provide. Be able to anticipate issues before clients are aware of them (proactive).
  • 4. When Do Most Companies Start Monitoring? • When a Client calls about a problem. • Staging / Pre-Production • Production. • When the management is asking for reports about SLA.
  • 5. When should you start monitoring? QA Load Testing E2E Testing Staging
  • 6. Monitoring = Day 2 Monitoring starts when your code is first tested!!
  • 7. Why Do You NEED Monitoring? Know the state of the server. Know the state of the service/product you provide. Be able to anticipate issues before clients are aware of them (proactive). Gather behavioural data about the system. Learn about bottlenecks in the system and plan to avoid them. Tracking the changes that happen on the system.
  • 11. H Things that Happen when we hit production.
  • 12. How do you go about setting up monitoring ? Market research to select which tool to use. And many more Tools...
  • 13. What do we want to monitor ? Laptops ? Servers? VoIP Phones? Printers? Switches? PBX ? Storage? FireWalls? Routers? AP's ? Before we select the tool, we need to define the scope of what to include.
  • 15. Capabilities What capabilities does the tool support ? SNMP Added custom checks Active/passive results Database integration H/A Scaling Distributed monitoring
  • 16. Support ? Community Professional Dedicated support personnel. Provides SLA. Known entity to communicate with. Not depended on the “kindness of others”
  • 17. • Use an external service. • Hire external consultants. • Build a full NOC. How do you go about setting up monitoring? – cont.
  • 18. Use the tool you know For Against  Fast implementation “waste time for DEV system”  Easy to set up “will not be the real product”  Easy to hand over  You don't care – it is for data gathering
  • 19. It is easy to start
  • 20. Monitoring Hooks Who knows what to look for ? • The Person that wrote the code (most often) How involved are they in the process ? • Mostly – non at all – must change!!
  • 21. Conclusion * Start monitoring early. * Involve Dev in the monitoring process. * Make it part of CI (same as Unit testing). * Gather as much data as possible, as early as possible. * Expose the data to the management and architects. * Always highlight any abnormal occurrence.