Keeping your DevOps Transformation
from Crushing Your Ops Capacity
Damon Edwards
@damonedwards
Damon
Edwards
Ops Improvement
DevOps Consulting
Ops Tools
Community
This talk is about what happens after deployment
Deploy
Everything
Else
What does Ops do with it’s time?
Let’s start with an (unfortunately) typical incident…
Download the PDF here: https://rundeck.co/incident_does2017
And people still wonder where the time goes.
DevOps, Containers,
Continuous Deployment.. Why
aren’t we getting more done?
Executive Team
It’s only getting worse
The Operations Squeeze
Introduction
“The Operations Squeeze”
Go faster! Be flexible! Lock it down!
Improved Quality
Shorter Time-to-Market
Fast Feedback
From Users
Availability Auditing
Security Compliance
Dev Ops
Ops
“The Ops Squeeze”
It’s only getting worse
The Operations Squeeze
Introduction
“The Operations Squeeze”
Go faster! Be flexible! Lock it down!
Improved Quality
Shorter Time-to-Market
Fast Feedback
From Users
Availability Auditing
Security Compliance
Dev Ops
Ops
More errors
More delays
Less capacity
Less flexibility
“The Ops Squeeze”
Use a “Lean” lens to analyze operations activity
Lean: Empower those closest to the issue
escalate escalate
1° 2° 3°
escalate
4°
Lean: Empower those closest to the issue
escalate escalate
1° 2° 3°
escalate
4°
Push the ability to take action this direction
Lean: Empower those closest to the issue
escalate escalate
1° 2° 3°
escalate
4°
Push the ability to take action this direction
But what gets
in the way?
Lean: Empower those closest to the issue
escalate escalate
1° 2° 3°
escalate
4°
Push the ability to take action this direction
SilosBut what gets
in the way?
Backlog Context
I need X
Backlog
I do X
Requests
for X
Silo A
Priorities
Context
Priorities
Silo B
Tools Tools
Silos ruin everything
Backlog Context
I need X
Backlog
I do X
Requests
for X
Silo A
Priorities
Context
Priorities
Silo B
Tools Tools
Silos ruin everything
How to spot silos? Look for Ticket-Driven Request Queues
Team A
(Dev)
Team B
(Ops)
Ticket
System
??
Team A
(Dev)
Team B
(Ops)
Ticket
System
??
Silo Builder
How to spot silos? Look for Ticket-Driven Request Queues
Team A
(Dev)
Team B
(Ops)
Ticket
System
??
Silo Builder Snowflake Maker
How to spot silos? Look for Ticket-Driven Request Queues
Request queues proven to have a huge economic impact
All of those little delays end up being really expensive
RevenueperWeek
Time
COST OF DELAY Actual Revenue
Market: Long Lifetime Value, Peak Unaffected by Delay
Opportunity Ready
So lets get rid of silos & request queues!
Popular: Replace Silos with Cross Functional Team
Dev/Test Release OperatePlanning
Popular: Replace Silos with Cross Functional Team
Dev/Test Release OperatePlanning
Cross-Functional Teams
Cross-Functional Teams
Cross-Functional Teams
Dev/Test Release OperatePlanning
Popular: Replace Silos with Cross Functional Team
Cross-Functional Teams
Cross-Functional Teams
Cross-Functional Teams
EnvironmentsDBAs Network Security NOC
Dev/Test Release OperatePlanning
Popular: Replace Silos with Cross Functional Team
Cross-Functional Teams
Cross-Functional Teams
Cross-Functional Teams
EnvironmentsDBAs Network Security NOC
Dev/Test Release OperatePlanning
Team A
(Dev)
Team B
(Ops)
Ticket
System
??
Popular: Replace Silos with Cross Functional Team
Cross-Functional Teams
Cross-Functional Teams
Cross-Functional Teams
Team A
(Dev)
Team B
(Ops)
Ticket
System
??
Silo Builder Snowflake Maker
Get rid of the remaining ticket-driven request queues…
… by replacing with Operations as a Service design pattern
Team A
(Dev)
Team B
(Ops)
Ticket
System
Operations
as a
Service
Actual Exceptions
On
Demand
On
Demand
… by replacing with Operations as a Service design pattern
Team A
(Dev)
Team B
(Ops)
Ticket
System
Operations
as a
Service
Actual Exceptions
On
Demand
On
Demand
Change how your organization thinks
about automated procedures…
Automated procedures are comprised of three parts
Definition of the automated procedure
Execution of the automated procedure
Governance of the automated procedure
Define
Execute
Govern
Automated procedures are comprised of three parts
Definition of the automated procedure
Execution of the automated procedure
Governance of the automated procedure
Define
Execute
Govern
(security, oversight, compliance, etc.)
Traditional Ops Silo
Define
Execute
Govern
“Consumers of Ops”
(Dev, QA, Release, NOC, Security, etc.)
Ops
Rigid Self-Service
Define
Execute
Govern
“Consumers of Ops”
(Dev, QA, Release, NOC, Security, etc.)
Ops
Define
Execute
Govern
Execute
“Consumers of Ops”
(Dev, QA, Release, NOC, Security, etc.)
Ops
Rigid Self-Service (limited)
High-Velocity Handoffs
Define
Govern
Execute
“Consumers of Ops”
(Dev, QA, Release, NOC, Security, etc.)
Ops
Self-Service Operations
Define
Govern
Execute
“Consumers of Ops”
(Dev, QA, Release, NOC, Security, etc.)
Ops
Self-Service Operations
Define
Govern
Execute
Govern
“Consumers of Ops”
(Dev, QA, Release, NOC, Security, etc.)
Ops
fdfd
Operations as a Service
Operations
as a
Service
ED G
Team B
(Ops)
Vet
Procedures
Define
Policies
Execute
On Demand
Team A
(Dev)
Define
Procedures
Execute
On Demand
fdfd
Operations as a Service
Move definition, execution, and governance to where you
get the most effective use of labor and best flow of work
Operations
as a
Service
ED G
Team B
(Ops)
Vet
Procedures
Define
Policies
Execute
On Demand
Team A
(Dev)
Define
Procedures
Execute
On Demand
fdfd
Lean: Standardize to eliminate waste and enable improvement
Operations
as a
Service
ED G
Team B
(Ops)
Vet
Procedures
Define
Policies
Execute
On Demand
Team A
(Dev)
Define
Procedures
Execute
On Demand
Collaborate to create standard
operating procedures
Example Operations as a Service Platform
#! ! "# $
Scripts APIs Tools Cloud VMs Containers
Orchestration
Collect and
Process Output
Infrastructure
details
Config.
Man.
CMDB
Monitor.
Metrics
Cloud
Corp
Directory
Authentication
ITSM Tickets
>_
● Collaborate on Standard Procedures
● Manage Access & Governance policies
● Execute Actions & Share Visibility
Web GUI API CLI
Operations as a Service Platform
Example Operations as a Service Platform
#! ! "# $
Scripts APIs Tools Cloud VMs Containers
Orchestration
Collect and
Process Output
Infrastructure
details
Config.
Man.
CMDB
Monitor.
Metrics
Cloud
Corp
Directory
Authentication
ITSM Tickets
>_
● Collaborate on Standard Procedures
● Manage Access & Governance policies
● Execute Actions & Share Visibility
Web GUI API CLI
Operations as a Service Platform
Example Operations as a Service Platform
#! ! "# $
Scripts APIs Tools Cloud VMs Containers
Orchestration
Collect and
Process Output
Infrastructure
details
Config.
Man.
CMDB
Monitor.
Metrics
Cloud
Corp
Directory
Authentication
ITSM Tickets
>_
● Collaborate on Standard Procedures
● Manage Access & Governance policies
● Execute Actions & Share Visibility
Web GUI API CLI
Operations as a Service Platform
Example Operations as a Service Platform
#! ! "# $
Scripts APIs Tools Cloud VMs Containers
Orchestration
Collect and
Process Output
Infrastructure
details
Config.
Man.
CMDB
Monitor.
Metrics
Cloud
Corp
Directory
Authentication
ITSM Tickets
>_
● Collaborate on Standard Procedures
● Manage Access & Governance policies
● Execute Actions & Share Visibility
Web GUI API CLI
Operations as a Service Platform
Example Operations as a Service Platform
#! ! "# $
Scripts APIs Tools Cloud VMs Containers
Orchestration
Collect and
Process Output
Infrastructure
details
Config.
Man.
CMDB
Monitor.
Metrics
Cloud
Corp
Directory
Authentication
ITSM Tickets
>_
● Collaborate on Standard Procedures
● Manage Access & Governance policies
● Execute Actions & Share Visibility
Web GUI API CLI
Operations as a Service Platform
Let’s look at some companies who are
leveraging this design pattern.
Keeping Your DevOps Transformation From Crushing Your Ops Capacity
Keeping Your DevOps Transformation From Crushing Your Ops Capacity
Mark
Maun
Jody
Mulkey
Justin
Dean
90% Reduction in MTTR
50% Reduction in escalations
55% Reduction of overall support costs
Keeping Your DevOps Transformation From Crushing Your Ops Capacity
1. New org, support, and escalation model
escalate
1° 2° 3° 4°
escalate escalate
1. New org, support, and escalation model
escalate
1° 2° 3° 4°
escalate escalate
1. New org, support, and escalation model
EMT ER Trauma
Surgeon
Specialist
Surgeon
TOC (NOC) SRE
Production Eng.
Scrum Teams
Data Services
Platform Eng.
Global Network
> 15 min > 30 min > 60 min
2. Key: Push the ability to take action closest to the problem
escalate
1° 2° 3° 4°
escalate escalate
1. New org, support, and escalation model
EMT ER Trauma
Surgeon
Specialist
Surgeon
TOC (NOC) SRE
Production Eng.
Scrum Teams
Data Services
Platform Eng.
Global Network
> 15 min > 30 min > 60 min
2. Key: Push the ability to take action closest to the problem
escalate
1° 2° 3° 4°
escalate escalate
1. New org, support, and escalation model
EMT ER Trauma
Surgeon
Specialist
Surgeon
TOC (NOC) SRE
Production Eng.
Scrum Teams
Data Services
Platform Eng.
Global Network
> 15 min > 30 min > 60 min
3. Longterm investment in operability
(deployment, configuration, monitoring, automated runbooks)
“Support at the Edge”
Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ
http://rundeck.org/stories/mark_maun.html
“Support at the Edge”
Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ
http://rundeck.org/stories/mark_maun.html
• Automated Ops procedures written/
vetted by the delivery teams
“Support at the Edge”
Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ
http://rundeck.org/stories/mark_maun.html
• Automated Ops procedures written/
vetted by the delivery teams
• Ops remained in full control of what
can run and security policy
“Support at the Edge”
Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ
http://rundeck.org/stories/mark_maun.html
• Automated Ops procedures written/
vetted by the delivery teams
• Ops remained in full control of what
can run and security policy
• Empowered NOC and other support
teams with self-service ops tasks
“Support at the Edge”
Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ
http://rundeck.org/stories/mark_maun.html
• Automated Ops procedures written/
vetted by the delivery teams
• Ops remained in full control of what
can run and security policy
• Empowered NOC and other support
teams with self-service ops tasks
• Empowered developers with limited
self-service operations
“Support at the Edge”
Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ
http://rundeck.org/stories/mark_maun.html
• Automated Ops procedures written/
vetted by the delivery teams
• Ops remained in full control of what
can run and security policy
• Empowered NOC and other support
teams with self-service ops tasks
• Empowered developers with limited
self-service operations
• Combined with new incident response
and escalation model
Fortune 100 Manufacturing & Services: “Force Multiplier”
Team A
Services Scripts/Tools
ENV A
Services Scripts/Tools
ENV B
Services Scripts/Tools
ENV C
Team B Team C
Operations as a Service Platform
Ops
Team n…
Services Scripts/Tools
ENV n
Fortune 100 Manufacturing & Services: “Force Multiplier”
Team A
Services Scripts/Tools
ENV A
Services Scripts/Tools
ENV B
Services Scripts/Tools
ENV C
Team B Team C
Operations as a Service Platform
Ops
Team n…
Services Scripts/Tools
ENV n
• Manufacturing and service conglomerate with need to decouple IT
Fortune 100 Manufacturing & Services: “Force Multiplier”
Team A
Services Scripts/Tools
ENV A
Services Scripts/Tools
ENV B
Services Scripts/Tools
ENV C
Team B Team C
Operations as a Service Platform
Ops
Team n…
Services Scripts/Tools
ENV n
• Manufacturing and service conglomerate with need to decouple IT
• Each line of business has different security and policy needs
Fortune 100 Manufacturing & Services: “Force Multiplier”
Team A
Services Scripts/Tools
ENV A
Services Scripts/Tools
ENV B
Services Scripts/Tools
ENV C
Team B Team C
Operations as a Service Platform
Ops
Team n…
Services Scripts/Tools
ENV n
• Manufacturing and service conglomerate with need to decouple IT
• Each line of business has different security and policy needs
• Handle 5-10x the scale without adding to central Ops organization
Recap
Capacity crunch is only
going to get worse
Make explicit investment in
process and tooling
Operations as a Service: Reshaping IT Operations to Solve Today’s Challenges 4
D
evOps and Digital Transformations are
driving an unprecedented increase in
the pace and volume of daily change.
Who generally finds this to be welcome news?
Development and Product teams. Who has reasons
to be alarmed at the problems and challenges this
might bring? Operations.
Operations organizations in today’s enterprises
are finding themselves squeezed between two
unrelenting forces. On one side there are the
business-driven demands of DevOps and Digital
Transformation (“Go faster! Open things up!). On
the other side there are the demands to maximize
security and stability (“Don’t be the next hack! Don’t
be the next outage! Lock things down!”). And there, in
the middle, is an already over-burdened Operations
organization doing their best to avoid being squeezed
beyond the breaking point.
Operations has reached an inflection point. To deliver
what the business demands, Operations must find
a way to provide increasing levels of organizational
responsiveness and throughput — all while “locking
things down” to sufficiently meet today’s risk profiles.
A lot is riding on how Operations responds to this
challenge. A failure here is not just a localized IT
failure. A failure will undermine a business’s ability
to operate. Failing to solve this will turn into a
competitive disadvantage for the business.
On the flip side, this challenge also presents a great
opportunity. Operations can take this business
mandate and use it to reimagine how both planned
and unplanned work is handled. This is a chance to
improve how Operations both serves the broader
business and improves the day-to-day lives of
Operations professionals.
The Operations Squeeze
Introduction
“The Operations Squeeze”
Go faster! Be flexible! Lock it down!
Improved Quality
Shorter Time-to-Market
Fast Feedback
From Users
Availability Auditing
Security Compliance
Dev Ops
Ops
Use a “Lean lens” to
analyze Ops activity
Team A
(Dev)
Team B
(Ops)
Ticket
System
??
Beware of ticket-driven
request queues
Leverage the Operations as
a Service design pattern
#! ! "# $
Scripts APIs Tools Cloud VMs Containers
Orchestration
Collect and
Process Output
Infrastructure
details
Config.
Man.
CMDB
Monitor.
Metrics
Cloud
Corp
Directory
Authentication
ITSM Tickets
>_
● Collaborate on Standard Procedures
● Manage Access & Governance policies
● Execute Actions & Share Visibility
Web GUI API CLI
Operations as a Service Platform
“Shift-Left” control and
decision making.
Let’s talk…
@damonedwards
damon@rundeck.com
https://www.rundeck.com/oaas
Deep-dive write-up on Operations as a Service:

More Related Content

PDF
The "Ops" Side of DevSecOps
PDF
Operations as a Service: Because Failure Still Happens
PDF
Self-Service Operations: Because Failure Still Happens (Developer Edition)
PDF
Self-Service Operations: Because Ops Still Happens
PDF
Failure Happens: Improving Incident Response In Enterprises
PDF
Modern Operations: Solving DevOps’ Last Mile Problem
PDF
Helping Ops Help You: Development’s Role in Enabling Self-Service Operations
PDF
Ops Happens: Improving Incident Response Using DevOps and SRE Practices
The "Ops" Side of DevSecOps
Operations as a Service: Because Failure Still Happens
Self-Service Operations: Because Failure Still Happens (Developer Edition)
Self-Service Operations: Because Ops Still Happens
Failure Happens: Improving Incident Response In Enterprises
Modern Operations: Solving DevOps’ Last Mile Problem
Helping Ops Help You: Development’s Role in Enabling Self-Service Operations
Ops Happens: Improving Incident Response Using DevOps and SRE Practices

What's hot (20)

PDF
Incident Management in the Age of DevOps and SRE
PDF
SRE for Everyone: Making Tomorrow Better Than Today
PDF
Incident Management in the Age of DevOps and SRE
PDF
Incident Management in the Age of DevOps and SRE
PDF
The Last Mile Continued: Incident Management
PDF
SysAdmin to SRE: Solving the Last Mile Problem
PDF
Operations: The Last Mile
PDF
Clearing the Way For SRE In the Enterprise
PDF
SRE Lessons for the Enterprise
PDF
Tickets Make Operations Work Unnecessarily Miserable
PDF
Making Tomorrow Better than Today - Unlocking the Full Potential of Operations
PDF
DevOpsDays Austin: Helping Horses Become Unicorns, Chef's Operations Maturity...
PDF
Operations: The Last Mile
PDF
SysAdmin to SRE: Creating Capacity to Make Tomorrow Better Than Today
PDF
SRE From Scratch
PDF
Empower Devs, Simplify Ops, and Accelerate your Digital Transformation
PDF
Innovation and Architecture
PDF
DOES16 London - Better Faster Cheaper .. How?
PDF
All daydevops 2016 - Turning Human Capital into High Performance Organizati...
PDF
The History of DevOps (and what you need to do about it)
Incident Management in the Age of DevOps and SRE
SRE for Everyone: Making Tomorrow Better Than Today
Incident Management in the Age of DevOps and SRE
Incident Management in the Age of DevOps and SRE
The Last Mile Continued: Incident Management
SysAdmin to SRE: Solving the Last Mile Problem
Operations: The Last Mile
Clearing the Way For SRE In the Enterprise
SRE Lessons for the Enterprise
Tickets Make Operations Work Unnecessarily Miserable
Making Tomorrow Better than Today - Unlocking the Full Potential of Operations
DevOpsDays Austin: Helping Horses Become Unicorns, Chef's Operations Maturity...
Operations: The Last Mile
SysAdmin to SRE: Creating Capacity to Make Tomorrow Better Than Today
SRE From Scratch
Empower Devs, Simplify Ops, and Accelerate your Digital Transformation
Innovation and Architecture
DOES16 London - Better Faster Cheaper .. How?
All daydevops 2016 - Turning Human Capital into High Performance Organizati...
The History of DevOps (and what you need to do about it)
Ad

Similar to Keeping Your DevOps Transformation From Crushing Your Ops Capacity (20)

PPTX
2011 06 15 velocity conf from visible ops to dev ops final
PDF
Introduction to DevOps slides.pdf
PPTX
2011 09 19 LSPE Dev Ops Cookbook 1a
PDF
Devops (start walking in the same direction) by ops
PPTX
What is DevOps? What is DevOps CoE?
PDF
Creating an Operating Model to enable a high frequency organization
PPTX
JavaOne 2015 Devops and the Darkside CON6447
ODP
Devops, the future is here it's not evenly distributed yet
PPTX
2011 03 14 dev ops meetup - top lessons creating dev-ops super-tribes 2b
PDF
Chris Munns, DevOps @ Amazon: Microservices, 2 Pizza Teams, & 50 Million Depl...
PDF
Devops, Secops, Opsec, DevSec *ops *.* ?
PDF
Without Self-Service Operations, the Cloud is Just Expensive Hosting 2.0 - (a...
PDF
PuppetConf 2017: The Changing Role of Operations- Michael Stahnke, Puppet
PPTX
From DevOps to Operations Science
PDF
Leveraging Your Company's DevOps Transformation (AppSec USA 2014)
PDF
Ops Happens: DevOps Beyond Deployment - Damon Edwards
PPTX
Delivering Applications Continuously to Cloud
PPTX
Enterprise DevOps Transformation
PPTX
Winnipeg ISACA Security is Dead, Rugged DevOps
PDF
Tickets Make Ops Unnecessarily Miserable: The Journey to Self-Service
2011 06 15 velocity conf from visible ops to dev ops final
Introduction to DevOps slides.pdf
2011 09 19 LSPE Dev Ops Cookbook 1a
Devops (start walking in the same direction) by ops
What is DevOps? What is DevOps CoE?
Creating an Operating Model to enable a high frequency organization
JavaOne 2015 Devops and the Darkside CON6447
Devops, the future is here it's not evenly distributed yet
2011 03 14 dev ops meetup - top lessons creating dev-ops super-tribes 2b
Chris Munns, DevOps @ Amazon: Microservices, 2 Pizza Teams, & 50 Million Depl...
Devops, Secops, Opsec, DevSec *ops *.* ?
Without Self-Service Operations, the Cloud is Just Expensive Hosting 2.0 - (a...
PuppetConf 2017: The Changing Role of Operations- Michael Stahnke, Puppet
From DevOps to Operations Science
Leveraging Your Company's DevOps Transformation (AppSec USA 2014)
Ops Happens: DevOps Beyond Deployment - Damon Edwards
Delivering Applications Continuously to Cloud
Enterprise DevOps Transformation
Winnipeg ISACA Security is Dead, Rugged DevOps
Tickets Make Ops Unnecessarily Miserable: The Journey to Self-Service
Ad

More from Rundeck (20)

PDF
Rundeck Community Office Hours: Using Variables with Job Steps
PPTX
Introducing PagerDuty Process Automation
PDF
How to Build a Custom Plugin in Rundeck
PDF
Lunch and learn: Getting started with Rundeck & Ansible
PDF
Self Service Cloud Operations: Safely Delegate the Management of your Cloud ...
PDF
Rundeck Office Hours: Best Practices Access Control Policies
PPTX
Mastering Secrets Management in Rundeck
PDF
What's New in Rundeck 3.4
PDF
Automate Yourself Out of a Job: Safely Delegate the Management of your Azure...
PDF
Super-Charge Your Site Reliability Practices with Runbook Automation
PPTX
Introduction to Rundeck
PPTX
Automated Remediation with Rundeck + Sensu
PDF
Modernizing Incident Response
PDF
Runbook Automation: Old News or a Key to Unlock Performance? [DOES2020]
PDF
Datadog + Rundeck at DASH 2020
PDF
Rundeck Overview
PDF
Empower Devs, Simplify Ops, and Accelerate your Digital Transformation
PPTX
Advanced Cluster Settings
PDF
Maximizing Your Rundeck Migration
PDF
Business Continuity for Humans: Keeping Your Business Running When Your Peopl...
Rundeck Community Office Hours: Using Variables with Job Steps
Introducing PagerDuty Process Automation
How to Build a Custom Plugin in Rundeck
Lunch and learn: Getting started with Rundeck & Ansible
Self Service Cloud Operations: Safely Delegate the Management of your Cloud ...
Rundeck Office Hours: Best Practices Access Control Policies
Mastering Secrets Management in Rundeck
What's New in Rundeck 3.4
Automate Yourself Out of a Job: Safely Delegate the Management of your Azure...
Super-Charge Your Site Reliability Practices with Runbook Automation
Introduction to Rundeck
Automated Remediation with Rundeck + Sensu
Modernizing Incident Response
Runbook Automation: Old News or a Key to Unlock Performance? [DOES2020]
Datadog + Rundeck at DASH 2020
Rundeck Overview
Empower Devs, Simplify Ops, and Accelerate your Digital Transformation
Advanced Cluster Settings
Maximizing Your Rundeck Migration
Business Continuity for Humans: Keeping Your Business Running When Your Peopl...

Recently uploaded (20)

PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
August Patch Tuesday
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
STKI Israel Market Study 2025 version august
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
The various Industrial Revolutions .pptx
PDF
Hybrid model detection and classification of lung cancer
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
Modernising the Digital Integration Hub
PDF
Taming the Chaos: How to Turn Unstructured Data into Decisions
PPTX
Tartificialntelligence_presentation.pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PDF
Unlock new opportunities with location data.pdf
PDF
Getting Started with Data Integration: FME Form 101
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Enhancing emotion recognition model for a student engagement use case through...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
August Patch Tuesday
O2C Customer Invoices to Receipt V15A.pptx
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
STKI Israel Market Study 2025 version august
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
The various Industrial Revolutions .pptx
Hybrid model detection and classification of lung cancer
Assigned Numbers - 2025 - Bluetooth® Document
Univ-Connecticut-ChatGPT-Presentaion.pdf
Chapter 5: Probability Theory and Statistics
Modernising the Digital Integration Hub
Taming the Chaos: How to Turn Unstructured Data into Decisions
Tartificialntelligence_presentation.pptx
Developing a website for English-speaking practice to English as a foreign la...
Web Crawler for Trend Tracking Gen Z Insights.pptx
Unlock new opportunities with location data.pdf
Getting Started with Data Integration: FME Form 101
A novel scalable deep ensemble learning framework for big data classification...
Enhancing emotion recognition model for a student engagement use case through...

Keeping Your DevOps Transformation From Crushing Your Ops Capacity

  • 1. Keeping your DevOps Transformation from Crushing Your Ops Capacity Damon Edwards @damonedwards
  • 3. This talk is about what happens after deployment Deploy Everything Else What does Ops do with it’s time?
  • 4. Let’s start with an (unfortunately) typical incident… Download the PDF here: https://rundeck.co/incident_does2017
  • 5. And people still wonder where the time goes. DevOps, Containers, Continuous Deployment.. Why aren’t we getting more done? Executive Team
  • 6. It’s only getting worse The Operations Squeeze Introduction “The Operations Squeeze” Go faster! Be flexible! Lock it down! Improved Quality Shorter Time-to-Market Fast Feedback From Users Availability Auditing Security Compliance Dev Ops Ops “The Ops Squeeze”
  • 7. It’s only getting worse The Operations Squeeze Introduction “The Operations Squeeze” Go faster! Be flexible! Lock it down! Improved Quality Shorter Time-to-Market Fast Feedback From Users Availability Auditing Security Compliance Dev Ops Ops More errors More delays Less capacity Less flexibility “The Ops Squeeze”
  • 8. Use a “Lean” lens to analyze operations activity
  • 9. Lean: Empower those closest to the issue escalate escalate 1° 2° 3° escalate 4°
  • 10. Lean: Empower those closest to the issue escalate escalate 1° 2° 3° escalate 4° Push the ability to take action this direction
  • 11. Lean: Empower those closest to the issue escalate escalate 1° 2° 3° escalate 4° Push the ability to take action this direction But what gets in the way?
  • 12. Lean: Empower those closest to the issue escalate escalate 1° 2° 3° escalate 4° Push the ability to take action this direction SilosBut what gets in the way?
  • 13. Backlog Context I need X Backlog I do X Requests for X Silo A Priorities Context Priorities Silo B Tools Tools Silos ruin everything
  • 14. Backlog Context I need X Backlog I do X Requests for X Silo A Priorities Context Priorities Silo B Tools Tools Silos ruin everything
  • 15. How to spot silos? Look for Ticket-Driven Request Queues Team A (Dev) Team B (Ops) Ticket System ??
  • 16. Team A (Dev) Team B (Ops) Ticket System ?? Silo Builder How to spot silos? Look for Ticket-Driven Request Queues
  • 17. Team A (Dev) Team B (Ops) Ticket System ?? Silo Builder Snowflake Maker How to spot silos? Look for Ticket-Driven Request Queues
  • 18. Request queues proven to have a huge economic impact
  • 19. All of those little delays end up being really expensive RevenueperWeek Time COST OF DELAY Actual Revenue Market: Long Lifetime Value, Peak Unaffected by Delay Opportunity Ready
  • 20. So lets get rid of silos & request queues!
  • 21. Popular: Replace Silos with Cross Functional Team Dev/Test Release OperatePlanning
  • 22. Popular: Replace Silos with Cross Functional Team Dev/Test Release OperatePlanning Cross-Functional Teams Cross-Functional Teams Cross-Functional Teams
  • 23. Dev/Test Release OperatePlanning Popular: Replace Silos with Cross Functional Team Cross-Functional Teams Cross-Functional Teams Cross-Functional Teams
  • 24. EnvironmentsDBAs Network Security NOC Dev/Test Release OperatePlanning Popular: Replace Silos with Cross Functional Team Cross-Functional Teams Cross-Functional Teams Cross-Functional Teams
  • 25. EnvironmentsDBAs Network Security NOC Dev/Test Release OperatePlanning Team A (Dev) Team B (Ops) Ticket System ?? Popular: Replace Silos with Cross Functional Team Cross-Functional Teams Cross-Functional Teams Cross-Functional Teams
  • 26. Team A (Dev) Team B (Ops) Ticket System ?? Silo Builder Snowflake Maker Get rid of the remaining ticket-driven request queues…
  • 27. … by replacing with Operations as a Service design pattern Team A (Dev) Team B (Ops) Ticket System Operations as a Service Actual Exceptions On Demand On Demand
  • 28. … by replacing with Operations as a Service design pattern Team A (Dev) Team B (Ops) Ticket System Operations as a Service Actual Exceptions On Demand On Demand
  • 29. Change how your organization thinks about automated procedures…
  • 30. Automated procedures are comprised of three parts Definition of the automated procedure Execution of the automated procedure Governance of the automated procedure Define Execute Govern
  • 31. Automated procedures are comprised of three parts Definition of the automated procedure Execution of the automated procedure Governance of the automated procedure Define Execute Govern (security, oversight, compliance, etc.)
  • 32. Traditional Ops Silo Define Execute Govern “Consumers of Ops” (Dev, QA, Release, NOC, Security, etc.) Ops
  • 33. Rigid Self-Service Define Execute Govern “Consumers of Ops” (Dev, QA, Release, NOC, Security, etc.) Ops
  • 34. Define Execute Govern Execute “Consumers of Ops” (Dev, QA, Release, NOC, Security, etc.) Ops Rigid Self-Service (limited)
  • 35. High-Velocity Handoffs Define Govern Execute “Consumers of Ops” (Dev, QA, Release, NOC, Security, etc.) Ops
  • 36. Self-Service Operations Define Govern Execute “Consumers of Ops” (Dev, QA, Release, NOC, Security, etc.) Ops
  • 37. Self-Service Operations Define Govern Execute Govern “Consumers of Ops” (Dev, QA, Release, NOC, Security, etc.) Ops
  • 38. fdfd Operations as a Service Operations as a Service ED G Team B (Ops) Vet Procedures Define Policies Execute On Demand Team A (Dev) Define Procedures Execute On Demand
  • 39. fdfd Operations as a Service Move definition, execution, and governance to where you get the most effective use of labor and best flow of work Operations as a Service ED G Team B (Ops) Vet Procedures Define Policies Execute On Demand Team A (Dev) Define Procedures Execute On Demand
  • 40. fdfd Lean: Standardize to eliminate waste and enable improvement Operations as a Service ED G Team B (Ops) Vet Procedures Define Policies Execute On Demand Team A (Dev) Define Procedures Execute On Demand Collaborate to create standard operating procedures
  • 41. Example Operations as a Service Platform #! ! "# $ Scripts APIs Tools Cloud VMs Containers Orchestration Collect and Process Output Infrastructure details Config. Man. CMDB Monitor. Metrics Cloud Corp Directory Authentication ITSM Tickets >_ ● Collaborate on Standard Procedures ● Manage Access & Governance policies ● Execute Actions & Share Visibility Web GUI API CLI Operations as a Service Platform
  • 42. Example Operations as a Service Platform #! ! "# $ Scripts APIs Tools Cloud VMs Containers Orchestration Collect and Process Output Infrastructure details Config. Man. CMDB Monitor. Metrics Cloud Corp Directory Authentication ITSM Tickets >_ ● Collaborate on Standard Procedures ● Manage Access & Governance policies ● Execute Actions & Share Visibility Web GUI API CLI Operations as a Service Platform
  • 43. Example Operations as a Service Platform #! ! "# $ Scripts APIs Tools Cloud VMs Containers Orchestration Collect and Process Output Infrastructure details Config. Man. CMDB Monitor. Metrics Cloud Corp Directory Authentication ITSM Tickets >_ ● Collaborate on Standard Procedures ● Manage Access & Governance policies ● Execute Actions & Share Visibility Web GUI API CLI Operations as a Service Platform
  • 44. Example Operations as a Service Platform #! ! "# $ Scripts APIs Tools Cloud VMs Containers Orchestration Collect and Process Output Infrastructure details Config. Man. CMDB Monitor. Metrics Cloud Corp Directory Authentication ITSM Tickets >_ ● Collaborate on Standard Procedures ● Manage Access & Governance policies ● Execute Actions & Share Visibility Web GUI API CLI Operations as a Service Platform
  • 45. Example Operations as a Service Platform #! ! "# $ Scripts APIs Tools Cloud VMs Containers Orchestration Collect and Process Output Infrastructure details Config. Man. CMDB Monitor. Metrics Cloud Corp Directory Authentication ITSM Tickets >_ ● Collaborate on Standard Procedures ● Manage Access & Governance policies ● Execute Actions & Share Visibility Web GUI API CLI Operations as a Service Platform
  • 46. Let’s look at some companies who are leveraging this design pattern.
  • 50. 90% Reduction in MTTR 50% Reduction in escalations 55% Reduction of overall support costs
  • 52. 1. New org, support, and escalation model
  • 53. escalate 1° 2° 3° 4° escalate escalate 1. New org, support, and escalation model
  • 54. escalate 1° 2° 3° 4° escalate escalate 1. New org, support, and escalation model EMT ER Trauma Surgeon Specialist Surgeon TOC (NOC) SRE Production Eng. Scrum Teams Data Services Platform Eng. Global Network > 15 min > 30 min > 60 min
  • 55. 2. Key: Push the ability to take action closest to the problem escalate 1° 2° 3° 4° escalate escalate 1. New org, support, and escalation model EMT ER Trauma Surgeon Specialist Surgeon TOC (NOC) SRE Production Eng. Scrum Teams Data Services Platform Eng. Global Network > 15 min > 30 min > 60 min
  • 56. 2. Key: Push the ability to take action closest to the problem escalate 1° 2° 3° 4° escalate escalate 1. New org, support, and escalation model EMT ER Trauma Surgeon Specialist Surgeon TOC (NOC) SRE Production Eng. Scrum Teams Data Services Platform Eng. Global Network > 15 min > 30 min > 60 min 3. Longterm investment in operability (deployment, configuration, monitoring, automated runbooks)
  • 57. “Support at the Edge” Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ http://rundeck.org/stories/mark_maun.html
  • 58. “Support at the Edge” Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ http://rundeck.org/stories/mark_maun.html • Automated Ops procedures written/ vetted by the delivery teams
  • 59. “Support at the Edge” Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ http://rundeck.org/stories/mark_maun.html • Automated Ops procedures written/ vetted by the delivery teams • Ops remained in full control of what can run and security policy
  • 60. “Support at the Edge” Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ http://rundeck.org/stories/mark_maun.html • Automated Ops procedures written/ vetted by the delivery teams • Ops remained in full control of what can run and security policy • Empowered NOC and other support teams with self-service ops tasks
  • 61. “Support at the Edge” Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ http://rundeck.org/stories/mark_maun.html • Automated Ops procedures written/ vetted by the delivery teams • Ops remained in full control of what can run and security policy • Empowered NOC and other support teams with self-service ops tasks • Empowered developers with limited self-service operations
  • 62. “Support at the Edge” Sources: https://www.youtube.com/watch?v=_hr4KiB19bQ http://rundeck.org/stories/mark_maun.html • Automated Ops procedures written/ vetted by the delivery teams • Ops remained in full control of what can run and security policy • Empowered NOC and other support teams with self-service ops tasks • Empowered developers with limited self-service operations • Combined with new incident response and escalation model
  • 63. Fortune 100 Manufacturing & Services: “Force Multiplier” Team A Services Scripts/Tools ENV A Services Scripts/Tools ENV B Services Scripts/Tools ENV C Team B Team C Operations as a Service Platform Ops Team n… Services Scripts/Tools ENV n
  • 64. Fortune 100 Manufacturing & Services: “Force Multiplier” Team A Services Scripts/Tools ENV A Services Scripts/Tools ENV B Services Scripts/Tools ENV C Team B Team C Operations as a Service Platform Ops Team n… Services Scripts/Tools ENV n • Manufacturing and service conglomerate with need to decouple IT
  • 65. Fortune 100 Manufacturing & Services: “Force Multiplier” Team A Services Scripts/Tools ENV A Services Scripts/Tools ENV B Services Scripts/Tools ENV C Team B Team C Operations as a Service Platform Ops Team n… Services Scripts/Tools ENV n • Manufacturing and service conglomerate with need to decouple IT • Each line of business has different security and policy needs
  • 66. Fortune 100 Manufacturing & Services: “Force Multiplier” Team A Services Scripts/Tools ENV A Services Scripts/Tools ENV B Services Scripts/Tools ENV C Team B Team C Operations as a Service Platform Ops Team n… Services Scripts/Tools ENV n • Manufacturing and service conglomerate with need to decouple IT • Each line of business has different security and policy needs • Handle 5-10x the scale without adding to central Ops organization
  • 67. Recap Capacity crunch is only going to get worse Make explicit investment in process and tooling Operations as a Service: Reshaping IT Operations to Solve Today’s Challenges 4 D evOps and Digital Transformations are driving an unprecedented increase in the pace and volume of daily change. Who generally finds this to be welcome news? Development and Product teams. Who has reasons to be alarmed at the problems and challenges this might bring? Operations. Operations organizations in today’s enterprises are finding themselves squeezed between two unrelenting forces. On one side there are the business-driven demands of DevOps and Digital Transformation (“Go faster! Open things up!). On the other side there are the demands to maximize security and stability (“Don’t be the next hack! Don’t be the next outage! Lock things down!”). And there, in the middle, is an already over-burdened Operations organization doing their best to avoid being squeezed beyond the breaking point. Operations has reached an inflection point. To deliver what the business demands, Operations must find a way to provide increasing levels of organizational responsiveness and throughput — all while “locking things down” to sufficiently meet today’s risk profiles. A lot is riding on how Operations responds to this challenge. A failure here is not just a localized IT failure. A failure will undermine a business’s ability to operate. Failing to solve this will turn into a competitive disadvantage for the business. On the flip side, this challenge also presents a great opportunity. Operations can take this business mandate and use it to reimagine how both planned and unplanned work is handled. This is a chance to improve how Operations both serves the broader business and improves the day-to-day lives of Operations professionals. The Operations Squeeze Introduction “The Operations Squeeze” Go faster! Be flexible! Lock it down! Improved Quality Shorter Time-to-Market Fast Feedback From Users Availability Auditing Security Compliance Dev Ops Ops Use a “Lean lens” to analyze Ops activity Team A (Dev) Team B (Ops) Ticket System ?? Beware of ticket-driven request queues Leverage the Operations as a Service design pattern #! ! "# $ Scripts APIs Tools Cloud VMs Containers Orchestration Collect and Process Output Infrastructure details Config. Man. CMDB Monitor. Metrics Cloud Corp Directory Authentication ITSM Tickets >_ ● Collaborate on Standard Procedures ● Manage Access & Governance policies ● Execute Actions & Share Visibility Web GUI API CLI Operations as a Service Platform “Shift-Left” control and decision making.