Just Eat’s SRE Story
DevOps at Scale
Bennie Johnston
Head of Site Reliability Engineering
Rich Haigh
Director of Technology
Our vision
Creating the world’s greatest food community
What makes us special?
UK/Ukraine/Australia/Canada
500+ people in Tech
22.8m active customers
30+ teams
450+ services
2,700+ orders/min
1,500+ AWS instances in production
1.6M+ metrics/min
1.5TB+ logs/day
500+ releases/week
45% Revenue Growth (FY17)
FTSE100 >£5bn Market Cap
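To put the per-minute, per-day and per-week figures above on a common footing, here is a rough conversion sketch. It assumes the quoted numbers are sustained averages (the slide does not say whether they are averages or peaks) and that TB means decimal terabytes; the constant and variable names are illustrative only.

```python
# Figures quoted on the slide; each is a lower bound ("+" omitted).
ORDERS_PER_MIN = 2_700
METRICS_PER_MIN = 1_600_000
LOG_BYTES_PER_DAY = 1.5e12  # assumption: decimal terabytes (1 TB = 10**12 bytes)
RELEASES_PER_WEEK = 500

SECONDS_PER_DAY = 24 * 60 * 60

# Convert everything to a per-second or per-day rate.
orders_per_sec = ORDERS_PER_MIN / 60
metrics_per_sec = METRICS_PER_MIN / 60
log_mb_per_sec = LOG_BYTES_PER_DAY / SECONDS_PER_DAY / 1e6
releases_per_day = RELEASES_PER_WEEK / 7

print(f"orders/sec:   {orders_per_sec:.0f}")
print(f"metrics/sec:  {metrics_per_sec:,.0f}")
print(f"log MB/sec:   {log_mb_per_sec:.1f}")
print(f"releases/day: {releases_per_day:.1f}")
```

Under those assumptions this works out to roughly 45 orders, 26,667 metrics and 17.4 MB of logs every second, and about 71 releases per calendar day.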
What is SRE at Just Eat?
Site Reliability Engineering operates on 5 key principles...
1 - Relentlessly protect site availability
2 - Enable change to be delivered fast, but with quality
3 - Optimise the use of our infrastructure and resources
4 - Innovate to stay ahead
5 - Foster the right culture at Just Eat
We believe that Dev teams own their product - full stop!
How do we structure it?
Our customers are 30+ Dev Teams in multiple countries (these numbers vary)
Central Reliability Engineering department
- 24/7 Service Operations Centre (SOC)
- Development team
- Hosting/Platform
- Delivery Automation (CI/CD)
- Observability
- Service Management
Daily production standups
Weekly risk meeting
Monthly Engineering all-hands
1st class citizen in various architecture/project groups
What tools/processes do we own?
The Central vs. Distributed debate
At one extreme, SRE owns all tools and processes
+ economies of scale
+ faster decisions
- limits innovation
- slows down development teams
At the other extreme, Dev teams own all tools and processes
+ maximum flexibility for development teams
- tooling sprawl
- wasted time reinventing the wheel
- support problems
Our solution
+ central support for a range of tooling
+ ability for dev teams to interact via an open-source approach
+ freedom for dev teams to deviate
+ survival of the fittest approach
Lessons learnt as we’ve grown?
What didn’t work?
How do we deal with scale?
Example: internal tool we own
Bennie
Example: external tool we own
A formula for managing chaos?
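if (ReliabilityScore() < DesiredReliability())
{
    LetUsHelpYou();       // below target: SRE offers the team help
}
else
{
    LetUsHighlightYou();  // at or above target: SRE highlights the team
    Freedom++;            // and the team gains more freedom
}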
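The rule on this slide can be sketched as a small runnable example. This is only an illustration: the slide does not say how the reliability score or the desired reliability is measured, so the `Team` fields, the team names, the sample values and the integer `freedom` counter below are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Team:
    name: str
    reliability_score: float    # hypothetical: measured reliability, 0.0 to 1.0
    desired_reliability: float  # hypothetical: agreed target, 0.0 to 1.0
    freedom: int = 0            # hypothetical counter standing in for "Freedom++"


def review(team: Team) -> str:
    """Apply the slide's rule to one team and return the action taken."""
    if team.reliability_score < team.desired_reliability:
        return "help"       # LetUsHelpYou()
    team.freedom += 1       # Freedom++
    return "highlight"      # LetUsHighlightYou()


# Made-up teams and numbers, purely for illustration.
teams = [
    Team("checkout", reliability_score=0.9990, desired_reliability=0.9995),
    Team("search", reliability_score=0.9999, desired_reliability=0.9995),
]
actions = {team.name: review(team) for team in teams}
print(actions)
```

A team that meets its target is highlighted and gains freedom; a team below target gets help and its freedom stays unchanged.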
What’s next? The FUTURE!
Automation of observability. A step jump from the simple time series metrics.
The dream of incident resolution automation. The robots talking to the robots.
Questions?
If you want to contact us?
richard.haigh@just-eat.com
bennie.johnston@just-eat.com
If you want to read more about us?
Our tech blog: https://tech.just-eat.com
If you want to work for us ;)
Our Careers site: https://careers.just-eat.com

Just Eat: DevOps at Scale at AppD Global Tour London
