Failure Happens
F***, the F***ing thing is F***king F***ed*
            *Official WebOps term from Artur Bergman




         Jesse Robbins
       jesse@oreilly.com
This will be on the test:

 FAILURE HAPPENS!
Failure Happens Interop Nyc
25%   Pyromaniac



75%         Paranoid
Good
Book!
“multiple and unexpected
interactions of failures are
        inevitable”
                 - Charles Perrow
Failure Happens
define:
 Nines (roughly, downtime per year)
   99%        5256 min (3.65 days)
   99.9%      526 min (8.8 hours)
   99.99%     53 min
   99.999%    5 min
   99.9999%   32 seconds
   99.99999%  3 seconds
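
The figures in the nines table can be reproduced with a few lines of arithmetic. The sketch below assumes a 365-day year (525,600 minutes) and treats availability as a simple percentage of that year; the function name `downtime_minutes_per_year` is made up for illustration and does not come from the talk.

```python
# Assumption: a non-leap year of 365 days, i.e. 525,600 minutes.
MINUTES_PER_YEAR = 365 * 24 * 60


def downtime_minutes_per_year(availability_percent: float) -> float:
    """Return the downtime budget, in minutes per year, for a given availability."""
    return MINUTES_PER_YEAR * (100.0 - availability_percent) / 100.0


# Print the budget for two through seven nines, in minutes and in seconds.
for nines in ("99", "99.9", "99.99", "99.999", "99.9999", "99.99999"):
    minutes = downtime_minutes_per_year(float(nines))
    print(f"{nines + '%':<10} {minutes:10.2f} min  ({minutes * 60:.0f} s)")
```

Each extra nine divides the yearly downtime budget by ten, which is why the table drops from days to seconds so quickly.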
Internet Routing... won’t.
#googlefail
YOU
Continuous Power...
       isn’t
365 Main SF
364.96 Main SF
Failure happens

 A single datacenter is the problem
 • Since they all fail at some point

 Recovery procedures after failure
 • Power was gone ~45 minutes
 • Most services took hours to come back
 • Some unnamed ones more than 12 hours
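
Why a single datacenter is the problem can be sketched numerically. The example below is only an illustration under a strong assumption: that sites fail independently of one another. Sites that share geography or a provider do not fail independently, so real numbers will be worse. The function `combined_availability` and the 99.9% per-site figure are hypothetical, not taken from the talk.

```python
def combined_availability(site_availability: float, sites: int) -> float:
    """Probability that at least one of `sites` sites is up.

    Assumption: every site has the same availability and sites fail
    independently. Shared geography or a shared provider breaks this.
    """
    if not 0.0 <= site_availability <= 1.0:
        raise ValueError("site_availability must be between 0 and 1")
    if sites < 1:
        raise ValueError("sites must be at least 1")
    # All sites must be down at once for the service to be down.
    return 1.0 - (1.0 - site_availability) ** sites


# Hypothetical per-site availability of 99.9%, with one to three sites.
for n in (1, 2, 3):
    print(n, f"{combined_availability(0.999, n):.9f}")
```

The calculation says nothing about recovery time: a service that takes hours to come back after a 45-minute power loss loses far more availability than the outage itself suggests.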
Truck 1, Rackspace 0
Geography is a
Single Point of Failure
Taser-wielding robbers

C I Host's Chicago facility
robbed twice!

(the other two times were
merely "break-ins where things
were stolen")
Providers are
baskets too.
Failure Happens.
Anyone promising otherwise
 is either foolish or lying
          (or both).
Go Here!



  Jesse Robbins
jesse@oreilly.com
