SlideShare a Scribd company logo
Steve Loughran HP Laboratories, Bristol, UK April 2008 Deploying on EC2
Researcher at HP Laboratories Area of interest: Deployment Author of  Ant in Action Steve Loughran
How to host big applications across distributed resources Automatically Repeatably Dynamically Correctly Securely How to manage them from installation to removal How to make dynamically allocated servers useful Our research - see smartfrog.org
Who had breakfast this morning? Question
Who harvested wheat or corn,  or killed an animal for  that breakfast? Question
Farms provide food. It is  somebody else's problem
Old world installation: single server Single web server, Single DB RAID filestore -SPOF -limitations of scale
yesterday: clustering Multiple web servers, Replicated DB RAID Network filestore Load-balancing router -Cost -Complexity -Limitations of scale Maintains the illusion of a single server
Now: server farms +  Agile Infrastructure 500+ servers Distributed filestore Rented storage  & CPU Scales up No capital outlay http://www.linuxjournal.com/
Assumptions that are now invalid System failure is an unusual event 100% availability can be achieved Data is always near the server You need physical access to the servers Databases are the best form of storage You need millions of $/£/€ to play
Who has the servers? Yahoo!, Google, MSN, Amazon, eBay: services MMORPG Game Vendors:  World of Warcraft, Second Life EU Grid: Scientists HP, IBM, Sun: rent to companies (some resold)  -focus on CPU performance for enterprise Amazon: rent to anyone with an Amazon account -focus on startups
Amazon S3 Multiple geo-located data storage No limits on size Cost of write is high (guarantee of written remotely) Read is cheap; may be out of date Cost: Low S3 is a global file system at a low price
Amazon S3 Charges S3 sets the limit on costs for reliable data storage over the network For Amazon, indexing and writes are the big costs…small files are the enemy  Storage $0.15/GB/month Upload $0.10 per GB - all data transfer in Download $0.18 per GB - first 10 TB / month data transfer out $0.16 per GB - next 40 TB / month data transfer out $0.13 per GB - data transfer out / month over 50 TB  Requests $0.01 per 1,000 PUT or LIST $0.01 per 10,000 GET or HEAD  $0 DELETE
SmartFrog S3 Components Restlet API (restlet.org) HTTP operations Has Amazon AWS authentication support  TransientS3Bucket extends S3Bucket { startActions [PUT_ACTION]; livenessActions [HEAD_ACTION]; terminateActions [S3_DELETE_ACTION]; } PersistentS3Bucket extends TransientS3Bucket { terminateActions []; }
Amazon EC2 Pay as you go Virtual Machine Hosting No persistent storage other than S3 filestore -uses HTTP GET/PUT/DELETE operations $0.10 per CPU/hour Resold OS images for more (RedHat) In 2008: static IP, failover/balancing In 2008: RAID-like storage
Amazon EC2 Host S3 Storage AMI (Xen VM) AMI (Xen VM) /mnt Host AMI (Xen VM) AMI (Xen VM) Public Internet /mnt /mnt /mnt Fast (free) network free access; slow initial read time pay per GET; per megabyte $ $ $ $ $
Demo
SmartFrog EC2 Components service extends ImageInstance { id "0X03DS92MX8K2A29P082"; imageID "ami-26b6534f"; key "EmlMg61YbNoThisIsNotMyKey"; minCount 10; maxCount 100; }; List available images Instantiate any number of images List deployed instances Terminate deployed instances Currently built on Typica
EC2 Limitations Can't talk to peers using public IP addresses No persistent file system other than S3 Most addresses are dynamic No managed redundancy/restart No multicast IP No movement of VMs off high-traffic racks Expensive to create/destroy per test case
EC2 and Apache Great platform for 'ready to use' machines Good for interop testing Need to automate machine update Need to improve the EC2 tooling Need to convince Amazon to give us lower cost S3/EC2 with lower QoS Hadoop, Tomcat, Geronimo…
Problems for us farmers Power management Predictive disk failure management Load balancing for availability, power  File management Billing Routing Security/isolation Managing machine images Diagnostics Evolution of datacentre hardware
Feb 2008 Amazon Outage S3 and AWS suddenly started failing Intermittent, system wide, not visible to all Root cause: authentication service overloaded A Single Point of Failure will always find you <?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?>  <Error><Code>InternalError</Code>  <Message>We encountered an internal error. Please try again.</Message>  <RequestId>A2A7E5395E27DFBB</RequestId>  <HostId>f691zulHNsUqonsZkjhIL/sGsn6K</HostId>  </Error>

More Related Content

What's hot (14)

PDF
Deep Learning을 위한 AWS 기반 인공 지능(AI) 서비스 (윤석찬)
Amazon Web Services Korea
 
PPTX
Heap Dump Analysis - AEM: Real World Issues
Kanika Gera
 
PPTX
Web scale architecture design
NepalAdz
 
PPT
Cloud Computing Primer: Using cloud computing tools in your museum
Robert J. Stein
 
PPTX
Optimizing AWS Economics
Aaron Klein
 
KEY
Scaling Django for X Factor - DJUGL Oct 2012
Malcolm Box
 
PPTX
AWS Meet-up Atlanta: AWS Economics
Aaron Klein
 
PPT
Developing And Running A Website On Amazon S E
jaymuntz
 
PDF
Leveraging Databricks for Spark pipelines
Rose Toomey
 
PPTX
Designing for elasticity on AWS - 9.11.2015
Anton Babenko
 
PDF
Terraform @Base
Miroslaw Nagas
 
PPTX
AWS Meet-up San Francisco: Cloud Security
Aaron Klein
 
PPTX
Operational challenges behind Serverless architectures
Laurent Bernaille
 
PDF
Дмитро Волошин "High[Page]load"
Fwdays
 
Deep Learning을 위한 AWS 기반 인공 지능(AI) 서비스 (윤석찬)
Amazon Web Services Korea
 
Heap Dump Analysis - AEM: Real World Issues
Kanika Gera
 
Web scale architecture design
NepalAdz
 
Cloud Computing Primer: Using cloud computing tools in your museum
Robert J. Stein
 
Optimizing AWS Economics
Aaron Klein
 
Scaling Django for X Factor - DJUGL Oct 2012
Malcolm Box
 
AWS Meet-up Atlanta: AWS Economics
Aaron Klein
 
Developing And Running A Website On Amazon S E
jaymuntz
 
Leveraging Databricks for Spark pipelines
Rose Toomey
 
Designing for elasticity on AWS - 9.11.2015
Anton Babenko
 
Terraform @Base
Miroslaw Nagas
 
AWS Meet-up San Francisco: Cloud Security
Aaron Klein
 
Operational challenges behind Serverless architectures
Laurent Bernaille
 
Дмитро Волошин "High[Page]load"
Fwdays
 

Viewers also liked (19)

PDF
Hadoop & Hep
Steve Loughran
 
PPTX
HA Hadoop -ApacheCon talk
Steve Loughran
 
PPTX
Help! My Hadoop doesn't work!
Steve Loughran
 
PPT
When Web Services Go Bad
Steve Loughran
 
PPT
Beyond Unit Testing
Steve Loughran
 
PDF
Benchmarking
Steve Loughran
 
PPT
Testing
Steve Loughran
 
PPT
The Wondrous Curse of Interoperability
Steve Loughran
 
PPTX
Hadoop: today and tomorrow
Steve Loughran
 
PPT
My other computer is a datacentre - 2012 edition
Steve Loughran
 
PDF
Hadoop Futures
Steve Loughran
 
PPTX
New Roles In The Cloud
Steve Loughran
 
ODP
Farming hadoop in_the_cloud
Steve Loughran
 
PPTX
Hadoop and Kerberos: the Madness Beyond the Gate: January 2016 edition
Steve Loughran
 
PPTX
Spark Summit East 2017: Apache spark and object stores
Steve Loughran
 
PPT
Application Architecture For The Cloud
Steve Loughran
 
PPTX
Apache Spark and Object Stores
Steve Loughran
 
PPTX
Household INFOSEC in a Post-Sony Era
Steve Loughran
 
PPTX
Hadoop gets Groovy
Steve Loughran
 
Hadoop & Hep
Steve Loughran
 
HA Hadoop -ApacheCon talk
Steve Loughran
 
Help! My Hadoop doesn't work!
Steve Loughran
 
When Web Services Go Bad
Steve Loughran
 
Beyond Unit Testing
Steve Loughran
 
Benchmarking
Steve Loughran
 
The Wondrous Curse of Interoperability
Steve Loughran
 
Hadoop: today and tomorrow
Steve Loughran
 
My other computer is a datacentre - 2012 edition
Steve Loughran
 
Hadoop Futures
Steve Loughran
 
New Roles In The Cloud
Steve Loughran
 
Farming hadoop in_the_cloud
Steve Loughran
 
Hadoop and Kerberos: the Madness Beyond the Gate: January 2016 edition
Steve Loughran
 
Spark Summit East 2017: Apache spark and object stores
Steve Loughran
 
Application Architecture For The Cloud
Steve Loughran
 
Apache Spark and Object Stores
Steve Loughran
 
Household INFOSEC in a Post-Sony Era
Steve Loughran
 
Hadoop gets Groovy
Steve Loughran
 
Ad

Similar to Deploying On EC2 (20)

PPT
Amazon S3 and EC2
george.james
 
PPT
view the deck as powerpoint
webhostingguy
 
PPT
view the deck as powerpoint
webhostingguy
 
PPTX
How to run your Hadoop Cluster in 10 minutes
Vladimir Simek
 
PPT
Exploring The Cloud
awesomesos
 
PDF
Cloud Talk
John Willis
 
PPT
Building Highly Scalable Web Applications
IWMW
 
PPT
Amazon
iamzkz
 
PPTX
Crunch Your Data in the Cloud with Elastic Map Reduce - Amazon EMR Hadoop
Adrian Cockcroft
 
ODP
Amazon Aws Presentation Drupal
guest856f3a
 
PPTX
Windows Azure - Uma Plataforma para o Desenvolvimento de Aplicações
Comunidade NetPonto
 
PPT
Hadoop and Voldemort @ LinkedIn
Hadoop User Group
 
PPT
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Bhupesh Bansal
 
PPT
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
 
PPTX
Windows Azure: Lessons From The Field
Rob Gillen
 
PDF
Scale, baby, scale!
Julien SIMON
 
PPTX
AZUG.BE - Azure User Group Belgium - First public meeting
Maarten Balliauw
 
PDF
AWS 101 December 2014
Ian Massingham
 
PPTX
GIS & Cloud Computing - GAASC 2010 Fall Summit - Florence, SC
Jim Tochterman
 
PDF
Jeff Barr Amazon Services Cloud Computing
deimos
 
Amazon S3 and EC2
george.james
 
view the deck as powerpoint
webhostingguy
 
view the deck as powerpoint
webhostingguy
 
How to run your Hadoop Cluster in 10 minutes
Vladimir Simek
 
Exploring The Cloud
awesomesos
 
Cloud Talk
John Willis
 
Building Highly Scalable Web Applications
IWMW
 
Amazon
iamzkz
 
Crunch Your Data in the Cloud with Elastic Map Reduce - Amazon EMR Hadoop
Adrian Cockcroft
 
Amazon Aws Presentation Drupal
guest856f3a
 
Windows Azure - Uma Plataforma para o Desenvolvimento de Aplicações
Comunidade NetPonto
 
Hadoop and Voldemort @ LinkedIn
Hadoop User Group
 
Voldemort & Hadoop @ Linkedin, Hadoop User Group Jan 2010
Bhupesh Bansal
 
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
 
Windows Azure: Lessons From The Field
Rob Gillen
 
Scale, baby, scale!
Julien SIMON
 
AZUG.BE - Azure User Group Belgium - First public meeting
Maarten Balliauw
 
AWS 101 December 2014
Ian Massingham
 
GIS & Cloud Computing - GAASC 2010 Fall Summit - Florence, SC
Jim Tochterman
 
Jeff Barr Amazon Services Cloud Computing
deimos
 
Ad

More from Steve Loughran (20)

PPTX
Hadoop Vectored IO
Steve Loughran
 
PPTX
The age of rename() is over
Steve Loughran
 
PPTX
What does Rename Do: (detailed version)
Steve Loughran
 
PPTX
Put is the new rename: San Jose Summit Edition
Steve Loughran
 
PPTX
@Dissidentbot: dissent will be automated!
Steve Loughran
 
PPTX
PUT is the new rename()
Steve Loughran
 
PPT
Extreme Programming Deployed
Steve Loughran
 
PPT
Testing
Steve Loughran
 
PPTX
I hate mocking
Steve Loughran
 
PPTX
What does rename() do?
Steve Loughran
 
PPTX
Dancing Elephants: Working with Object Storage in Apache Spark and Hive
Steve Loughran
 
PPTX
Apache Spark and Object Stores —for London Spark User Group
Steve Loughran
 
PPTX
Hadoop, Hive, Spark and Object Stores
Steve Loughran
 
PPTX
Hadoop and Kerberos: the Madness Beyond the Gate
Steve Loughran
 
PPTX
Slider: Applications on YARN
Steve Loughran
 
PPTX
YARN Services
Steve Loughran
 
PPTX
Datacentre stack
Steve Loughran
 
PPTX
Overview of slider project
Steve Loughran
 
ODP
2014 01-02-patching-workflow
Steve Loughran
 
PPTX
2013 11-19-hoya-status
Steve Loughran
 
Hadoop Vectored IO
Steve Loughran
 
The age of rename() is over
Steve Loughran
 
What does Rename Do: (detailed version)
Steve Loughran
 
Put is the new rename: San Jose Summit Edition
Steve Loughran
 
@Dissidentbot: dissent will be automated!
Steve Loughran
 
PUT is the new rename()
Steve Loughran
 
Extreme Programming Deployed
Steve Loughran
 
I hate mocking
Steve Loughran
 
What does rename() do?
Steve Loughran
 
Dancing Elephants: Working with Object Storage in Apache Spark and Hive
Steve Loughran
 
Apache Spark and Object Stores —for London Spark User Group
Steve Loughran
 
Hadoop, Hive, Spark and Object Stores
Steve Loughran
 
Hadoop and Kerberos: the Madness Beyond the Gate
Steve Loughran
 
Slider: Applications on YARN
Steve Loughran
 
YARN Services
Steve Loughran
 
Datacentre stack
Steve Loughran
 
Overview of slider project
Steve Loughran
 
2014 01-02-patching-workflow
Steve Loughran
 
2013 11-19-hoya-status
Steve Loughran
 

Recently uploaded (20)

PDF
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PDF
Python basic programing language for automation
DanialHabibi2
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
July Patch Tuesday
Ivanti
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
CIFDAQ Market Insights for July 7th 2025
CIFDAQ
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
Python basic programing language for automation
DanialHabibi2
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
July Patch Tuesday
Ivanti
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 

Deploying On EC2

  • 1. Steve Loughran HP Laboratories, Bristol, UK April 2008 Deploying on EC2
  • 2. Researcher at HP Laboratories Area of interest: Deployment Author of Ant in Action Steve Loughran
  • 3. How to host big applications across distributed resources Automatically Repeatably Dynamically Correctly Securely How to manage them from installation to removal How to make dynamically allocated servers useful Our research - see smartfrog.org
  • 4. Who had breakfast this morning? Question
  • 5. Who harvested wheat or corn, or killed an animal for that breakfast? Question
  • 6. Farms provide food. It is somebody else's problem
  • 7. Old world installation: single server Single web server, Single DB RAID filestore -SPOF -limitations of scale
  • 8. yesterday: clustering Multiple web servers, Replicated DB RAID Network filestore Load-balancing router -Cost -Complexity -Limitations of scale Maintains the illusion of a single server
  • 9. Now: server farms + Agile Infrastructure 500+ servers Distributed filestore Rented storage & CPU Scales up No capital outlay http://www.linuxjournal.com/
  • 10. Assumptions that are now invalid System failure is an unusual event 100% availability can be achieved Data is always near the server You need physical access to the servers Databases are the best form of storage You need millions of $/£/€ to play
  • 11. Who has the servers? Yahoo!, Google, MSN, Amazon, eBay: services MMORPG Game Vendors: World of Warcraft, Second Life EU Grid: Scientists HP, IBM, Sun: rent to companies (some resold) -focus on CPU performance for enterprise Amazon: rent to anyone with an Amazon account -focus on startups
  • 12. Amazon S3 Multiple geo-located data storage No limits on size Cost of write is high (guarantee of written remotely) Read is cheap; may be out of date Cost: Low S3 is a global file system at a low price
  • 13. Amazon S3 Charges S3 sets the limit on costs for reliable data storage over the network For Amazon, indexing and writes are the big costs…small files are the enemy Storage $0.15/GB/month Upload $0.10 per GB - all data transfer in Download $0.18 per GB - first 10 TB / month data transfer out $0.16 per GB - next 40 TB / month data transfer out $0.13 per GB - data transfer out / month over 50 TB Requests $0.01 per 1,000 PUT or LIST $0.01 per 10,000 GET or HEAD $0 DELETE
  • 14. SmartFrog S3 Components Restlet API (restlet.org) HTTP operations Has Amazon AWS authentication support TransientS3Bucket extends S3Bucket { startActions [PUT_ACTION]; livenessActions [HEAD_ACTION]; terminateActions [S3_DELETE_ACTION]; } PersistentS3Bucket extends TransientS3Bucket { terminateActions []; }
  • 15. Amazon EC2 Pay as you go Virtual Machine Hosting No persistent storage other than S3 filestore -uses HTTP GET/PUT/DELETE operations $0.10 per CPU/hour Resold OS images for more (RedHat) In 2008: static IP, failover/balancing In 2008: RAID-like storage
  • 16. Amazon EC2 Host S3 Storage AMI (Xen VM) AMI (Xen VM) /mnt Host AMI (Xen VM) AMI (Xen VM) Public Internet /mnt /mnt /mnt Fast (free) network free access; slow initial read time pay per GET; per megabyte $ $ $ $ $
  • 17. Demo
  • 18. SmartFrog EC2 Components service extends ImageInstance { id &quot;0X03DS92MX8K2A29P082&quot;; imageID &quot;ami-26b6534f&quot;; key &quot;EmlMg61YbNoThisIsNotMyKey&quot;; minCount 10; maxCount 100; }; List available images Instantiate any number of images List deployed instances Terminate deployed instances Currently built on Typica
  • 19. EC2 Limitations Can't talk to peers using public IP addresses No persistent file system other than S3 Most addresses are dynamic No managed redundancy/restart No multicast IP No movement of VMs off high-traffic racks Expensive to create/destroy per test case
  • 20. EC2 and Apache Great platform for 'ready to use' machines Good for interop testing Need to automate machine update Need to improve the EC2 tooling Need to convince Amazon to give us lower cost S3/EC2 with lower QoS Hadoop, Tomcat, Geronimo…
  • 21. Problems for us farmers Power management Predictive disk failure management Load balancing for availability, power File management Billing Routing Security/isolation Managing machine images Diagnostics Evolution of datacentre hardware
  • 22. Feb 2008 Amazon Outage S3 and AWS suddenly started failing Intermittent, system wide, not visible to all Root cause: authentication service overloaded A Single Point of Failure will always find you <?xml version=&quot;1.0&quot; encoding=&quot;UTF-8&quot;?> <Error><Code>InternalError</Code> <Message>We encountered an internal error. Please try again.</Message> <RequestId>A2A7E5395E27DFBB</RequestId> <HostId>f691zulHNsUqonsZkjhIL/sGsn6K</HostId> </Error>

Editor's Notes

  • #2: 1/14/2004 this is a fast feather talk at apachecon 2008