High Performance Cloud Computing

Deepak Singh
Principal Product Manager
Via butteryflysha under a CC-BY license
Image: Simon Cockell under CC-BY
“High Performance” Computing
using a large number of computers at the same time for a single task
Image: pennstatelive
batch
tightly coupled
data intensive
4
1. Infrastructure
ec2-run-instances
instance types
t1.micro
standard (m1)
high memory (m2)
high CPU (c1)
elastic
programmable
cluster computing
MPI
Cluster Compute Instance
2 x Intel Xeon 5570
23 GB RAM
1.7 TB disk
10 Gigabit Ethernet
Placement
  Group
Placement
  group
LINPACK
Cores: 7040
Rmax: 41.82
Rpeak: 82.51
231
450
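The ratio of Rmax to Rpeak is the fraction of theoretical peak that the benchmark run sustained. A minimal sketch of that arithmetic using the two figures quoted on the slide; the function name is illustrative and not from the talk, and the ratio is unit-free as long as both values share a unit.

```python
def linpack_efficiency(r_max: float, r_peak: float) -> float:
    """Return sustained performance as a fraction of theoretical peak."""
    if r_peak <= 0:
        raise ValueError("r_peak must be positive")
    return r_max / r_peak


# Figures quoted on the slide (both in the same unit).
efficiency = linpack_efficiency(41.82, 82.51)
print(f"LINPACK efficiency: {efficiency:.1%}")
```

On these numbers the run sustained roughly half of the theoretical peak.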
WIEN2K Parallel Performance

H size 56,000 (25GB)
Runtime (16x8 processors)
Local (Infiniband): 3h:48
Cloud (10Gbps): 1h:30 ($40)

1200 atom unit cell; SCALAPACK+MPI diagonalization, matrix size 50k-100k

Credit: K. Jorissen, F. D. Vila, and J. J. Rehr (U. Washington)
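The two WIEN2K runtimes can be compared directly. A small sketch that converts the quoted times to minutes and takes their ratio; the helper name is hypothetical, and the slide does not say how the two environments differed beyond the interconnect, so this is only arithmetic on the quoted numbers.

```python
def to_minutes(runtime: str) -> int:
    """Convert a runtime written as 'Hh:MM' (for example '3h:48') to minutes."""
    hours, minutes = runtime.split("h:")
    return int(hours) * 60 + int(minutes)


local = to_minutes("3h:48")  # Local (Infiniband) run
cloud = to_minutes("1h:30")  # Cloud (10Gbps) run
ratio = local / cloud
print(f"local={local} min, cloud={cloud} min, ratio={ratio:.2f}x")
```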
HPC is evolving
2*Intel Xeon 5570
   22 GB RAM
   1.7 TB disk
2*NVidia M2050
2. Provision & Manage
AWS CloudFormation
bootstrap
chef/puppet
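# Pull in the cookbooks this recipe depends on.
include_recipe "packages"
include_recipe "ruby"
include_recipe "apache2"

# Install the distro package or the build dependencies for Passenger.
if platform?("centos","redhat")
  if dist_only?
    # just the gem, we'll install the apache module within apache2
    package "rubygem-passenger"
    return
  else
    package "httpd-devel"
  end
else
  %w{ apache2-prefork-dev libapr1-dev }.each do |pkg|
    package pkg do
      action :upgrade
    end
  end
end

# Install the Passenger gem at the version set in the node attributes.
gem_package "passenger" do
  version node[:passenger][:version]
end

# Build the Apache module; the piped newlines answer the installer's prompts.
# Skipped when the module file already exists.
execute "passenger_module" do
  command 'echo -en "\n\n\n\n" | passenger-install-apache2-module'
  creates node[:passenger][:module_path]
end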
familiar tools
Oracle Grid Engine
LSF
Moab/Torque
Condor
StackIQ Rocks+
combine worlds
MIT Starcluster
$ starcluster start mycluster
$ starcluster listclusters
http://www.bioteam.net/2011/03/dude-you-got-some-chef-in-my-starcluster/
30,472 cores
‘nuff said
Amazon
Elastic MapReduce
[Diagram labels: Input data (S3); Code; Elastic MapReduce; Name node; Elastic cluster; HDFS; Output (S3 + SimpleDB); Queries + BI via JDBC, Pig, Hive]
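For readers new to the programming model behind Elastic MapReduce, here is a toy single-process sketch of the map, shuffle and reduce phases. It assumes nothing about how jobs are actually submitted to the service; `map_reduce`, `count_words` and `total` are hypothetical names, and the input lines are placeholder text.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run mapper over each record, group values by key, then reduce each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)  # shuffle step: group values by key
    return {key: reducer(key, values) for key, values in groups.items()}


def count_words(line):
    """Mapper: emit (word, 1) for every whitespace-separated word."""
    for word in line.split():
        yield word.lower(), 1


def total(key, values):
    """Reducer: sum the counts collected for one key."""
    return sum(values)


lines = ["batch tightly coupled", "data intensive batch"]
counts = map_reduce(lines, count_words, total)
print(counts)
```

In a real cluster the map and reduce calls run on different nodes and the grouping happens over the network; the structure of the mapper and reducer stays the same.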
3. Applications
http://usegalaxy.org/cloud
http://cloudbiolinux.org/
mapreduce for
  genomics
http://bowtie-bio.sourceforge.net/crossbow/index.shtml
http://contrail-bio.sourceforge.net
http://bowtie-bio.sourceforge.net/myrna/index.shtml
4. People


Credit: Pieter Musterd under a CC-BY-NC-ND license
most valuable
removing barriers
[Diagram, built up over several slides]
Tasks, Instances
Tasks, Queue, Instances
Tasks, Queue, Instances: increase instance count
Tasks, Queue, Instances, Results, Store
Tasks, Queue, On-premise, Instances, Results, Store
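The Tasks, Queue, Instances, Results build-up reads as a work-queue pattern: tasks go into a queue, a pool of workers drains it, and results land in a shared store. A minimal local sketch of that idea, under loud assumptions: threads stand in for instances, an in-memory `queue.Queue` stands in for a hosted queue, and a dict stands in for the results store. `run_workers` and its parameters are hypothetical names, not anything shown in the talk.

```python
import queue
import threading


def run_workers(tasks, worker_count, handle):
    """Drain a task queue with worker_count workers and collect the results."""
    task_queue = queue.Queue()
    for task in tasks:
        task_queue.put(task)  # every task is enqueued before any worker starts

    results = {}  # stand-in for the results store
    lock = threading.Lock()

    def worker():
        while True:
            try:
                task = task_queue.get_nowait()
            except queue.Empty:
                return  # queue drained, this worker can stop
            outcome = handle(task)
            with lock:
                results[task] = outcome

    threads = [threading.Thread(target=worker) for _ in range(worker_count)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


# "Increase instance count" corresponds to raising worker_count.
squares = run_workers(range(10), worker_count=4, handle=lambda n: n * n)
print(sorted(squares.items()))
```

Because the workers only share the queue and the store, changing `worker_count` does not require touching the task code, which is the property the diagram appears to highlight.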
optimize for cost
on-demand
reserved
spot
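One way to reason about on-demand versus reserved capacity is a break-even calculation: if reserved capacity costs an upfront fee plus a lower hourly rate, it pays off once usage exceeds a certain number of hours. The sketch below assumes that pricing shape; the function name is illustrative and every price in it is HYPOTHETICAL, not an actual AWS rate. Spot pricing varies over time and is not modeled here.

```python
def break_even_hours(upfront: float, reserved_rate: float, on_demand_rate: float) -> float:
    """Hours of use at which reserved (upfront + hourly) matches on-demand cost."""
    if on_demand_rate <= reserved_rate:
        raise ValueError("reserved hourly rate must be below the on-demand rate")
    # Solve upfront + hours * reserved_rate == hours * on_demand_rate for hours.
    return upfront / (on_demand_rate - reserved_rate)


# HYPOTHETICAL prices, chosen only to make the arithmetic easy to follow.
hours = break_even_hours(upfront=1000.0, reserved_rate=0.50, on_demand_rate=1.00)
print(f"Reserved capacity pays off after {hours:.0f} hours of use")
```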
http://aws.amazon.com/ec2/spot-and-science/
Credit: Angel Pizarro, U. Penn
NASA JPL
Stochastic Dual Dynamic Programming




44,000 CPU hrs in Oct 2010
http://aws.amazon.com/solutions/case-studies/psr/
Credit: Angel Pizarro, U. Penn
4
1. Infrastructure
2. Provision & Manage
3. Applications
4. People
deesingh@amazon.com
Twitter: @mndoci
http://slideshare.net/mndoci
http://mndoci.github.com




        Inspiration and ideas from
        Matt Wood, James Hamilton
               & Larry Lessig

Credit: Oberazzi under a CC-BY-NC-SA license
