Lessons from the Field: Azure for Science
Rob Gillen
gillenre@ornl.gov
rob.gillenfamily.net
@argodev

Agenda
Introductions
  Why is ORNL looking at Cloud Computing
  Azure in 5 minutes
Post-Processing and Data Distribution in the Cloud
  Using Cloud Computing for Post-Processing
  Data hosting/distribution
Lessons (being) Learned
  General Lessons
  Performance

Oak Ridge National Laboratory is DOE’s largest science and energy lab
World’s most powerful open scientific computing facility
Nation’s largest concentration of open source materials research
$1.6B budget
4,350 employees
3,900 research guests annually
$350 million invested in modernization
Nation’s most diverse energy portfolio
Operating the world’s most intense pulsed neutron source
Managing the billion-dollar U.S. ITER project

Delivering science and technology
Ultrascale computing
Energy technologies
Bioenergy
ITER
Neutron sciences
Climate
Materials at the nanoscale
National security
Nuclear energy

Ultrascale Scientific Computing
Leadership Computing Facility:
World’s most powerful open scientific computing facility
Peak speed of 2.33 petaflops (> two thousand trillion calculations/sec)
18,688 nodes, 224,526 compute cores, 299 TB RAM, 10,000 TB Disk
4,352 ft² floor space
Exascale system by the end of the next decade
Focus on computationally intensive projects of large scale and high scientific impact
Addressing key science and technology issues
Climate
Fusion
Materials
Bioenergy
Home of the 1st and 3rd fastest supercomputers in the world.
The world’s most powerful system for open science

Then Why Look at Cloud Computing???

Science Takes Different Forms
  Tight Simulations
  Data-Parallelized
  Embarrassingly Parallel
Dearth of Mid-Range Assets
  256-1,000 cores
  1 of many possible solutions
Scaling Issues
  Power Consumption
  Programming Struggles
  Fault-Tolerance
Forward-Looking
  Next-Generation Problems
  Next-Generation Researchers

Types of Clouds
[Figure: three columns, Private (On-Premise), Infrastructure (as a Service), and Platform (as a Service). Each column stacks the same layers: Applications, Runtimes, Security & Integration, Databases, Servers, Virtualization, Server HW, Storage, Networking. Brackets labeled "You manage" and "Managed by vendor" mark the split of responsibility in each column.]

Windows Azure Platform
[Figure: platform overview. Labels shown: Application Services, Frameworks, Security, Connectivity, Data, “Dublin”, “Velocity”, “Geneva”, Access Control, Project “Sydney”, Service Bus, SQL Azure Data Sync, Compute, Storage (Table Storage, Blob Storage, Queue, Drive, Content Delivery Network).]

Windows Azure Compute
Development, service hosting, & management environment
  .NET, Java, PHP, Python, Ruby, native code (C/C++, Win32, etc.)
  ASP.NET providers, FastCGI, memcached, MySQL, Tomcat
  Full-trust – supports standard languages and APIs
  Secure certificate store
  Management APIs, and logging and diagnostics systems
Multiple roles – Web, Worker, Virtual Machine (VHD)
Multiple VM sizes
  1.6 GHz CPU x64, 1.75GB RAM, 100Mbps network, 250GB volatile storage
  Small (1X), Medium (2X), Large (4X), X-Large (8X)
In-place rolling upgrades, organized by upgrade domains
  Walk each upgrade domain one at a time
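
As a rough illustration of the sizing and upgrade bullets above, the sketch below scales the 1X baseline by each size multiplier and groups role instances by upgrade domain so they can be walked one domain at a time. It assumes the multipliers apply uniformly to cores, RAM, network, and disk, which the slide implies but does not state. `VmSpec`, `vm_spec`, and `upgrade_batches` are hypothetical names for illustration, not part of any Azure API.

```python
from dataclasses import dataclass
from typing import Dict, List

# Baseline "Small (1X)" figures as listed on the slide.
BASE_RAM_GB = 1.75
BASE_NETWORK_MBPS = 100
BASE_DISK_GB = 250

# Size multipliers as listed on the slide.
SIZE_MULTIPLIER = {"Small": 1, "Medium": 2, "Large": 4, "X-Large": 8}


@dataclass(frozen=True)
class VmSpec:
    cores: int          # number of 1.6 GHz x64 cores (assumed to equal the multiplier)
    ram_gb: float
    network_mbps: int
    disk_gb: int        # volatile local storage


def vm_spec(size: str) -> VmSpec:
    """Scale the 1X baseline by the size multiplier (uniform scaling is an assumption)."""
    m = SIZE_MULTIPLIER[size]
    return VmSpec(
        cores=m,
        ram_gb=BASE_RAM_GB * m,
        network_mbps=BASE_NETWORK_MBPS * m,
        disk_gb=BASE_DISK_GB * m,
    )


def upgrade_batches(instance_domains: Dict[str, int]) -> List[List[str]]:
    """Group instance names by upgrade domain, in domain order.

    Upgrading one batch at a time leaves the other domains serving traffic.
    """
    batches: Dict[int, List[str]] = {}
    for name, domain in instance_domains.items():
        batches.setdefault(domain, []).append(name)
    return [sorted(batches[d]) for d in sorted(batches)]


if __name__ == "__main__":
    print(vm_spec("Large"))
    print(upgrade_batches({"web_0": 0, "web_1": 1, "worker_0": 0, "worker_1": 1}))
```

With two upgrade domains, half of the instances stay up during each step of the walk.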

Windows Azure Diagnostics
Configurable trace, performance counter, Windows event log, IIS log & file buffering
  Local data buffering quota management
  Query & modify from the cloud and from the desktop per role instance
Transfer to storage scheduled & on-demand
  Filter by data type, verbosity & time range

Windows Azure Storage
Rich data abstractions – tables, blobs, queues, drives, CDN
  Capacity (100TB), throughput (100MB/sec), transactions (1K req/sec)
High accessibility
  Supports geo-location
  Language & platform agnostic REST APIs
  URL: http://<account>.<store>.core.windows.net
  Client libraries for .NET, Java, PHP, etc.
High durability – data is replicated 3 times within a cluster, and (Feb 2010) across datacenters
High scalability – data is automatically partitioned and load balanced across servers
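
To make the URL pattern and the quoted limits concrete, here is a minimal sketch that builds a service endpoint from an account name and estimates how long a transfer would take at the stated throughput. The function names are hypothetical, the account name in the example is made up, and the arithmetic assumes decimal units (1 TB = 10^12 bytes, 1 MB = 10^6 bytes) and sustained throughput at the quoted figure.

```python
# Store names that fill the <store> slot in the URL pattern.
KNOWN_STORES = ("blob", "table", "queue")

TB = 10**12  # decimal terabyte (assumption about the slide's units)
MB = 10**6   # decimal megabyte (assumption about the slide's units)


def storage_endpoint(account: str, store: str) -> str:
    """Fill in http://<account>.<store>.core.windows.net for a given service."""
    if store not in KNOWN_STORES:
        raise ValueError(f"unknown store: {store!r}")
    return f"http://{account}.{store}.core.windows.net"


def transfer_seconds(size_bytes: int, throughput_bytes_per_sec: int) -> float:
    """Time to move size_bytes at a constant throughput."""
    return size_bytes / throughput_bytes_per_sec


if __name__ == "__main__":
    print(storage_endpoint("myaccount", "blob"))
    seconds = transfer_seconds(100 * TB, 100 * MB)
    print(f"{seconds:.0f} s, about {seconds / 86400:.1f} days")
```

Under those assumptions, filling the full 100TB capacity at 100MB/sec takes 1,000,000 seconds, a bit under 12 days, which is worth keeping in mind when planning bulk uploads.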

Windows Azure Table Storage
Designed for structured data, not relational data
Data definition is part of the application
  A Table is a set of Entities (records)
  An Entity is a set of Properties (fields)
No fixed schema
  Each property is stored as a <name, typed value> pair
  Two entities within the same table can have different properties
  No schema is enforced
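
The data model above can be mimicked in a few lines. This is an in-memory sketch only, not the Table Storage client API: `SketchTable`, its `key` argument, and the sample property names are hypothetical. It shows each property stored as a <name, typed value> pair and two entities in one table carrying different properties.

```python
from typing import Any, Dict, Set, Tuple


class SketchTable:
    """A table is a set of entities; an entity is a set of properties."""

    def __init__(self, name: str) -> None:
        self.name = name
        # key -> {property name -> (type name, value)}
        self.entities: Dict[str, Dict[str, Tuple[str, Any]]] = {}

    def insert(self, key: str, **properties: Any) -> None:
        # No schema is checked: whatever properties arrive are stored as-is,
        # each alongside the name of its Python type.
        self.entities[key] = {
            name: (type(value).__name__, value)
            for name, value in properties.items()
        }

    def property_names(self, key: str) -> Set[str]:
        return set(self.entities[key])


if __name__ == "__main__":
    runs = SketchTable("runs")
    runs.insert("run-001", model="climate", cores=256)
    runs.insert("run-002", model="fusion", wall_hours=3.5, notes="rerun")
    print(runs.property_names("run-001"))
    print(runs.property_names("run-002"))
```

Because the data definition lives in the application, any consistency between entities has to be enforced by the code that writes them.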

Windows Azure Blob Storage
Storage for large, named files plus their metadata
Block Blob
  Targeted at streaming workloads
  Each blob consists of a sequence of blocks
  Each block is identified by a Block ID
  Size limit 200GB per blob
Page Blob
  Targeted at random read/write workloads
  Each blob consists of an array of pages
  Each page is identified by its offset from the start of the blob
  Size limit 1TB per blob
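
The two addressing schemes can be sketched as follows. This is an illustration of the idea, not the storage service's API: the block ID format (a base64-encoded counter), the tiny block size in the example, and the 512-byte `PAGE_SIZE` are assumptions chosen for the sketch.

```python
import base64
from typing import List, Tuple

PAGE_SIZE = 512  # bytes per page (assumed for this sketch)


def split_into_blocks(data: bytes, block_size: int) -> List[Tuple[str, bytes]]:
    """Block blob view: cut data into a sequence of (block ID, block) pairs."""
    blocks = []
    for index, start in enumerate(range(0, len(data), block_size)):
        # Hypothetical ID scheme: base64 of a zero-padded counter.
        block_id = base64.b64encode(f"{index:06d}".encode()).decode()
        blocks.append((block_id, data[start:start + block_size]))
    return blocks


def assemble(blocks: List[Tuple[str, bytes]], block_ids: List[str]) -> bytes:
    """Rebuild the blob from an ordered list of block IDs."""
    by_id = dict(blocks)
    return b"".join(by_id[block_id] for block_id in block_ids)


def page_offsets(start: int, length: int) -> List[int]:
    """Page blob view: offsets of every page touched by a write of
    `length` bytes (length > 0) beginning at byte `start`."""
    first = (start // PAGE_SIZE) * PAGE_SIZE
    last = ((start + length - 1) // PAGE_SIZE) * PAGE_SIZE
    return list(range(first, last + PAGE_SIZE, PAGE_SIZE))


if __name__ == "__main__":
    blocks = split_into_blocks(b"abcdefghij", block_size=4)
    ids = [block_id for block_id, _ in blocks]
    print(assemble(blocks, ids))
    print(page_offsets(1000, 100))
```

Blocks suit streaming because they can be sent in sequence and committed as an ordered list, while offset-addressed pages suit random reads and writes because any range maps directly to the pages it touches.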

Windows Azure Queue
Performance-efficient, highly available, and provides reliable message delivery
  Asynchronous work dispatch
  Inter-role communication
  Polling-based model; best-effort FIFO data structure
Queue operations
  Create Queue
  Delete Queue
  List Queues
  Get/Set Queue Metadata
Message operations
  Add Message
  Get Message(s)
  Peek Message(s)
  Delete Message
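
A small in-memory model helps show how the message operations fit a polling worker. This is a sketch, not the queue service's API: `SketchQueue`, `drain`, and the injectable `clock` are hypothetical, and the visibility timeout (a fetched message is hidden for a while and reappears if it is never deleted) is an assumption about how reliable delivery is achieved.

```python
import itertools
import time
from typing import Callable, List, Optional, Tuple


class SketchQueue:
    def __init__(self, visibility_timeout: float = 30.0,
                 clock: Optional[Callable[[], float]] = None) -> None:
        self._messages: List[list] = []  # each entry: [id, body, visible_at]
        self._ids = itertools.count(1)
        self._timeout = visibility_timeout
        self._clock = clock or time.monotonic

    def add_message(self, body: str) -> int:
        message_id = next(self._ids)
        self._messages.append([message_id, body, float("-inf")])
        return message_id

    def peek_message(self) -> Optional[Tuple[int, str]]:
        """Look at the next visible message without hiding it."""
        now = self._clock()
        for message_id, body, visible_at in self._messages:
            if visible_at <= now:
                return message_id, body
        return None

    def get_message(self) -> Optional[Tuple[int, str]]:
        """Fetch the next visible message and hide it for the timeout."""
        now = self._clock()
        for entry in self._messages:
            if entry[2] <= now:
                entry[2] = now + self._timeout
                return entry[0], entry[1]
        return None

    def delete_message(self, message_id: int) -> None:
        self._messages = [m for m in self._messages if m[0] != message_id]


def drain(queue: SketchQueue, handler: Callable[[str], None]) -> int:
    """Poll until no message is visible; delete each one only after handling."""
    handled = 0
    while True:
        message = queue.get_message()
        if message is None:
            return handled
        message_id, body = message
        handler(body)
        queue.delete_message(message_id)
        handled += 1


if __name__ == "__main__":
    q = SketchQueue()
    q.add_message("post-process file 1")
    q.add_message("post-process file 2")
    print(drain(q, print))
```

In this model a worker that crashes before calling `delete_message` leaves the message to reappear later, which is one reason ordering can only be best-effort FIFO and handlers should tolerate seeing a message twice.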

Windows Azure Drive
Provides a durable NTFS volume for Windows Azure applications to use
  Use existing NTFS APIs to access a durable drive
  Durability and survival of data on application failover
  Enables migrating existing NTFS applications to the cloud
  Drives can be up to 1TB; a VM can dynamically mount up to 8 drives
A Windows Azure Drive is a Page Blob
  Example: mount Page Blob as X:\
  http://<account>.blob.core.windows.net/<container>/<blob>
  All writes to drive are made durable to the Page Blob
  Drive made durable through standard Page Blob replication

Windows Azure Content Delivery Network
Provides high-bandwidth global blob content delivery
  18 locations globally (US, Europe, Asia, Australia and South America), and growing
Blob service URL vs. CDN URL
  Blob URL: http://<account>.blob.core.windows.net/
  CDN URL: http://<guid>.vo.msecnd.net/
  Support for custom domain names
Access details
  Blobs are cached in CDN until the TTL passes
  Use per-blob HTTP Cache-Control policy for TTL (new)
  CDN provides only anonymous HTTP access
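
The caching rule above (serve from the edge until the TTL passes, with the TTL taken from the blob's Cache-Control header) can be sketched like this. It is a toy model under stated assumptions: `EdgeCache`, `to_cdn_url`, the origin callback, the host names in the example, and the choice not to cache when no `max-age` is present are all hypothetical.

```python
import re
from typing import Callable, Dict, Tuple
from urllib.parse import urlsplit, urlunsplit


def to_cdn_url(blob_url: str, cdn_host: str) -> str:
    """Swap the blob service host for the CDN host, keeping the path."""
    parts = urlsplit(blob_url)
    return urlunsplit((parts.scheme, cdn_host, parts.path, parts.query, parts.fragment))


def max_age_seconds(cache_control: str, default: int = 0) -> int:
    """Read max-age from a Cache-Control header value."""
    match = re.search(r"max-age=(\d+)", cache_control)
    return int(match.group(1)) if match else default


class EdgeCache:
    def __init__(self, origin_fetch: Callable[[str], Tuple[bytes, str]],
                 clock: Callable[[], float]) -> None:
        self._fetch = origin_fetch  # path -> (body, Cache-Control value)
        self._clock = clock
        self._entries: Dict[str, Tuple[bytes, float]] = {}  # path -> (body, expires_at)
        self.origin_hits = 0

    def get(self, path: str) -> bytes:
        now = self._clock()
        cached = self._entries.get(path)
        if cached is not None and cached[1] > now:
            return cached[0]  # still within TTL: serve from the edge
        body, cache_control = self._fetch(path)
        self.origin_hits += 1
        self._entries[path] = (body, now + max_age_seconds(cache_control))
        return body


if __name__ == "__main__":
    print(to_cdn_url("http://myaccount.blob.core.windows.net/images/logo.png",
                     "az1234.vo.msecnd.net"))
```

A longer `max-age` cuts origin traffic but delays how quickly an updated blob becomes visible at the edge, so the per-blob policy is a trade-off between freshness and load.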

Tenets of Internet-Scale Application Architecture
Design
Horizontal scaling
Service-oriented composition

Azure: Lessons From The Field