SlideShare a Scribd company logo
PowerAI
Mandie Quartly Ph.D.
WW lead, Machine Learning & High Performance Analytics,
OpenPOWER ISV Ecosystem
@mandieq
Artificial
Intelligence &
Cognitive
Applications
Big
Data
Machine
Learning
Deep
Learning
(Neural Nets)
Cognitive landscape: terms and relationship
Core concepts: Training & Inference
Training
• Data intensive: historical
data sets
• Compute intensive: 100%
accelerated
• Develop a model for use
on the edge as inference
Inference
• Enables the computer to
act in real time
• Low Power
• Out at the edge
Deep Learning
Power AI introduction
Industry / Function AI Revolution
Automotive Assisted / Autonomous Driving, Navigation, Voice Interface
Defense Computer Vision for Drones, Surveillance, Satellites
Manufacturing and Robotics Computer Vision for Manufacturing, Data Streams for Instrumented Tools
Healthcare Diagnostic Image Recognition (CAT Scan/MRI), Predictive Diagnostics
Finance Fraud Detection, Risk Analytics, Improved UX (Natural Language Processing)
Energy Seismic Resource Analytics
Education Adaptive Learning, Real Time Translation
Media and Entertainment Augmented Reality, Intelligent Game Agents, Precision Recommendation
Supply Chain Dynamic Routing Optimization
Consumer / Mobile App Speech Recognition, Natural Language, Content Tagging and Recognition
Retail Computer Vision for Shopping Carts, Precision Recommendation
Pharma / Life Sciences Drug Discovery, Genome Analytics, Image Classification (molecules / whole systems)
Cross-industry benefits of PowerAI
• For Marketing executives:
Better understanding of client perspectives, recommendations, improve
interaction
• For IT Executives:
Enterprise class support, better price performance and a shared infrastructure
• For Data Scientists and Developers:
NVLink means new opportunities for innovation, faster access to larger
memory, execute larger training models and more iteration … refine, improve
• Research Scientists:
Explore new ways to understand existing datasets
Gartner Views…
9
Introducing PowerAI:
Get Started Fast with Deep Learning
Enabled by High Performance Computing Infrastructure
Package of Pre-Compiled
Major Deep Learning
Frameworks
Easy to install & get started
with Deep Learning with
Enterprise-Class Support
Optimised for performance
to take advantage of NVLink
Machine Learning / Deep Learning Software Stack
Healthcare
Diagnosis / Treatment
Banking
Fraud analysis
Pattern
Recognition
Self-driving Cars
Retail Customer
Service Agents
Vision / Image
processing
NLP
On-Prem & Cloud
Scale-out servers
Cloud Deployment
DL Frameworks
CAFFE, Torch,
TensorFlow, …
Industry
Solutions
MLDL Building
Blocks
Software Libraries
Frameworks, Services
Infrastructure
...
Knowledge
Representation
Data Layer Accelerated DBsNoSQL DBs Streams
Machine
Learning
Hadoop / HDFS
Accelerators
GPUs, FPGAs, CAPI Flash, DL
Accelerators
Unique Power Technologies
NVLink, CAPI
ML Libraries
Spark ML, Spark R,
SciKit, Numpy,
SystemML, Mahout
Cognitive Platforms
Watson
Cloud Services
Amazon, Microsoft,
Google, IBM Cloud
Distributed
Computing
Hadoop, Spark, MPI
...
Legacy DBs
Math Libraries: OpenBLAS/MASS, ATLAS, ESSL, LAPACK, cuDNN, cuBLAS, cuSparse, …
PowerAI
12
CrowdedSpace–WithLotsofStartupActivity
d
PowerAI
Partners
PowerAI platform
Deep Learning
Frameworks
Accelerated
Servers and
Infrastructure for
Scaling
Spectrum Scale:
High-Speed Parallel File
System
Scale to
Cloud
Cluster of NVLink Servers
Coming Soon
CAFFE NVCaffe TorchIBMCaffe
TensorFlow
OpenBLAS Bazel
Theano Chainer
Distributed
Frameworks
Supporting
libraries
DL4J
NCCL DIGITS
Who is using PowerAI?
PowerAI takes advantage of NVLink between
POWER8 & Tesla P100 to increase system
bandwidth
P100
GPU
POWER8
CPU
GPU
Memory
System
Memory
P100
GPU
80 GB/s
GPU
Memory
NVLink
115 GB/s
P100
GPU
POWER8
CPU
GPU
Memory
System
Memory
P100
GPU
80 GB/s
GPU
Memory
NVLink
115 GB/s
S822LC for HPC: recommended configuration for PowerAI
2 Socket, 4 GPU System with NVLink
Required:
• 2 POWER8 10 Core CPUs
• 4 NVIDIA P100 ”Pascal” GPUs
• 256 GB System Memory
• 2 SSD storage devices
• High-speed interconnect
(IB or Ethernet, depending on
infrastructure)
Optional:
• Up to 1 TB System Memory
• PCIe attached NVMe storage (aka Minsky)
TensorFlow on Tesla P100: PowerAI is 30% faster
(larger is better)
S822LC - Optimized E5-2640v4
0
50
100
150
200
250
300
Images Processed (Images/Sec)
IBM S822LC 20-cores 2.86GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs / Ubuntu 16.04 / CUDA
8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch)
Intel Broadwell E5-2640v4 20-core 2.6 GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs/ Ubuntu 16.04 /
CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch)
Deep Learning on Minsky
x86 with 4x M40 / PCIe Power8 with 4x P100 / NVLink
0
20
40
60
80
100
120
140
AlexNet using Caffe
Time to top-1, 50% Accuracy
(Lower is better)
x86 with 8x M40 / PCIe Power8 with 4x P100 / NVLink
0:00
1:12
2:24
3:36
4:48
6:00
7:12
8:24
VGGNet using BVLC Caffe vs IBM Caffe
Time to Top-1 50% accuracy:
(Lower is better)
S822LC/HPC with 4 Tesla P100
Tesla GPUs is 24% Faster than
8x Tesla M40 GPUs
S822LC/HPC with 4 Tesla
P100 GPUs is 2.2x Faster
than 4x Tesla M40 GPUs
IBM S822LC 20-cores 2.86GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs / Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / IBM
Caffe 1.0.0-rc3 / Imagenet Data
Intel Broadwell E5-2640v4 20-core 2.6 GHz 512GB memory / 4 or 8 NVIDIA Tesla M40 GPUs / Ubuntu 16.04 / CUDA 8.0.44 /
cuDNN 5.1 / BVLC Caffe 1.0.0-rc3 / Imagenet Data
Trainingtimes(mins)
NVLink and P100 advantage:
reducing communication time, incorporating the fastest GPU for deep learning
• NVLink reduces communication time and overhead
• Data gets from GPU-GPU, Memory-GPU faster, for shorter training times
x86 based
GPU system
POWER8 +
Tesla
P100+NVLink
ImageNet / Alexnet: Minibatch size = 128
170 ms
78 ms
IBM advantage: data communication
and GPU performance
Getting started
http://ibm.biz/powerai
PowerAI Simplifies Access and Installation
• Tested, binary builds of common Deep Learning
frameworks for ease of implementation
• Simple, complete installation process
documented on ibm.biz/powerai
• Future focus on optimizing specific packages for
POWER: OpenBLAS, NVIDIA Caffe, TensorFlow,
and Torch
PowerAI
OS Ubuntu 16.04
CUDA 8.0
cuDNN 5.1
Built w/ MASS Yes
OpenBLAS 0.2.19
Caffe 1.0 rc3
NVIDIA Caffe
0.14.5 +
0.15.3
IBM Caffe 1.0 rc3
Chainer 1.18
NVIDIA DIGITS 5
Torch 7
Theano 0.8.2
TensorFlow 0.12.0
GPU 4 x P100
Base System S822LC/HPC
Easy Installation… Up and running...
Software repository Setup
The Deep Learning packages are published as an Ubuntu package that
sets up an installation repository on the local machine. The repository
can be enabled as follows:
Download the latest mldl-repo-local .deb file from
https://download.boulder.ibm.com/ibmdl/pub/software/server/mldl/
Install the repository package:
$ sudo dpkg -i mldl-repo-local*.deb
Update the package cache
$ sudo apt-get update
Installing all frameworks at once
All the Deep Learning frameworks can be installed at once using
the power-mldl meta-package:
$ sudo apt-get install power-mldl
Installing frameworks individually
The Deep Learning frameworks can be installed individually if preferred.
The framework packages are:
● caffe-bvlc - Berkeley Vision and Learning Center (BVLC) upstream
Caffe, v1.0.0rc3
● caffe-ibm - IBM Optimized version of BVLC Caffe, v1.0.0rc3
● caffe-nv - NVIDIA fork of Caffe, v0.15.13 and v0.14.5
● chainer - Chainer, v1.18.0
● digits - DIGITS, v5.0.0-rc.1
● tensorflow - Google TensorFlow, v0.12.0
● theano - Theano, v0.8.2
● torch - Torch, v7
Each can be installed with:
$ sudo apt-get install <framework>
Tuning recommendations
Recommended settings for optimal Deep Learning performance on the
S822LC for High Performance Computing are:
Enable Performance Governor
$ sudo apt-get install linux-tools-common
cpufrequtils
$ sudo cpupower -c all frequency-set -g
performance
Enable GPU persistence mode
Use nvidia-persistenced (
http://docs.nvidia.com/deploy/driver-persistence/index
.html
) or
$ sudo nvidia-smi -pm ENABLED
Set GPU memory and graphics clocks
$ sudo nvidia-smi -ac 715,1480
For TensorFlow, set the SMT mode
$ sudo ppc64_64 --smt=2
PowerAI in
NIMBIX Cloud
https://power.jarvice.com/
Get started today
● Things are moving VERY quickly in this space
● Where could you take advantage of AI capabilities?
● Find out more at: ibm.biz/powerai
Mandie Quartly
@mandieq
25
Copyright © 2016 by International Business Machines Corporation. All rights reserved.
No part of this document may be reproduced or transmitted in any form without written permission from IBM Corporation.
Product data has been reviewed for accuracy as of the date of initial publication. Product data is subject to change without notice. This document could include technical inaccuracies or
typographical errors. IBM may make improvements and/or changes in the product(s) and/or program(s) described herein at any time without notice. Any statements regarding IBM's future
direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. References in this document to IBM products, programs, or services does
not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. Any reference to an IBM Program Product in
this document is not intended to state or imply that only that program product may be used. Any functionally equivalent program, that does not infringe IBM's intellectually property rights,
may be used instead.
THE INFORMATION PROVIDED IN THIS DOCUMENT IS DISTRIBUTED "AS IS" WITHOUT ANY WARRANTY, EITHER OR IMPLIED. IBM LY DISCLAIMS ANY WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NONINFRINGEMENT. IBM shall have no responsibility to update this information. IBM products are warranted, if at
all, according to the terms and conditions of the agreements (e.g., IBM Customer Agreement, Statement of Limited Warranty, International Program License Agreement, etc.) under which
they are provided. Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM
has not tested those products in connection with this publication and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. IBM
makes no representations or warranties, ed or implied, regarding non-IBM products and services.
The provision of the information contained herein is not intended to, and does not, grant any right or license under any IBM patents or copyrights. Inquiries regarding patent or copyright
licenses should be made, in writing, to:
IBM Director of Licensing
IBM Corporation
North Castle Drive
Armonk, NY 1 0504- 785
U.S.A.
Legal notices
|
2
26
Information and trademarks
IBM, the IBM logo, ibm.com, IBM System Storage, IBM Spectrum Storage, IBM Spectrum Control, IBM Spectrum Protect, IBM Spectrum Archive, IBM Spectrum Virtualize, IBM Spectrum Scale, IBM Spectrum Accelerate, Softlayer, and XIV are
trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at http://www.ibm.com/legal/copytrade.shtml
The following are trademarks or registered trademarks of other companies.
Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries.
IT Infrastructure Library is a Registered Trade Mark of AXELOS Limited.
Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other
countries.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both.
Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates.
Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States, other countries, or both and is used under license therefrom.
ITIL is a Registered Trade Mark of AXELOS Limited.
UNIX is a registered trademark of The Open Group in the United States and other countries.
* All other products may be trademarks or registered trademarks of their respective companies.
Notes:
Performance is in Internal Throughput Rate (ITR) ratio based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput that any user will experience will vary depending upon considerations
such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput improvements
equivalent to the performance ratios stated here.
All customer examples cited or described in this presentation are presented as illustrations of the manner in which some customers have used IBM products and the results they may have achieved. Actual environmental costs and performance
characteristics will vary depending on individual customer configurations and conditions.
This publication was produced in the United States. IBM may not offer the products, services or features discussed in this document in other countries, and the information may be subject to change without notice. Consult your local IBM business
contact for information on the product or services available in your area.
All statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Information about non-IBM products is obtained from the manufacturers of those products or their published announcements. IBM has not tested those products and cannot confirm the performance, compatibility, or any other claims related to non-IBM
products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products.
Prices subject to change without notice. Contact your IBM representative or Business Partner for the most current pricing in your geography.
This presentation and the claims outlined in it were reviewed for compliance with US law. Adaptations of these claims for use in other geographies must be reviewed
by the local country counsel for compliance with local laws.
|
2
27
Special notices
This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in other countries, and the information is
subject to change without notice. Consult your local IBM business contact for information on the IBM offerings available in your area.
Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions on the capabilities of non-IBM products
should be addressed to the suppliers of those products.
IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give you any license to these patents. Send
license inquires, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY 10504-1785 USA.
All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or guarantees either expressed or implied.
All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the results that may be achieved. Actual
environmental costs and performance characteristics will vary depending on individual client configurations and conditions.
IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions worldwide to qualified commercial and
government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment type and options, and may vary by country. Other restrictions may apply. Rates
and offerings are subject to change, extension or withdrawal without notice.
IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies.
All prices shown are IBM's United States suggested list prices and are subject to change without notice; reseller prices may vary.
IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply.
Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are dependent on many factors including system
hardware configuration and software design and configuration. Some measurements quoted in this document may have been made on development-level systems. There is no guarantee
these measurements will be the same on generally-available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this
document should verify the applicable data for their specific environment.
|
2

More Related Content

PDF
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
Michael Gschwind
 
PDF
Intel's Machine Learning Strategy
inside-BigData.com
 
PPTX
Deep Learning
Büşra İçöz
 
PDF
Ibm watson - who what why
Rick Bouter
 
PPTX
Innovation with ai at scale on the edge vt sept 2019 v0
Ganesan Narayanasamy
 
PDF
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
Willy Marroquin (WillyDevNET)
 
PDF
Near Data Computing Architectures: Opportunities and Challenges for Apache Spark
Ahsan Javed Awan
 
PDF
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
mlaij
 
Gschwind, PowerAI: A Co-Optimized Software Stack for AI on Power
Michael Gschwind
 
Intel's Machine Learning Strategy
inside-BigData.com
 
Deep Learning
Büşra İçöz
 
Ibm watson - who what why
Rick Bouter
 
Innovation with ai at scale on the edge vt sept 2019 v0
Ganesan Narayanasamy
 
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
Willy Marroquin (WillyDevNET)
 
Near Data Computing Architectures: Opportunities and Challenges for Apache Spark
Ahsan Javed Awan
 
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
mlaij
 

Similar to Power AI introduction (20)

PPTX
Introduction to PowerAI - The Enterprise AI Platform
Indrajit Poddar
 
PDF
Transparent Hardware Acceleration for Deep Learning
Indrajit Poddar
 
PDF
BSC LMS DDL
Ganesan Narayanasamy
 
PPT
Enabling a hardware accelerated deep learning data science experience for Apa...
DataWorks Summit
 
PPTX
OpenPOWER and IBM AI overview
Ganesan Narayanasamy
 
PPTX
Introduction to Machine Learning on IBM Power Systems
David Spurway
 
PPTX
PowerAI Deep dive
Ganesan Narayanasamy
 
PDF
AI/Cloud Technology access
Ganesan Narayanasamy
 
PPT
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Indrajit Poddar
 
PDF
IBM Cloud Paris Meetup - 20190520 - IA & Power
IBM France Lab
 
PPTX
IBM Power Systems Update 2Q17
David Spurway
 
PDF
IBM Cloud Paris 20180517 - La solution Power AI
IBM France Lab
 
PDF
Enabling a hardware accelerated deep learning data science experience for Apa...
Indrajit Poddar
 
PPTX
2018 bsc power9 and power ai
Ganesan Narayanasamy
 
PPTX
PowerAI Deep Dive ( key points )
Paulo Sergio Lemes Queiroz
 
PPTX
Ac922 watson 180208 v1
IBM Sverige
 
PPTX
Innovations using PowerAI
Ganesan Narayanasamy
 
PDF
OpenPOWER/POWER9 Webinar from MIT and IBM
Ganesan Narayanasamy
 
PPTX
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Indrajit Poddar
 
PPTX
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
DataWorks Summit/Hadoop Summit
 
Introduction to PowerAI - The Enterprise AI Platform
Indrajit Poddar
 
Transparent Hardware Acceleration for Deep Learning
Indrajit Poddar
 
BSC LMS DDL
Ganesan Narayanasamy
 
Enabling a hardware accelerated deep learning data science experience for Apa...
DataWorks Summit
 
OpenPOWER and IBM AI overview
Ganesan Narayanasamy
 
Introduction to Machine Learning on IBM Power Systems
David Spurway
 
PowerAI Deep dive
Ganesan Narayanasamy
 
AI/Cloud Technology access
Ganesan Narayanasamy
 
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Indrajit Poddar
 
IBM Cloud Paris Meetup - 20190520 - IA & Power
IBM France Lab
 
IBM Power Systems Update 2Q17
David Spurway
 
IBM Cloud Paris 20180517 - La solution Power AI
IBM France Lab
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Indrajit Poddar
 
2018 bsc power9 and power ai
Ganesan Narayanasamy
 
PowerAI Deep Dive ( key points )
Paulo Sergio Lemes Queiroz
 
Ac922 watson 180208 v1
IBM Sverige
 
Innovations using PowerAI
Ganesan Narayanasamy
 
OpenPOWER/POWER9 Webinar from MIT and IBM
Ganesan Narayanasamy
 
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Indrajit Poddar
 
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
DataWorks Summit/Hadoop Summit
 
Ad

Recently uploaded (20)

PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
PDF
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
PDF
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Architecture of the Future (09152021)
EdwardMeyman
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PPTX
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 
Advances in Ultra High Voltage (UHV) Transmission and Distribution Systems.pdf
Nabajyoti Banik
 
Presentation about Hardware and Software in Computer
snehamodhawadiya
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Trying to figure out MCP by actually building an app from scratch with open s...
Julien SIMON
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Architecture of the Future (09152021)
EdwardMeyman
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
The-Ethical-Hackers-Imperative-Safeguarding-the-Digital-Frontier.pptx
sujalchauhan1305
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Ad

Power AI introduction

  • 1. PowerAI Mandie Quartly Ph.D. WW lead, Machine Learning & High Performance Analytics, OpenPOWER ISV Ecosystem @mandieq
  • 3. Core concepts: Training & Inference Training • Data intensive: historical data sets • Compute intensive: 100% accelerated • Develop a model for use on the edge as inference Inference • Enables the computer to act in real time • Low Power • Out at the edge
  • 6. Industry / Function AI Revolution Automotive Assisted / Autonomous Driving, Navigation, Voice Interface Defense Computer Vision for Drones, Surveillance, Satellites Manufacturing and Robotics Computer Vision for Manufacturing, Data Streams for Instrumented Tools Healthcare Diagnostic Image Recognition (CAT Scan/MRI), Predictive Diagnostics Finance Fraud Detection, Risk Analytics, Improved UX (Natural Language Processing) Energy Seismic Resource Analytics Education Adaptive Learning, Real Time Translation Media and Entertainment Augmented Reality, Intelligent Game Agents, Precision Recommendation Supply Chain Dynamic Routing Optimization Consumer / Mobile App Speech Recognition, Natural Language, Content Tagging and Recognition Retail Computer Vision for Shopping Carts, Precision Recommendation Pharma / Life Sciences Drug Discovery, Genome Analytics, Image Classification (molecules / whole systems)
  • 7. Cross-industry benefits of PowerAI • For Marketing executives: Better understanding of client perspectives, recommendations, improve interaction • For IT Executives: Enterprise class support, better price performance and a shared infrastructure • For Data Scientists and Developers: NVLink means new opportunities for innovation, faster access to larger memory, execute larger training models and more iteration … refine, improve • Research Scientists: Explore new ways to understand existing datasets
  • 9. 9
  • 10. Introducing PowerAI: Get Started Fast with Deep Learning Enabled by High Performance Computing Infrastructure Package of Pre-Compiled Major Deep Learning Frameworks Easy to install & get started with Deep Learning with Enterprise-Class Support Optimised for performance to take advantage of NVLink
  • 11. Machine Learning / Deep Learning Software Stack Healthcare Diagnosis / Treatment Banking Fraud analysis Pattern Recognition Self-driving Cars Retail Customer Service Agents Vision / Image processing NLP On-Prem & Cloud Scale-out servers Cloud Deployment DL Frameworks CAFFE, Torch, TensorFlow, … Industry Solutions MLDL Building Blocks Software Libraries Frameworks, Services Infrastructure ... Knowledge Representation Data Layer Accelerated DBsNoSQL DBs Streams Machine Learning Hadoop / HDFS Accelerators GPUs, FPGAs, CAPI Flash, DL Accelerators Unique Power Technologies NVLink, CAPI ML Libraries Spark ML, Spark R, SciKit, Numpy, SystemML, Mahout Cognitive Platforms Watson Cloud Services Amazon, Microsoft, Google, IBM Cloud Distributed Computing Hadoop, Spark, MPI ... Legacy DBs Math Libraries: OpenBLAS/MASS, ATLAS, ESSL, LAPACK, cuDNN, cuBLAS, cuSparse, … PowerAI
  • 13. PowerAI platform Deep Learning Frameworks Accelerated Servers and Infrastructure for Scaling Spectrum Scale: High-Speed Parallel File System Scale to Cloud Cluster of NVLink Servers Coming Soon CAFFE NVCaffe TorchIBMCaffe TensorFlow OpenBLAS Bazel Theano Chainer Distributed Frameworks Supporting libraries DL4J NCCL DIGITS
  • 14. Who is using PowerAI?
  • 15. PowerAI takes advantage of NVLink between POWER8 & Tesla P100 to increase system bandwidth P100 GPU POWER8 CPU GPU Memory System Memory P100 GPU 80 GB/s GPU Memory NVLink 115 GB/s P100 GPU POWER8 CPU GPU Memory System Memory P100 GPU 80 GB/s GPU Memory NVLink 115 GB/s
  • 16. S822LC for HPC: recommended configuration for PowerAI 2 Socket, 4 GPU System with NVLink Required: • 2 POWER8 10 Core CPUs • 4 NVIDIA P100 ”Pascal” GPUs • 256 GB System Memory • 2 SSD storage devices • High-speed interconnect (IB or Ethernet, depending on infrastructure) Optional: • Up to 1 TB System Memory • PCIe attached NVMe storage (aka Minsky)
  • 17. TensorFlow on Tesla P100: PowerAI is 30% faster (larger is better) S822LC - Optimized E5-2640v4 0 50 100 150 200 250 300 Images Processed (Images/Sec) IBM S822LC 20-cores 2.86GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs / Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch) Intel Broadwell E5-2640v4 20-core 2.6 GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs/ Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch)
  • 18. Deep Learning on Minsky x86 with 4x M40 / PCIe Power8 with 4x P100 / NVLink 0 20 40 60 80 100 120 140 AlexNet using Caffe Time to top-1, 50% Accuracy (Lower is better) x86 with 8x M40 / PCIe Power8 with 4x P100 / NVLink 0:00 1:12 2:24 3:36 4:48 6:00 7:12 8:24 VGGNet using BVLC Caffe vs IBM Caffe Time to Top-1 50% accuracy: (Lower is better) S822LC/HPC with 4 Tesla P100 Tesla GPUs is 24% Faster than 8x Tesla M40 GPUs S822LC/HPC with 4 Tesla P100 GPUs is 2.2x Faster than 4x Tesla M40 GPUs IBM S822LC 20-cores 2.86GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs / Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / IBM Caffe 1.0.0-rc3 / Imagenet Data Intel Broadwell E5-2640v4 20-core 2.6 GHz 512GB memory / 4 or 8 NVIDIA Tesla M40 GPUs / Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / BVLC Caffe 1.0.0-rc3 / Imagenet Data Trainingtimes(mins)
  • 19. NVLink and P100 advantage: reducing communication time, incorporating the fastest GPU for deep learning • NVLink reduces communication time and overhead • Data gets from GPU-GPU, Memory-GPU faster, for shorter training times x86 based GPU system POWER8 + Tesla P100+NVLink ImageNet / Alexnet: Minibatch size = 128 170 ms 78 ms IBM advantage: data communication and GPU performance
  • 21. PowerAI Simplifies Access and Installation • Tested, binary builds of common Deep Learning frameworks for ease of implementation • Simple, complete installation process documented on ibm.biz/powerai • Future focus on optimizing specific packages for POWER: OpenBLAS, NVIDIA Caffe, TensorFlow, and Torch PowerAI OS Ubuntu 16.04 CUDA 8.0 cuDNN 5.1 Built w/ MASS Yes OpenBLAS 0.2.19 Caffe 1.0 rc3 NVIDIA Caffe 0.14.5 + 0.15.3 IBM Caffe 1.0 rc3 Chainer 1.18 NVIDIA DIGITS 5 Torch 7 Theano 0.8.2 TensorFlow 0.12.0 GPU 4 x P100 Base System S822LC/HPC
  • 22. Easy Installation… Up and running... Software repository Setup The Deep Learning packages are published as an Ubuntu package that sets up an installation repository on the local machine. The repository can be enabled as follows: Download the latest mldl-repo-local .deb file from https://download.boulder.ibm.com/ibmdl/pub/software/server/mldl/ Install the repository package: $ sudo dpkg -i mldl-repo-local*.deb Update the package cache $ sudo apt-get update Installing all frameworks at once All the Deep Learning frameworks can be installed at once using the power-mldl meta-package: $ sudo apt-get install power-mldl Installing frameworks individually The Deep Learning frameworks can be installed individually if preferred. The framework packages are: ● caffe-bvlc - Berkeley Vision and Learning Center (BVLC) upstream Caffe, v1.0.0rc3 ● caffe-ibm - IBM Optimized version of BVLC Caffe, v1.0.0rc3 ● caffe-nv - NVIDIA fork of Caffe, v0.15.13 and v0.14.5 ● chainer - Chainer, v1.18.0 ● digits - DIGITS, v5.0.0-rc.1 ● tensorflow - Google TensorFlow, v0.12.0 ● theano - Theano, v0.8.2 ● torch - Torch, v7 Each can be installed with: $ sudo apt-get install <framework> Tuning recommendations Recommended settings for optimal Deep Learning performance on the S822LC for High Performance Computing are: Enable Performance Governor $ sudo apt-get install linux-tools-common cpufrequtils $ sudo cpupower -c all frequency-set -g performance Enable GPU persistence mode Use nvidia-persistenced ( http://docs.nvidia.com/deploy/driver-persistence/index .html ) or $ sudo nvidia-smi -pm ENABLED Set GPU memory and graphics clocks $ sudo nvidia-smi -ac 715,1480 For TensorFlow, set the SMT mode $ sudo ppc64_64 --smt=2
  • 24. Get started today ● Things are moving VERY quickly in this space ● Where could you take advantage of AI capabilities? ● Find out more at: ibm.biz/powerai Mandie Quartly @mandieq
  • 25. 25 Copyright © 2016 by International Business Machines Corporation. All rights reserved. No part of this document may be reproduced or transmitted in any form without written permission from IBM Corporation. Product data has been reviewed for accuracy as of the date of initial publication. Product data is subject to change without notice. This document could include technical inaccuracies or typographical errors. IBM may make improvements and/or changes in the product(s) and/or program(s) described herein at any time without notice. Any statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. References in this document to IBM products, programs, or services does not imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. Any reference to an IBM Program Product in this document is not intended to state or imply that only that program product may be used. Any functionally equivalent program, that does not infringe IBM's intellectually property rights, may be used instead. THE INFORMATION PROVIDED IN THIS DOCUMENT IS DISTRIBUTED "AS IS" WITHOUT ANY WARRANTY, EITHER OR IMPLIED. IBM LY DISCLAIMS ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NONINFRINGEMENT. IBM shall have no responsibility to update this information. IBM products are warranted, if at all, according to the terms and conditions of the agreements (e.g., IBM Customer Agreement, Statement of Limited Warranty, International Program License Agreement, etc.) under which they are provided. Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements or other publicly available sources. IBM has not tested those products in connection with this publication and cannot confirm the accuracy of performance, compatibility or any other claims related to non-IBM products. IBM makes no representations or warranties, ed or implied, regarding non-IBM products and services. The provision of the information contained herein is not intended to, and does not, grant any right or license under any IBM patents or copyrights. Inquiries regarding patent or copyright licenses should be made, in writing, to: IBM Director of Licensing IBM Corporation North Castle Drive Armonk, NY 1 0504- 785 U.S.A. Legal notices | 2
  • 26. 26 Information and trademarks IBM, the IBM logo, ibm.com, IBM System Storage, IBM Spectrum Storage, IBM Spectrum Control, IBM Spectrum Protect, IBM Spectrum Archive, IBM Spectrum Virtualize, IBM Spectrum Scale, IBM Spectrum Accelerate, Softlayer, and XIV are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at http://www.ibm.com/legal/copytrade.shtml The following are trademarks or registered trademarks of other companies. Adobe, the Adobe logo, PostScript, and the PostScript logo are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, and/or other countries. IT Infrastructure Library is a Registered Trade Mark of AXELOS Limited. Linear Tape-Open, LTO, the LTO Logo, Ultrium, and the Ultrium logo are trademarks of HP, IBM Corp. and Quantum in the U.S. and other countries. Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. Java and all Java-based trademarks and logos are trademarks or registered trademarks of Oracle and/or its affiliates. Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States, other countries, or both and is used under license therefrom. ITIL is a Registered Trade Mark of AXELOS Limited. UNIX is a registered trademark of The Open Group in the United States and other countries. * All other products may be trademarks or registered trademarks of their respective companies. Notes: Performance is in Internal Throughput Rate (ITR) ratio based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput that any user will experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput improvements equivalent to the performance ratios stated here. All customer examples cited or described in this presentation are presented as illustrations of the manner in which some customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics will vary depending on individual customer configurations and conditions. This publication was produced in the United States. IBM may not offer the products, services or features discussed in this document in other countries, and the information may be subject to change without notice. Consult your local IBM business contact for information on the product or services available in your area. All statements regarding IBM's future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. Information about non-IBM products is obtained from the manufacturers of those products or their published announcements. IBM has not tested those products and cannot confirm the performance, compatibility, or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. Prices subject to change without notice. Contact your IBM representative or Business Partner for the most current pricing in your geography. This presentation and the claims outlined in it were reviewed for compliance with US law. Adaptations of these claims for use in other geographies must be reviewed by the local country counsel for compliance with local laws. | 2
  • 27. 27 Special notices This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in other countries, and the information is subject to change without notice. Consult your local IBM business contact for information on the IBM offerings available in your area. Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give you any license to these patents. Send license inquires, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY 10504-1785 USA. All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only. The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or guarantees either expressed or implied. All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the results that may be achieved. Actual environmental costs and performance characteristics will vary depending on individual client configurations and conditions. IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions worldwide to qualified commercial and government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment type and options, and may vary by country. Other restrictions may apply. Rates and offerings are subject to change, extension or withdrawal without notice. IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies. All prices shown are IBM's United States suggested list prices and are subject to change without notice; reseller prices may vary. IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply. Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are dependent on many factors including system hardware configuration and software design and configuration. Some measurements quoted in this document may have been made on development-level systems. There is no guarantee these measurements will be the same on generally-available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this document should verify the applicable data for their specific environment. | 2