SlideShare a Scribd company logo
INTEL® HPCDEVELOPERCONFERENCE
FUELYOURINSIGHT
INTEL® HPCDEVELOPERCONFERENCE
FUELYOURINSIGHT
VISUALIZATIONTRACKWELCOMEANDSDVISUPDATE
Intel HPC Developer Conference
November 2016
Outline
§  The Pendulum of Computing
§  State of The Union Software Defined Visualization(SDVis)
–  Quick Refresher – What is SDVis? Why?
–  We are we today? (Hint: Launched and Active Integrations)
§  Visualization @ HPCDC and SC’16 Overview
§  Summary
3
PENDULUM
The Challenge with ‘Traditional’ Large Scale Visualization
HPC cluster performs
modeling and simulations
Dedicated Visualization
HW (GPUs) and SW
Client devices view
the final images
HW Visualization
Using Dedicated hardware and specialized software
Bottlenecks…
I/O, Scheduling, Memory Size, Power,…
Circa 2010 - 2015
6
The Pendulum of Computing
General Purpose
Software + Integration
Special Purpose
Fixed Function
Hybrid
7
The Pendulum of Computing
Example: Graphics/Visualization – 1970’s->80’s
Special Purpose
Fixed Function
Vector Displays
Tektronics
Silicon Graphics
Gfx Workstations
10’s-100’s of Dev Engs
Digital Graphics starting to emerge but in the realm of “specialty” uses
General Purpose
Software + Integration
Hybrid
8
The Pendulum of Computing
Example: Graphics/Visualization – ~1980’s-90’s
General Purpose
Software + Integration
Special Purpose
Fixed Function
Hybrid
Vector Displays
Tektronics
Silicon Graphics
Gfx Workstations
100’s of SW Engs
VGA/SVGA ”Frame Buffers”.
Personal Computers
DOS-Apps / Games
1000’s SW Engs
Limited Standards
High SW Innovation
Inflection Point: PCs Democratize Computing and Development
9
The Pendulum of Computing
Example: Graphics/Visualization – ~1990-’95
General Purpose
Software + Integration
Special Purpose
Fixed Function
Hybrid
Vector Displays
Tektronics
Silicon Graphics
Gfx Workstations
100’s of SW Engs
HW Cursor
Windowing GUI
2D FB HW Copy
Rect Fill; Line Draw
Device Drivers
100’s -1000’s SW Engs
Inflection Point: GUI Systems + Standards Emerge - Algo’s Mature
VGA/SVGA F.B.
Personal Computers
DOS-Apps / Games
1000’s SW Engs
Limited Standards
High SW Innovation
10
The Pendulum of Computing
Example: Graphics/Visualization – ~1995->2000
General Purpose
Software + Integration
Special Purpose
Fixed Function
Hybrid
Video / Audio Compression
Graphics/Video Integrated
3D Fixed Function Pipelines
”High Speed Buses – PCI”
100’s SW Engs
Limited Software Innovation
HW Cursor
Windowing GUI
2D FB HW Copy
Rect Fill; Line Draw
Inflection Point: Windows 95, DirectX/OpenGL, 3D Gaming
VGA/SVGA F.B.
Personal Computers
DOS-Apps / Games
1000’s SW Engs
Limited Standards
High SW Innovation
11
The Pendulum of Computing
Example: Graphics/Visualization – ~2000-‘14
General Purpose
Software + Integration
Special Purpose
Fixed Function
Hybrid
Video / Audio Compression
Graphics/Video Integrated
3D Fixed Function Pipelines
”High Speed Buses – PCI”
Shaders; Limited Compute
Advanced 3D Texture Modes
Integrated M.B. Graphics
High Speed RAM (GDDR)
3D Scientific Vis
CUDA, Multicore CPU’s
1000’s SW Engs
Inflection Point: 3D Gaming “Realism” drives programmable shaders, improved flexibility
HPC Compute Adjacency + CUDA + Multicore -> Parallel Computing Emerges
VGA/SVGA F.B.
Personal Computers
DOS-Apps / Games
1000’s SW Engs
Limited Standards
High SW Innovation
12
The Pendulum of Computing
Example: Graphics/Visualization – 2014 -> ?
General Purpose
Software + Integration
Special Purpose
Fixed Function
Hybrid
Video / Audio Compression
Graphics/Video Integrated
3D Fixed Function Pipelines
”High Speed Buses – PCI”
Shaders; Limited Compute
Advanced 3D Texture Modes
Integrated M.B. Graphics
High Speed RAM (GDDR)
3D Scientific Vis
CUDA, Multicore CPU’s
1000’s SW Engs
A ”New” Inflection Point?: Big Data, I/O Bottlenecks, Power, TCO challenges
8+ cores, SIMD CPUs, On-Package High Bandwidth Memory
Multicore and Many-Core CPU’s
Integrated CPU Graphics
HBW Memory and Caches
Big Data
High SW innovation (perf+fidelity)
SciVis analysis growth
1000’s - 10,000’s+
“
13
So what does this mean for Visual Analysis
and Workflows?
Example: EPFL Blue Brain Project
“Brayns” + OSPRay
Brain Neuron Growth 3D Visualization
Web-Based GUI Large Display or Display Wall Output
Real-time Simulation of Brain Neuron Formation and Electrical Impulses
~70 GB of Input Data @ 15-20fps; Shown using 4 Knights Landing Processors @ ISC’16 Intel Booth
IMPOSSIBLE FOR A TETHERED GPU WITH LIMITED MEMORY AND Bandwidth!
Example:GR-Chombo Black Hole Collision
Stephen Hawking CTC
LIGO Black Hole Collision simulation and resulting Gravitational Waves
1200 Timesteps over 0.2s – Collide @ ‘Speed of Light’
3.6 TB Postprocessed (from 36 TB) Dataset
IMPOSSIBLE FOR A TETHERED GPU WITH LIMITED MEMORY AND Bandwidth!
STATEOFSOFTWAREDEFINEDVISUALIZATION-
2016
17
Refresher – Why SDVis and What is it?
Our Vision: Scalable, Flexible Vis Rendering that Runs
Anywhere!
Standalone Laptops or
Workstations
Big Memory Nodes or Rendering
Focused Clusters
Large Compute+Vis clusters with
Local or Remote Clients
Cloud or
Network
How?
Intel® SSF and Intel-Supported Software Defined Visualization (SDVis)!
Embree
-  CPU Optimized Ray Tracing Algorithms
-  ‘Tool kit’ for Building Ray Tracings Apps
-  Broadly Adopted by 3rd Party ISVs
-  More at http://embree.github.io
OSPRay
-  Rendering Engine Based on Embree
-  API Designed to Ease Creation of
Visualization Software
-  More at http://ospray.org
OpenSWR
-  High Performance CPU Vis Rasterization
-  Fully Integrated into MESA v12.0+
-  Supports ParaView, Visit, VTK, EnSight,
VL3
-  More at http://mesa3d.org
Standard OpenGL Image
Image Rendered by OSPRay
HW
Visualization
Software Defined
Visualization
Addressing Large-scale, High Performance, and
High Fidelity Visualization with SDVis
Gain deeper understanding of data impacting science & discovery
High fidelity, more realistic images even as data sets become increasingly larger,
and more complex; no need to compromise data resolution
Solve computing + modeling problem together (in-situ vis)
Essential SW development suite that makes concurrent simulation and
visualization efficient – users can work interactively and get results quicker
One system
Use same system for both simulation and visualization, avoid data transfer
delays and memory size constraints – faster insights for solving toughest problems
Your Work. No Compromises.
Benefits of SDVis
•  Open-sourced technology delivering vivid
visualization of complex, enormous data sets
•  Innovative software libraries for visualizing
results with high performance by unlocking
the parallelism already in your system
•  High-fidelity images for gaining deeper
insights in science and industry, faster
•  Software Only solution lowers costs – no card
cost, no card maintenance, lower power bills
Magnetic Reconnection Model, Courtesy Bill
Duaghton(LANL) and Berc Geveci(Kitware)
Gravational Waves : GR-Chombo AMR Data, Stephen Hawking CTC, UCambridge;
Queens College, London; visualization, Carson Brownlee, Intel, ParaView)
Ribosome: Data: Max-Planck
Institute for Biophysical Chemistry
Hi-Fidelity Visualization
with…
§  … scalable image quality
§  … scalable model size
§  … scalable in rendering cost
Data set provided by Florida International University
OpenSWR Software Rasterizer
(www.mesa3d.org
www.openswr.org)
•  High performance open source
software implementation of OpenGL*
rasterizer
‑  Fully multi-threaded and vectorized for Intel® processors
‑  Can access full system memory - highest resolution data
‑  Leverages community development effort (MESA)
•  Drop in replacement for OpenGL
library
•  Available since July’16 in Mesa v12.0+
targeting features and performance for
leading ScIVis Apps
VL3
Ray Tracing Foundation:
Embree Ray Tracing Kernel Library
24
Provides highly optimized and scalable ray tracing kernels
§  Acceleration structure build and ray traversal
§  Single Ray, Ray Packets(4,8,16), Ray Streams(N)
Targets up to photorealistic professional and scientific rendering applications
Highest ray tracing performance on CPUs
§  1.5–6× speedup reported by users
Support for latest CPUs / ISAs
§  Intel® Xeon Phi™ Processor (codenamed Knights Landing) – AVX-512
API for easy integration into applications
Free and open source under Apache 2.0 license
§  http://embree.github.com
24
Performance: Embree vs. NVIDIA* OptiX* (Pascal)
0
5
10
15
20
25
30
35
40
45
50
Bentley
(2.3M Tris)
Crown
(4.8M Tris)
Dragon
(7.4M Tris)
Karst Fluid Flow
(8.4M Tris)
Power Plant
(12.8M Tris)
Intel® Xeon® Processor
E5-2699 v4
2 x 22 cores, 2.2 GHz
Intel® Xeon Phi™ Processor
7250
68 cores, 1.4 GHz
NVIDIA TITAN X (Pascal)
Coprocessor
12 GB RAM
Frames Per Second (Higher is Better), 1024x1024 image resolution
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark* and MobileMark*, are measured using specific
computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you
in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance.
Embree 2.12.0, ICC 2016 Update 1, Intel®
SPMD
Program Compiler
(Intel® ISPC) 1.9.1
NVIDIA* OptiX* 4.0.1, CUDA* 8.0.44
Source: Intel
26
Intel
Path Tracer Renderer Sample Application: Path Tracer Renderer using Embree and OptiX* RT Libraries
Description: Path tracing app for use in benchmarking RT libraries
Availability:
§  Code: embree.github.io
§  Recipe: embree.github.io.
Usage Model:
§  TBB, ISPC and Intrinsics
Highlights:
§  The code represents a typical ray tracing rendering pipeline used throughout DCC to
show comparative performance on different types of hardware with a variety of
input 3D data models. It has been optimized for all Intel ISA;s and delivered to the
community and available for download from embree.github.io
§  End-User benefits: Ability to achieve competitive performance and the flexibility of
IA for rendering and render farm applications
Results:
§  Embree on dual socket (44 cores total) Intel® Xeon® E5-2699 v4 Processor performs
20% - 30% faster than Intel® Xeon Phi™ 7250 Processor
–  Path tracing causes low SIMD utilization (better for Xeon)
•  Embree on dual socket (44 cores total) Intel® Xeon® E5-2699 v4 Processor more
than 2x faster than OptiX on NVIDIA TITAN X (Pascal) GPU.
–  Path tracing causes low SIMD utilization (better for Xeon)
–  Large path tracing kernel requires many registers (thus fewer threads
executed on GPU)
*	
  Other	
  names	
  and	
  brands	
  may	
  be	
  claimed	
  as	
  the	
  property	
  of	
  others.	
  
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are
measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other
information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. See
benchmark tests and configurations in the speaker notes. For more information go to http://www.intel.com/performance
0
5
10
15
20
25
30
35
40
45
50
Bentley
(2.3M Tris)
Crown
(4.8M Tris)
Dragon
(7.4M Tris)
Karst Fluid Flow
(8.4M Tris)
Power Plant
(12.8M Tris)
Intel® Xeon® Processor
E5-2699 v4
2 x 22 cores, 2.2 GHz
Intel® Xeon Phi™ Processor
7250
68 cores, 1.4 GHz
NVIDIA TITAN X (Pascal)
Coprocessor
12 GB RAM
Frames Per Second (Higher is Better), 1024x1024
image resolution
Diffuse Path Tracing Performance: Embree vs. NVIDIA* OptiX* Prime
0
20
40
60
80
100
120
140
160
180
Mazda
(5.7M Tris)
Villa
(37.7M Tris)
Art Deco
(10.7M Tris)
Power Plant
(12.8M Tris)
San Miguel
(10.5M Tris)
Intel® Xeon® Processor
E5-2699 v4
2 x 22 cores, 2.2 GHz
Intel® Xeon Phi™ Processor
7250
68 cores, 1.4 GHz
NVIDIA TITAN X (Pascal)
Coprocessor
12 GB RAM
Million Rays Per Second (Higher is Better), 3840x2160 image resolution
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark* and MobileMark*, are measured using specific
computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you
in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance.
Embree 2.12.0, Intel® C++ Compiler 17.0
NVIDIA* OptiX* Prime 4.0.1, CUDA* 8.0.44
Source: Intel
Embree Adoption*
28
Image rendered with FluidRay RT
Rendered with StingRay,
SURVICE Engineering
Courtesy of Jeff Patton, Rendered with Corona Renderer
pCon.planner rendered courtesty EasternGraphics
*Many other announced users incl.: Pixar, Weta Digital, Activision, Chaos V-Ray, Ready At Dawn,
FrostBite, EpicGames UnReal, High Moon, Blue Sky, UBISoft MP, Framestore,….
29
§  Build on top of Embree; Launched June 2016
§  Scalable Visualization targeted features
–  Surfaces (both polygonal and non-
polygonal)
–  Volumes, and volume rendering
–  High-Fidelity rendering/shading
methods
–  Scalable Cluster Wide Rendering
§  Packed it up in an ‘easy-to-use’ rendering
library for visualization
–  Same "spirit" as OpenGL, but different
API VL3
Brayns PowerCT
OSPRay: A Ray-Tracing based Rendering Engine for
High-Fidelity Visualization
NASA
WHEREAREWETODAY?
WHATISBEINGDONEWITH SDVIS?
ParaView v5.2 with integrated OSPRay and OpenSWR
31
•  Brain Tumor monitoring
and treatment
•  3D interactive @ 10-20fps
•  Intel® Xeon Phi™ processor
cluster
•  Ambient occlusion plus
shadows
•  Stop by the Intel SC’16
booth to see it live!
•  Data courtesy Kitware. Visualization,
Carson Brownlee, Intel
32
NASA – Custom OSPRay App
•  Simulated on Pleiades
supercomputer
•  Rendered on attached ‘hyperwall’
cluster
•  Dataset: SRB booster separation
from SLS
Simulation: Jeff Onufer and Tom Pulliam, NASA Ames
Visualization: Tim Sandstrom and Pat Moran, NASA Ames
33
Stephen Hawking Centre for Theoretical Cosmology –
ParaView / VTK with OSPRay
Gravational Waves : GR-Chombo AMR Data, Stephen Hawking CTC, UCambridge; Queens College, London;
visualization, Carson Brownlee, Intel)
•  600 GB Memory Footprint
•  36 TB Simulation Data Set
•  4 Intel® Xeon Phi™ 7230
Processors
•  1 Intel® Xeon® E5 v4 Dual
Socket node
•  Intel® Omni-Path Fabric
•  ~10 fps
•  See a demo in the SC’16
Intel “Discovery Zone”
34
Stephen Hawking Centre for Theoretical Cosmology –
‘Walls’ in situ with OSPRay Rendering
•  10 TB Memory Footprint
•  SGI UV-300 16TB SMP
•  >1000 Shared memory Intel® Xeon® E5
v3 processors
•  ~15 fps
•  Domain Wall formation in the universe
from Big Bang to today (13.8 billion
years)
•  Simulation code by Shellard et al, Visualizaiton by Johannes
Gunther (Intel)
35
Argonne VL3 Distributed Volume Renderer
VL3 with Mesa v13.0 with OpenSWR VL3 Compositing with OSPRay
Visualizations: Silvio Rizzi, Joe Insley: Argonne; Aaron Knoll: SCI @ UUtah
VISUALIZATION@HPCDCANDSC’16
OVERVIEW
36
SDVis Track Schedule (SATURDAY)
37
Technical Sessions	
  
SW Visualization
(Powder Mountain)	
  
 	
  
Lab Sessions	
  
Lab Room 3
(Capacity 50)	
  
Start 	
   End	
   Technical Sessions	
    	
   Start 	
   End	
   Hands On Lab	
  
12:00 PM	
   1:00 PM	
   Registration Opens	
  
1:00 PM	
   1:50 PM	
   Welcome Kick Off	
  
1:50 PM	
   2:05 PM	
   Break	
  
2:05 PM	
   2:55 PM	
  
Talk 1 - SDVis Update (Jim Jeffers)
Talk 2 – OpenSWR Update (Jeff
Amstutz)  	
  
2:05 PM	
   3:30 PM	
  
2:55 PM	
   3:10 PM	
    Break	
    	
  
3:10 PM	
   4:00 PM	
  
Talk 1 - OSPRay 1.0 and Beyond (Jeff
A, Intel)
Talk 2 - MPI Data-Parallel Rendering
w/OSPRay (Carson B, Intel)
 	
  
 	
   3:30 PM	
   3:40PM	
   Break	
  
 	
  
3:40PM	
   4:30PM	
  
Software Defined
Visualization : Getting
the most out of
ParaView OSPRay (Paul
A. Navrátil & David E.
DeMarle, Kitware)
4:00 PM	
   4:15 PM	
     Break	
    	
  
4:15 PM	
   5:05 PM	
  
Talk 1 - Realizing Multi-Hit Ray Tracing
in Embree and OSPRay (Christiaan
Gribble, Intel/SURVICE)
Talk 2 - Visualization w/Visit on Knights
Landing (Jian Huang & Hank Childs,
UOregon / UTennessee)
 	
  
 	
  
4:30PM	
   5:05PM	
  
SDVis Track Schedule (SUNDAY)
38
Technical Sessions	
  
SW Visualization
(Powder Mountain)	
    	
  
Lab Sessions	
  
Lab Room 2
(Capacity 25)	
  
7:00 AM	
   9:00 AM	
   Registration andBreakfast	
  
8:45 AM	
   9:30 AM	
   Keynote	
  
9:30 AM	
   9:45 AM	
   Break 	
  
9:45 AM	
   10:35 AM	
  
Talk 1 - SDVis Efforts @ Intel® PCC
Aaron Knoll, Unv. Of Utah)
Talk 2 - OSPRay Integration into
Pcon-Planner (Caglar Özgür & Frank
Wicht, Eastern Graphics)
9:45 AM	
   10:35 AM	
   Software Defined
Visualization : Getting the
most out of ParaView OSPRay
(Kitware)	
  10:35 AM	
   10:50 AM	
   Break	
  
10:35 AM	
   11:25AM	
  
10:50 AM	
   11:40 AM	
  
Bio-Molecular Vis on Knight
Landing (John Stone, UIUC)
11:25 AM	
   11:40AM	
   Break	
  
11:40 AM	
   1:00 PM	
   Lunch time Panel	
   11:40 AM	
   1:00 PM	
   Lunch time Panel	
  
1:00 PM	
   1:50 PM	
  
Paraview & VTK w/OSPRay and
OpenSWR (David DeMarle, Kitware) 1:00 PM	
   2:00PM	
  
1:50 PM	
   2:05 PM	
   Break	
  
2:05 PM	
   2:55PM	
  
SDVIs and In-Situ Visualization on
TACC's Stampede (Paul Navratil)
2:00PM	
   2:40 AM	
  
2:40AM	
   2:50PM	
   Break	
  
2:55 PM	
   3:10 PM	
    Break	
  
2:50PM	
   4:00 PM	
  
3:10 PM	
   4:00 PM	
  
Live Demos and Open Discussion
on Software Defined Visualization
(All Vis Track Presenters)
4:00 PM	
   4:15 PM	
   Break	
  
4:15 PM	
   4:45 PM	
   Closing Keynote	
  
7:00 PM	
   10:00 PM	
   Intel® Networking Reception	
  
SC’16 Software Defined Visualization Demos
Intel Main Booth (#1819):
Intel® SSF Cluster (Intel® Xeon Phi™ Processors, Intel Xeon® v4 Processors, Intel® Omni-
Path Fabric, Intel® HPC Orchestrator, Intel® Lustre
•  1) ParaView v5.2 w/OSPRay&OpenSWR: Brain Tumor Analysis
•  2) VMD v1.9.x w/OSPRay: Cryo-EM Reconstruction with ROME
•  3) VMD v1.9.x w/OSPRay: LAMMPS for Cancer Research
Intel Discover Zone (#2121) – Intel® Xeon Phi™ Processor DAPs
•  Argonne VL3 w/OSPRay: HACC Dark Matter Analysis
•  ParaView v5.2 w/OSPRay: Stephen Hawking CTC – Ligo based Black Hole collision
Partner Booths
•  Dell, SuperMicro, Kitware, NASA, Univ of Utah, NCSA, …
39
SCI-X Open House
Univ. of Utah
40
Weds	
  Nov.	
  16	
  
1:00	
  p.m.	
  -­‐	
  7:00	
  p.m.	
  
	
  
1-­‐5	
  p.m.:	
  Open	
  House:	
  Cont.	
  Buses	
  	
  
between	
  the	
  Salt	
  Palace	
  and	
  the	
  
University	
  (10	
  minutes	
  each	
  way).	
  
	
  
5	
  p.m.:	
  Keynote	
  presentaKon	
  by	
  Jim	
  
Clark	
  -­‐	
  Warnock	
  Engineering	
  Building	
  
L104	
  (overflow	
  -­‐	
  WEB	
  2230)	
  
	
  
6:00	
  p.m.	
  -­‐	
  RecepKon	
  -­‐	
  Catmull	
  
Gallery	
  
Summary:
Software Defined Visualization
www.sdvis.org
•  Addresses ever-growing HPC challenges for data
size, flexibility, reliability and maintainability
•  OpenSWR, OSPRay and Embree rendering
libraries optimize use of CPUs and main memory
•  Integrating into prominent Vis tools, ParaView*,
VisIt, EnSight*, VMD, Brayns, VL3, and more ….
•  All freely available (Open Source), developed and
maintained by Intel
SDVis = Performance, Fidelity and Lower Cost!!
QUESTIONS?
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)
Legal Notices and Disclaimers
Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or
service activation. Performance varies depending on system configuration. No computer system can be absolutely
secure. Software and workloads used in performance tests may have been optimized for performance only on Intel
microprocessors.
Performance tests, are measured using specific computer systems, components, software, operations and functions. Any
change to any of those factors may cause the results to vary. You should consult other information and performance
tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when
combined with other products.
Copyright © 2016 Intel Corporation. All rights reserved. Intel, Intel Inside, the Intel logo, Intel Xeon and Intel Xeon Phi are
trademarks of Intel Corporation in the United States and other countries. *Other names and brands may be claimed
as the property of others.
Copyright © 2016 Intel Corporation, All Rights Reserved
45

More Related Content

What's hot (20)

PDF
Keynote (Phil Rogers) - The Programmers Guide to Reaching for the Cloud - by ...
AMD Developer Central
 
PDF
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
AMD Developer Central
 
PDF
MIT's experience on OpenPOWER/POWER 9 platform
Ganesan Narayanasamy
 
PDF
Nvidia at SEMICon, Munich
Alison B. Lowndes
 
PDF
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
AMD Developer Central
 
PDF
Exploiting GPUs in Spark
Kazuaki Ishizaki
 
PDF
CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
AMD Developer Central
 
PDF
Easy and High Performance GPU Programming for Java Programmers
Kazuaki Ishizaki
 
PDF
HC-4021, Efficient scheduling of OpenMP and OpenCL™ workloads on Accelerated ...
AMD Developer Central
 
PDF
LCU13: GPGPU on ARM Experience Report
Linaro
 
PDF
"The OpenVX Computer Vision and Neural Network Inference Library Standard for...
Edge AI and Vision Alliance
 
PDF
Transparent GPU Exploitation for Java
Kazuaki Ishizaki
 
PDF
計算力学シミュレーションに GPU は役立つのか?
Shinnosuke Furuya
 
PDF
Hadoop + GPU
Vladimir Starostenkov
 
PDF
"Fast Deployment of Low-power Deep Learning on CEVA Vision Processors," a Pre...
Edge AI and Vision Alliance
 
PPTX
A Primer on FPGAs - Field Programmable Gate Arrays
Taylor Riggan
 
PDF
Making Hardware Accelerator Easier to Use
Kazuaki Ishizaki
 
PDF
"Lessons Learned from Bringing Mobile and Embedded Vision Products to Market,...
Edge AI and Vision Alliance
 
PDF
Affordable AI Connects To A Better Life
NVIDIA Taiwan
 
PDF
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
Edge AI and Vision Alliance
 
Keynote (Phil Rogers) - The Programmers Guide to Reaching for the Cloud - by ...
AMD Developer Central
 
PL-4043, Accelerating OpenVL for Heterogeneous Platforms, by Gregor Miller
AMD Developer Central
 
MIT's experience on OpenPOWER/POWER 9 platform
Ganesan Narayanasamy
 
Nvidia at SEMICon, Munich
Alison B. Lowndes
 
PT-4052, Introduction to AMD Developer Tools, by Yaki Tebeka and Gordon Selley
AMD Developer Central
 
Exploiting GPUs in Spark
Kazuaki Ishizaki
 
CC-4001, Aparapi and HSA: Easing the developer path to APU/GPU accelerated Ja...
AMD Developer Central
 
Easy and High Performance GPU Programming for Java Programmers
Kazuaki Ishizaki
 
HC-4021, Efficient scheduling of OpenMP and OpenCL™ workloads on Accelerated ...
AMD Developer Central
 
LCU13: GPGPU on ARM Experience Report
Linaro
 
"The OpenVX Computer Vision and Neural Network Inference Library Standard for...
Edge AI and Vision Alliance
 
Transparent GPU Exploitation for Java
Kazuaki Ishizaki
 
計算力学シミュレーションに GPU は役立つのか?
Shinnosuke Furuya
 
Hadoop + GPU
Vladimir Starostenkov
 
"Fast Deployment of Low-power Deep Learning on CEVA Vision Processors," a Pre...
Edge AI and Vision Alliance
 
A Primer on FPGAs - Field Programmable Gate Arrays
Taylor Riggan
 
Making Hardware Accelerator Easier to Use
Kazuaki Ishizaki
 
"Lessons Learned from Bringing Mobile and Embedded Vision Products to Market,...
Edge AI and Vision Alliance
 
Affordable AI Connects To A Better Life
NVIDIA Taiwan
 
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
Edge AI and Vision Alliance
 

Similar to Introduction to Software Defined Visualization (SDVis) (20)

PDF
Software Defined Visualization (SDVis): Get the Most Out of ParaView* with OS...
Intel® Software
 
PDF
SDVIs and In-Situ Visualization on TACC's Stampede
Intel® Software
 
PPTX
Kelly gaither
Lucas Giraldo
 
PDF
Computer-Vision_Integrating-Technology_MOB_17.06.16
Schuyler Kennedy
 
PDF
Software-defined Visualization, High-Fidelity Visualization: OpenSWR and OSPRay
Intel® Software
 
PDF
Ray Tracing with Intel® Embree and Intel® OSPRay: Use Cases and Updates | SIG...
Intel® Software
 
PDF
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
Edge AI and Vision Alliance
 
PDF
May 2021 Embedded Vision Summit Opening Remarks (May 28)
Edge AI and Vision Alliance
 
PDF
Open Source Interactive CPU Preview Rendering with Pixar's Universal Scene De...
Intel® Software
 
PDF
Optimizing HDRP with NVIDIA Nsight Graphics – Unite Copenhagen 2019
Unity Technologies
 
PPT
0329.emccormi.ppt
chmeghana1
 
PDF
HC28.22.430-Vision-Neural-Net-GregEfland-Cadence-v02-57
AAMIR FAROOQUI
 
PDF
Lecture-1-2-+(1).pdf
samaghorab
 
PDF
Lecture-1-2-+(1).pdf
samaghorab
 
PDF
Computer architecture for vision system
AkashPatil334
 
PDF
[03 2][gpu용 개발자 도구 - parallel nsight 및 axe] gateau parallel-nsight
laparuma
 
PPTX
Embree Ray Tracing Kernels | Overview and New Features | SIGGRAPH 2018 Tech S...
Intel® Software
 
PPTX
S12075-GPU-Accelerated-Video-Encoding.pptx
gopikahari7
 
PDF
"Vision for All?," a Presentation from Intel
Edge AI and Vision Alliance
 
PPTX
Introduction to architecture exploration
Deepak Shankar
 
Software Defined Visualization (SDVis): Get the Most Out of ParaView* with OS...
Intel® Software
 
SDVIs and In-Situ Visualization on TACC's Stampede
Intel® Software
 
Kelly gaither
Lucas Giraldo
 
Computer-Vision_Integrating-Technology_MOB_17.06.16
Schuyler Kennedy
 
Software-defined Visualization, High-Fidelity Visualization: OpenSWR and OSPRay
Intel® Software
 
Ray Tracing with Intel® Embree and Intel® OSPRay: Use Cases and Updates | SIG...
Intel® Software
 
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
Edge AI and Vision Alliance
 
May 2021 Embedded Vision Summit Opening Remarks (May 28)
Edge AI and Vision Alliance
 
Open Source Interactive CPU Preview Rendering with Pixar's Universal Scene De...
Intel® Software
 
Optimizing HDRP with NVIDIA Nsight Graphics – Unite Copenhagen 2019
Unity Technologies
 
0329.emccormi.ppt
chmeghana1
 
HC28.22.430-Vision-Neural-Net-GregEfland-Cadence-v02-57
AAMIR FAROOQUI
 
Lecture-1-2-+(1).pdf
samaghorab
 
Lecture-1-2-+(1).pdf
samaghorab
 
Computer architecture for vision system
AkashPatil334
 
[03 2][gpu용 개발자 도구 - parallel nsight 및 axe] gateau parallel-nsight
laparuma
 
Embree Ray Tracing Kernels | Overview and New Features | SIGGRAPH 2018 Tech S...
Intel® Software
 
S12075-GPU-Accelerated-Video-Encoding.pptx
gopikahari7
 
"Vision for All?," a Presentation from Intel
Edge AI and Vision Alliance
 
Introduction to architecture exploration
Deepak Shankar
 
Ad

More from Intel® Software (20)

PPTX
AI for All: Biology is eating the world & AI is eating Biology
Intel® Software
 
PPTX
Python Data Science and Machine Learning at Scale with Intel and Anaconda
Intel® Software
 
PDF
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Intel® Software
 
PDF
AI for good: Scaling AI in science, healthcare, and more.
Intel® Software
 
PDF
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
Intel® Software
 
PPTX
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Intel® Software
 
PPTX
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
Intel® Software
 
PPTX
AWS & Intel Webinar Series - Accelerating AI Research
Intel® Software
 
PPTX
Intel Developer Program
Intel® Software
 
PDF
Intel AIDC Houston Summit - Overview Slides
Intel® Software
 
PDF
AIDC NY: BODO AI Presentation - 09.19.2019
Intel® Software
 
PDF
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
Intel® Software
 
PDF
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
Intel® Software
 
PDF
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
Intel® Software
 
PDF
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
Intel® Software
 
PDF
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
Intel® Software
 
PDF
AIDC India - AI on IA
Intel® Software
 
PDF
AIDC India - Intel Movidius / Open Vino Slides
Intel® Software
 
PDF
AIDC India - AI Vision Slides
Intel® Software
 
PDF
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
Intel® Software
 
AI for All: Biology is eating the world & AI is eating Biology
Intel® Software
 
Python Data Science and Machine Learning at Scale with Intel and Anaconda
Intel® Software
 
Streamline End-to-End AI Pipelines with Intel, Databricks, and OmniSci
Intel® Software
 
AI for good: Scaling AI in science, healthcare, and more.
Intel® Software
 
Software AI Accelerators: The Next Frontier | Software for AI Optimization Su...
Intel® Software
 
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Intel® Software
 
Reducing Deep Learning Integration Costs and Maximizing Compute Efficiency| S...
Intel® Software
 
AWS & Intel Webinar Series - Accelerating AI Research
Intel® Software
 
Intel Developer Program
Intel® Software
 
Intel AIDC Houston Summit - Overview Slides
Intel® Software
 
AIDC NY: BODO AI Presentation - 09.19.2019
Intel® Software
 
AIDC NY: Applications of Intel AI by QuEST Global - 09.19.2019
Intel® Software
 
Advanced Single Instruction Multiple Data (SIMD) Programming with Intel® Impl...
Intel® Software
 
Build a Deep Learning Video Analytics Framework | SIGGRAPH 2019 Technical Ses...
Intel® Software
 
Bring Intelligent Motion Using Reinforcement Learning Engines | SIGGRAPH 2019...
Intel® Software
 
RenderMan*: The Role of Open Shading Language (OSL) with Intel® Advanced Vect...
Intel® Software
 
AIDC India - AI on IA
Intel® Software
 
AIDC India - Intel Movidius / Open Vino Slides
Intel® Software
 
AIDC India - AI Vision Slides
Intel® Software
 
Enhance and Accelerate Your AI and Machine Learning Solution | SIGGRAPH 2019 ...
Intel® Software
 
Ad

Recently uploaded (20)

PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PPT
Interview paper part 3, It is based on Interview Prep
SoumyadeepGhosh39
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PPTX
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PPTX
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Interview paper part 3, It is based on Interview Prep
SoumyadeepGhosh39
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
Chris Elwell Woburn, MA - Passionate About IT Innovation
Chris Elwell Woburn, MA
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Exolore The Essential AI Tools in 2025.pdf
Srinivasan M
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
AUTOMATION AND ROBOTICS IN PHARMA INDUSTRY.pptx
sameeraaabegumm
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 

Introduction to Software Defined Visualization (SDVis)

  • 3. Outline §  The Pendulum of Computing §  State of The Union Software Defined Visualization(SDVis) –  Quick Refresher – What is SDVis? Why? –  We are we today? (Hint: Launched and Active Integrations) §  Visualization @ HPCDC and SC’16 Overview §  Summary 3
  • 5. The Challenge with ‘Traditional’ Large Scale Visualization HPC cluster performs modeling and simulations Dedicated Visualization HW (GPUs) and SW Client devices view the final images HW Visualization Using Dedicated hardware and specialized software Bottlenecks… I/O, Scheduling, Memory Size, Power,… Circa 2010 - 2015
  • 6. 6 The Pendulum of Computing General Purpose Software + Integration Special Purpose Fixed Function Hybrid
  • 7. 7 The Pendulum of Computing Example: Graphics/Visualization – 1970’s->80’s Special Purpose Fixed Function Vector Displays Tektronics Silicon Graphics Gfx Workstations 10’s-100’s of Dev Engs Digital Graphics starting to emerge but in the realm of “specialty” uses General Purpose Software + Integration Hybrid
  • 8. 8 The Pendulum of Computing Example: Graphics/Visualization – ~1980’s-90’s General Purpose Software + Integration Special Purpose Fixed Function Hybrid Vector Displays Tektronics Silicon Graphics Gfx Workstations 100’s of SW Engs VGA/SVGA ”Frame Buffers”. Personal Computers DOS-Apps / Games 1000’s SW Engs Limited Standards High SW Innovation Inflection Point: PCs Democratize Computing and Development
  • 9. 9 The Pendulum of Computing Example: Graphics/Visualization – ~1990-’95 General Purpose Software + Integration Special Purpose Fixed Function Hybrid Vector Displays Tektronics Silicon Graphics Gfx Workstations 100’s of SW Engs HW Cursor Windowing GUI 2D FB HW Copy Rect Fill; Line Draw Device Drivers 100’s -1000’s SW Engs Inflection Point: GUI Systems + Standards Emerge - Algo’s Mature VGA/SVGA F.B. Personal Computers DOS-Apps / Games 1000’s SW Engs Limited Standards High SW Innovation
  • 10. 10 The Pendulum of Computing Example: Graphics/Visualization – ~1995->2000 General Purpose Software + Integration Special Purpose Fixed Function Hybrid Video / Audio Compression Graphics/Video Integrated 3D Fixed Function Pipelines ”High Speed Buses – PCI” 100’s SW Engs Limited Software Innovation HW Cursor Windowing GUI 2D FB HW Copy Rect Fill; Line Draw Inflection Point: Windows 95, DirectX/OpenGL, 3D Gaming VGA/SVGA F.B. Personal Computers DOS-Apps / Games 1000’s SW Engs Limited Standards High SW Innovation
  • 11. 11 The Pendulum of Computing Example: Graphics/Visualization – ~2000-‘14 General Purpose Software + Integration Special Purpose Fixed Function Hybrid Video / Audio Compression Graphics/Video Integrated 3D Fixed Function Pipelines ”High Speed Buses – PCI” Shaders; Limited Compute Advanced 3D Texture Modes Integrated M.B. Graphics High Speed RAM (GDDR) 3D Scientific Vis CUDA, Multicore CPU’s 1000’s SW Engs Inflection Point: 3D Gaming “Realism” drives programmable shaders, improved flexibility HPC Compute Adjacency + CUDA + Multicore -> Parallel Computing Emerges VGA/SVGA F.B. Personal Computers DOS-Apps / Games 1000’s SW Engs Limited Standards High SW Innovation
  • 12. 12 The Pendulum of Computing Example: Graphics/Visualization – 2014 -> ? General Purpose Software + Integration Special Purpose Fixed Function Hybrid Video / Audio Compression Graphics/Video Integrated 3D Fixed Function Pipelines ”High Speed Buses – PCI” Shaders; Limited Compute Advanced 3D Texture Modes Integrated M.B. Graphics High Speed RAM (GDDR) 3D Scientific Vis CUDA, Multicore CPU’s 1000’s SW Engs A ”New” Inflection Point?: Big Data, I/O Bottlenecks, Power, TCO challenges 8+ cores, SIMD CPUs, On-Package High Bandwidth Memory Multicore and Many-Core CPU’s Integrated CPU Graphics HBW Memory and Caches Big Data High SW innovation (perf+fidelity) SciVis analysis growth 1000’s - 10,000’s+ “
  • 13. 13 So what does this mean for Visual Analysis and Workflows?
  • 14. Example: EPFL Blue Brain Project “Brayns” + OSPRay Brain Neuron Growth 3D Visualization Web-Based GUI Large Display or Display Wall Output Real-time Simulation of Brain Neuron Formation and Electrical Impulses ~70 GB of Input Data @ 15-20fps; Shown using 4 Knights Landing Processors @ ISC’16 Intel Booth IMPOSSIBLE FOR A TETHERED GPU WITH LIMITED MEMORY AND Bandwidth!
  • 15. Example:GR-Chombo Black Hole Collision Stephen Hawking CTC LIGO Black Hole Collision simulation and resulting Gravitational Waves 1200 Timesteps over 0.2s – Collide @ ‘Speed of Light’ 3.6 TB Postprocessed (from 36 TB) Dataset IMPOSSIBLE FOR A TETHERED GPU WITH LIMITED MEMORY AND Bandwidth!
  • 17. 17 Refresher – Why SDVis and What is it?
  • 18. Our Vision: Scalable, Flexible Vis Rendering that Runs Anywhere! Standalone Laptops or Workstations Big Memory Nodes or Rendering Focused Clusters Large Compute+Vis clusters with Local or Remote Clients Cloud or Network
  • 19. How? Intel® SSF and Intel-Supported Software Defined Visualization (SDVis)! Embree -  CPU Optimized Ray Tracing Algorithms -  ‘Tool kit’ for Building Ray Tracings Apps -  Broadly Adopted by 3rd Party ISVs -  More at http://embree.github.io OSPRay -  Rendering Engine Based on Embree -  API Designed to Ease Creation of Visualization Software -  More at http://ospray.org OpenSWR -  High Performance CPU Vis Rasterization -  Fully Integrated into MESA v12.0+ -  Supports ParaView, Visit, VTK, EnSight, VL3 -  More at http://mesa3d.org Standard OpenGL Image Image Rendered by OSPRay HW Visualization Software Defined Visualization
  • 20. Addressing Large-scale, High Performance, and High Fidelity Visualization with SDVis Gain deeper understanding of data impacting science & discovery High fidelity, more realistic images even as data sets become increasingly larger, and more complex; no need to compromise data resolution Solve computing + modeling problem together (in-situ vis) Essential SW development suite that makes concurrent simulation and visualization efficient – users can work interactively and get results quicker One system Use same system for both simulation and visualization, avoid data transfer delays and memory size constraints – faster insights for solving toughest problems
  • 21. Your Work. No Compromises. Benefits of SDVis •  Open-sourced technology delivering vivid visualization of complex, enormous data sets •  Innovative software libraries for visualizing results with high performance by unlocking the parallelism already in your system •  High-fidelity images for gaining deeper insights in science and industry, faster •  Software Only solution lowers costs – no card cost, no card maintenance, lower power bills Magnetic Reconnection Model, Courtesy Bill Duaghton(LANL) and Berc Geveci(Kitware) Gravational Waves : GR-Chombo AMR Data, Stephen Hawking CTC, UCambridge; Queens College, London; visualization, Carson Brownlee, Intel, ParaView) Ribosome: Data: Max-Planck Institute for Biophysical Chemistry
  • 22. Hi-Fidelity Visualization with… §  … scalable image quality §  … scalable model size §  … scalable in rendering cost Data set provided by Florida International University
  • 23. OpenSWR Software Rasterizer (www.mesa3d.org www.openswr.org) •  High performance open source software implementation of OpenGL* rasterizer ‑  Fully multi-threaded and vectorized for Intel® processors ‑  Can access full system memory - highest resolution data ‑  Leverages community development effort (MESA) •  Drop in replacement for OpenGL library •  Available since July’16 in Mesa v12.0+ targeting features and performance for leading ScIVis Apps VL3
  • 24. Ray Tracing Foundation: Embree Ray Tracing Kernel Library 24 Provides highly optimized and scalable ray tracing kernels §  Acceleration structure build and ray traversal §  Single Ray, Ray Packets(4,8,16), Ray Streams(N) Targets up to photorealistic professional and scientific rendering applications Highest ray tracing performance on CPUs §  1.5–6× speedup reported by users Support for latest CPUs / ISAs §  Intel® Xeon Phi™ Processor (codenamed Knights Landing) – AVX-512 API for easy integration into applications Free and open source under Apache 2.0 license §  http://embree.github.com 24
  • 25. Performance: Embree vs. NVIDIA* OptiX* (Pascal) 0 5 10 15 20 25 30 35 40 45 50 Bentley (2.3M Tris) Crown (4.8M Tris) Dragon (7.4M Tris) Karst Fluid Flow (8.4M Tris) Power Plant (12.8M Tris) Intel® Xeon® Processor E5-2699 v4 2 x 22 cores, 2.2 GHz Intel® Xeon Phi™ Processor 7250 68 cores, 1.4 GHz NVIDIA TITAN X (Pascal) Coprocessor 12 GB RAM Frames Per Second (Higher is Better), 1024x1024 image resolution Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark* and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance. Embree 2.12.0, ICC 2016 Update 1, Intel® SPMD Program Compiler (Intel® ISPC) 1.9.1 NVIDIA* OptiX* 4.0.1, CUDA* 8.0.44 Source: Intel
  • 26. 26 Intel Path Tracer Renderer Sample Application: Path Tracer Renderer using Embree and OptiX* RT Libraries Description: Path tracing app for use in benchmarking RT libraries Availability: §  Code: embree.github.io §  Recipe: embree.github.io. Usage Model: §  TBB, ISPC and Intrinsics Highlights: §  The code represents a typical ray tracing rendering pipeline used throughout DCC to show comparative performance on different types of hardware with a variety of input 3D data models. It has been optimized for all Intel ISA;s and delivered to the community and available for download from embree.github.io §  End-User benefits: Ability to achieve competitive performance and the flexibility of IA for rendering and render farm applications Results: §  Embree on dual socket (44 cores total) Intel® Xeon® E5-2699 v4 Processor performs 20% - 30% faster than Intel® Xeon Phi™ 7250 Processor –  Path tracing causes low SIMD utilization (better for Xeon) •  Embree on dual socket (44 cores total) Intel® Xeon® E5-2699 v4 Processor more than 2x faster than OptiX on NVIDIA TITAN X (Pascal) GPU. –  Path tracing causes low SIMD utilization (better for Xeon) –  Large path tracing kernel requires many registers (thus fewer threads executed on GPU) *  Other  names  and  brands  may  be  claimed  as  the  property  of  others.   Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. See benchmark tests and configurations in the speaker notes. For more information go to http://www.intel.com/performance 0 5 10 15 20 25 30 35 40 45 50 Bentley (2.3M Tris) Crown (4.8M Tris) Dragon (7.4M Tris) Karst Fluid Flow (8.4M Tris) Power Plant (12.8M Tris) Intel® Xeon® Processor E5-2699 v4 2 x 22 cores, 2.2 GHz Intel® Xeon Phi™ Processor 7250 68 cores, 1.4 GHz NVIDIA TITAN X (Pascal) Coprocessor 12 GB RAM Frames Per Second (Higher is Better), 1024x1024 image resolution
  • 27. Diffuse Path Tracing Performance: Embree vs. NVIDIA* OptiX* Prime 0 20 40 60 80 100 120 140 160 180 Mazda (5.7M Tris) Villa (37.7M Tris) Art Deco (10.7M Tris) Power Plant (12.8M Tris) San Miguel (10.5M Tris) Intel® Xeon® Processor E5-2699 v4 2 x 22 cores, 2.2 GHz Intel® Xeon Phi™ Processor 7250 68 cores, 1.4 GHz NVIDIA TITAN X (Pascal) Coprocessor 12 GB RAM Million Rays Per Second (Higher is Better), 3840x2160 image resolution Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark* and MobileMark*, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance. Embree 2.12.0, Intel® C++ Compiler 17.0 NVIDIA* OptiX* Prime 4.0.1, CUDA* 8.0.44 Source: Intel
  • 28. Embree Adoption* 28 Image rendered with FluidRay RT Rendered with StingRay, SURVICE Engineering Courtesy of Jeff Patton, Rendered with Corona Renderer pCon.planner rendered courtesty EasternGraphics *Many other announced users incl.: Pixar, Weta Digital, Activision, Chaos V-Ray, Ready At Dawn, FrostBite, EpicGames UnReal, High Moon, Blue Sky, UBISoft MP, Framestore,….
  • 29. 29 §  Build on top of Embree; Launched June 2016 §  Scalable Visualization targeted features –  Surfaces (both polygonal and non- polygonal) –  Volumes, and volume rendering –  High-Fidelity rendering/shading methods –  Scalable Cluster Wide Rendering §  Packed it up in an ‘easy-to-use’ rendering library for visualization –  Same "spirit" as OpenGL, but different API VL3 Brayns PowerCT OSPRay: A Ray-Tracing based Rendering Engine for High-Fidelity Visualization NASA
  • 31. ParaView v5.2 with integrated OSPRay and OpenSWR 31 •  Brain Tumor monitoring and treatment •  3D interactive @ 10-20fps •  Intel® Xeon Phi™ processor cluster •  Ambient occlusion plus shadows •  Stop by the Intel SC’16 booth to see it live! •  Data courtesy Kitware. Visualization, Carson Brownlee, Intel
  • 32. 32 NASA – Custom OSPRay App •  Simulated on Pleiades supercomputer •  Rendered on attached ‘hyperwall’ cluster •  Dataset: SRB booster separation from SLS Simulation: Jeff Onufer and Tom Pulliam, NASA Ames Visualization: Tim Sandstrom and Pat Moran, NASA Ames
  • 33. 33 Stephen Hawking Centre for Theoretical Cosmology – ParaView / VTK with OSPRay Gravational Waves : GR-Chombo AMR Data, Stephen Hawking CTC, UCambridge; Queens College, London; visualization, Carson Brownlee, Intel) •  600 GB Memory Footprint •  36 TB Simulation Data Set •  4 Intel® Xeon Phi™ 7230 Processors •  1 Intel® Xeon® E5 v4 Dual Socket node •  Intel® Omni-Path Fabric •  ~10 fps •  See a demo in the SC’16 Intel “Discovery Zone”
  • 34. 34 Stephen Hawking Centre for Theoretical Cosmology – ‘Walls’ in situ with OSPRay Rendering •  10 TB Memory Footprint •  SGI UV-300 16TB SMP •  >1000 Shared memory Intel® Xeon® E5 v3 processors •  ~15 fps •  Domain Wall formation in the universe from Big Bang to today (13.8 billion years) •  Simulation code by Shellard et al, Visualizaiton by Johannes Gunther (Intel)
  • 35. 35 Argonne VL3 Distributed Volume Renderer VL3 with Mesa v13.0 with OpenSWR VL3 Compositing with OSPRay Visualizations: Silvio Rizzi, Joe Insley: Argonne; Aaron Knoll: SCI @ UUtah
  • 37. SDVis Track Schedule (SATURDAY) 37 Technical Sessions   SW Visualization (Powder Mountain)       Lab Sessions   Lab Room 3 (Capacity 50)   Start   End   Technical Sessions       Start   End   Hands On Lab   12:00 PM   1:00 PM   Registration Opens   1:00 PM   1:50 PM   Welcome Kick Off   1:50 PM   2:05 PM   Break   2:05 PM   2:55 PM   Talk 1 - SDVis Update (Jim Jeffers) Talk 2 – OpenSWR Update (Jeff Amstutz)     2:05 PM   3:30 PM   2:55 PM   3:10 PM    Break       3:10 PM   4:00 PM   Talk 1 - OSPRay 1.0 and Beyond (Jeff A, Intel) Talk 2 - MPI Data-Parallel Rendering w/OSPRay (Carson B, Intel)         3:30 PM   3:40PM   Break       3:40PM   4:30PM   Software Defined Visualization : Getting the most out of ParaView OSPRay (Paul A. Navrátil & David E. DeMarle, Kitware) 4:00 PM   4:15 PM     Break       4:15 PM   5:05 PM   Talk 1 - Realizing Multi-Hit Ray Tracing in Embree and OSPRay (Christiaan Gribble, Intel/SURVICE) Talk 2 - Visualization w/Visit on Knights Landing (Jian Huang & Hank Childs, UOregon / UTennessee)         4:30PM   5:05PM  
  • 38. SDVis Track Schedule (SUNDAY) 38 Technical Sessions   SW Visualization (Powder Mountain)       Lab Sessions   Lab Room 2 (Capacity 25)   7:00 AM   9:00 AM   Registration andBreakfast   8:45 AM   9:30 AM   Keynote   9:30 AM   9:45 AM   Break   9:45 AM   10:35 AM   Talk 1 - SDVis Efforts @ Intel® PCC Aaron Knoll, Unv. Of Utah) Talk 2 - OSPRay Integration into Pcon-Planner (Caglar Özgür & Frank Wicht, Eastern Graphics) 9:45 AM   10:35 AM   Software Defined Visualization : Getting the most out of ParaView OSPRay (Kitware)  10:35 AM   10:50 AM   Break   10:35 AM   11:25AM   10:50 AM   11:40 AM   Bio-Molecular Vis on Knight Landing (John Stone, UIUC) 11:25 AM   11:40AM   Break   11:40 AM   1:00 PM   Lunch time Panel   11:40 AM   1:00 PM   Lunch time Panel   1:00 PM   1:50 PM   Paraview & VTK w/OSPRay and OpenSWR (David DeMarle, Kitware) 1:00 PM   2:00PM   1:50 PM   2:05 PM   Break   2:05 PM   2:55PM   SDVIs and In-Situ Visualization on TACC's Stampede (Paul Navratil) 2:00PM   2:40 AM   2:40AM   2:50PM   Break   2:55 PM   3:10 PM    Break   2:50PM   4:00 PM   3:10 PM   4:00 PM   Live Demos and Open Discussion on Software Defined Visualization (All Vis Track Presenters) 4:00 PM   4:15 PM   Break   4:15 PM   4:45 PM   Closing Keynote   7:00 PM   10:00 PM   Intel® Networking Reception  
  • 39. SC’16 Software Defined Visualization Demos Intel Main Booth (#1819): Intel® SSF Cluster (Intel® Xeon Phi™ Processors, Intel Xeon® v4 Processors, Intel® Omni- Path Fabric, Intel® HPC Orchestrator, Intel® Lustre •  1) ParaView v5.2 w/OSPRay&OpenSWR: Brain Tumor Analysis •  2) VMD v1.9.x w/OSPRay: Cryo-EM Reconstruction with ROME •  3) VMD v1.9.x w/OSPRay: LAMMPS for Cancer Research Intel Discover Zone (#2121) – Intel® Xeon Phi™ Processor DAPs •  Argonne VL3 w/OSPRay: HACC Dark Matter Analysis •  ParaView v5.2 w/OSPRay: Stephen Hawking CTC – Ligo based Black Hole collision Partner Booths •  Dell, SuperMicro, Kitware, NASA, Univ of Utah, NCSA, … 39
  • 40. SCI-X Open House Univ. of Utah 40 Weds  Nov.  16   1:00  p.m.  -­‐  7:00  p.m.     1-­‐5  p.m.:  Open  House:  Cont.  Buses     between  the  Salt  Palace  and  the   University  (10  minutes  each  way).     5  p.m.:  Keynote  presentaKon  by  Jim   Clark  -­‐  Warnock  Engineering  Building   L104  (overflow  -­‐  WEB  2230)     6:00  p.m.  -­‐  RecepKon  -­‐  Catmull   Gallery  
  • 41. Summary: Software Defined Visualization www.sdvis.org •  Addresses ever-growing HPC challenges for data size, flexibility, reliability and maintainability •  OpenSWR, OSPRay and Embree rendering libraries optimize use of CPUs and main memory •  Integrating into prominent Vis tools, ParaView*, VisIt, EnSight*, VMD, Brayns, VL3, and more …. •  All freely available (Open Source), developed and maintained by Intel SDVis = Performance, Fidelity and Lower Cost!!
  • 45. Legal Notices and Disclaimers Intel technologies’ features and benefits depend on system configuration and may require enabled hardware, software or service activation. Performance varies depending on system configuration. No computer system can be absolutely secure. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. Copyright © 2016 Intel Corporation. All rights reserved. Intel, Intel Inside, the Intel logo, Intel Xeon and Intel Xeon Phi are trademarks of Intel Corporation in the United States and other countries. *Other names and brands may be claimed as the property of others. Copyright © 2016 Intel Corporation, All Rights Reserved 45