4
Most read
7
Most read
14
Most read
Graphics processing units - powerful, programmable, and highly parallel - are increasingly targeting general-purpose computing applications. GPU ComputingPresented By:Khan Muhammad Nafee Mostafa0507007, Dept of CSE, KUET
GPU ComputingJ. D. OwensM. HoustonD. LuebkeS. GreenJ. E. StoneJ. C. PhillipsProceedings of the IEEE | Vol 96, No. 5 | May 2008We would be concentrating on,What is GPU ComputingWhy GPU ComputingGPU Architecture and EvolutionGPU Computing ModelSoftware Environment Future
GPU for General Purpose ComputingWhat is GPU Computing ?
What is GPU Computing ?GPU computing is the use of a GPU to do general purpose scientific and engineering computingCPU and GPU together in a heterogeneous computing model.Sequential part of the application runs on the CPU and the computationally-intensive part runs on the GPU. From the user’s perspective, the application just runs faster because it is using the high-performance of the GPU to boost performance.
Over the past few years, the GPU has evolved from a fixed-function special-purpose processor into a full-fledged parallel programmable processor with additional fixed-function special-purpose functionalityWhy GPU Computing…
GPU for Non-Graphic AppsThe GPU is designed for a particular class of applications with the following characteristics,Computational requirements are largeParallelism is substantialThroughput is more important than latencya growing community has identified other applications with similar characteristics and successfully mapped these applications onto the GPU
GPU extends its hand towards CPU for performanceParallelism is the future of computingMany applications have to process huge set of data following same functionsSeveral stream processors can execute same  set of instructions on different data sets and give a higher throughput  If GPU take some share of computation load from CPU, many applications can be benefitted in speed-up
GPU is now turned into a programmable engineGPU Architecture and Evolution
GPU PipelineAvailable operations are configurable but not programmable
Evolution…
All GPU programs must be structured in this way: many parallel elements, each processed in parallel by a single programGPU Computing Model
Computing on the GPUProgramming a GPU for Graphicsprogrammer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
Computing on the GPUProgramming a GPU for GraphicsProgramming a GPU for General-Purpose Programs (Old)programmer specifies geometric primitive covering computation domain of interest; rasterizer generates fragmentEach fragment is shaded by an SPMD general purpose FPFP computes the fragment by a combination of math operations and ‘gather’ accesses from global memory. resulting buffer can be used as an input on future passes. programmer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
Computing on the GPUProgramming a GPU for General-Purpose Programs (New)programmer directly defines the computation domain of interest as a structured grid of threadsSPMD general-purpose program computes each threadeach thread is computed by a combination of math  operations and both ‘gather’ (read) accesses from and ‘scatter’ (write) accesses to global memory; (same buffer can be used for both allowing more flexible algorithms)resulting buffer in global memory can then be used as an input in future computation
Software Environments
Software EnvironmentsBrookGPUMicrosoft’s AcceleratorVendor Specific GPGPU systemsAMD ATI’s CTM (Close to the Metal)NVIDIA’s CUDA (Compute Unified Device Architecture)
Scan performance on CPU, graphics-based GPU (using OpenGL), and direct-compute GPU (using CUDA). Results obtained on a GeForce 8800 GTX GPU and Intel Core2-Duo Extreme 2.93 GHz CPU. (Figure adapted from Harris et al.)Scan performance on CPU, OpenGL and CUDA
Future…
Concluding for bright Future…support for double-precision floating-pointhigher bandwidth path between CPU and GPU (like ATI’s HyperTransport)more tightly coupled CPU and GPU (AMD’s fusion or nVidianForce)NVIDIA Quadro for Multiple GPU CollaborationFinally, let us wait for new era when GPU Computing will rule
Thank YouI would also like to thank,

More Related Content

PDF
CPU vs. GPU presentation
PDF
Introduction to Parallel Computing
PPTX
Nvidia (History, GPU Architecture and New Pascal Architecture)
PDF
GPU - An Introduction
PPTX
Graphics processing unit (GPU)
PPTX
Cuda Architecture
PPTX
graphics processing unit ppt
PPTX
CPU vs. GPU presentation
Introduction to Parallel Computing
Nvidia (History, GPU Architecture and New Pascal Architecture)
GPU - An Introduction
Graphics processing unit (GPU)
Cuda Architecture
graphics processing unit ppt

What's hot (20)

PPTX
Graphics processing unit ppt
PDF
GPU - Basic Working
PPTX
Multiprocessor architecture
PPT
isa architecture
DOCX
Parallel computing persentation
PPTX
Heterogeneous computing
PPTX
Amoeba distributed operating System
PDF
Parallel Algorithms
PDF
Gpu Systems
PPT
Pipeline hazards in computer Architecture ppt
PPTX
Multi Processors And Multi Computers
PPTX
PPT
Computer architecture
PPTX
Processor powerpoint
PPT
Blue gene technology
PDF
Hadoop Ecosystem
PPTX
Knowledge representation
PPTX
CPU vs GPU Comparison
PDF
Centralized shared memory architectures
Graphics processing unit ppt
GPU - Basic Working
Multiprocessor architecture
isa architecture
Parallel computing persentation
Heterogeneous computing
Amoeba distributed operating System
Parallel Algorithms
Gpu Systems
Pipeline hazards in computer Architecture ppt
Multi Processors And Multi Computers
Computer architecture
Processor powerpoint
Blue gene technology
Hadoop Ecosystem
Knowledge representation
CPU vs GPU Comparison
Centralized shared memory architectures
Ad

Viewers also liked (20)

PPT
Graphics Processing Unit - GPU
PPTX
GRAPHICS PROCESSING UNIT (GPU)
PPTX
Graphics processing unit (gpu)
PPT
Gpu presentation
PDF
Introduction to GPU Programming
PPT
Parallel computing with Gpu
PDF
GPU Programming
PDF
Example Application of GPU
PPTX
Graphic Processing Unit (GPU)
PDF
GPU Computing for Data Science
PDF
Automatically Defined Functions for Learning Classifier Systems
PPTX
The Effect of Heat on a GPU
PPTX
GPU Computing: A brief overview
PPTX
Graphics processing unit
PPTX
Graphics Processing Unit by Saurabh
PPTX
【セミナー資料】ソーシャル×ビッグデータ×Biで切り開くこれからの企業のあり方
PDF
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
PDF
Jug gpgpu
PPTX
GPU Computing
Graphics Processing Unit - GPU
GRAPHICS PROCESSING UNIT (GPU)
Graphics processing unit (gpu)
Gpu presentation
Introduction to GPU Programming
Parallel computing with Gpu
GPU Programming
Example Application of GPU
Graphic Processing Unit (GPU)
GPU Computing for Data Science
Automatically Defined Functions for Learning Classifier Systems
The Effect of Heat on a GPU
GPU Computing: A brief overview
Graphics processing unit
Graphics Processing Unit by Saurabh
【セミナー資料】ソーシャル×ビッグデータ×Biで切り開くこれからの企業のあり方
FAST AND EFFICIENT IMAGE COMPRESSION BASED ON PARALLEL COMPUTING USING MATLAB
Jug gpgpu
GPU Computing
Ad

Similar to GPU Computing (20)

PDF
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
PPTX
GPGPU programming with CUDA
PDF
Graphics Processing Unit: An Introduction
PDF
Image Processing Application on Graphics processors
PDF
Volume 2-issue-6-2040-2045
PDF
Volume 2-issue-6-2040-2045
PPT
FIR filter on GPU
PDF
VisionizeBeforeVisulaize_IEVC_Final
PDF
High Performance Medical Reconstruction Using Stream Programming Paradigms
PDF
PDF
PPT
Achieving Improved Performance In Multi-threaded Programming With GPU Computing
PDF
IRJET-A Study on Parallization of Genetic Algorithms on GPUS using CUDA
PDF
GPU Programming with Java
PPTX
Revisiting Co-Processing for Hash Joins on the Coupled Cpu-GPU Architecture
PPTX
GPU in Computer Science advance topic .pptx
PPT
NVIDIA CUDA
PDF
Gpu application in cuda memory
PDF
GPU Computing: An Introduction
PPTX
Stream Processing
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
GPGPU programming with CUDA
Graphics Processing Unit: An Introduction
Image Processing Application on Graphics processors
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
FIR filter on GPU
VisionizeBeforeVisulaize_IEVC_Final
High Performance Medical Reconstruction Using Stream Programming Paradigms
Achieving Improved Performance In Multi-threaded Programming With GPU Computing
IRJET-A Study on Parallization of Genetic Algorithms on GPUS using CUDA
GPU Programming with Java
Revisiting Co-Processing for Hash Joins on the Coupled Cpu-GPU Architecture
GPU in Computer Science advance topic .pptx
NVIDIA CUDA
Gpu application in cuda memory
GPU Computing: An Introduction
Stream Processing

More from Khan Mostafa (14)

PDF
Graph-based Analysis and Opinion Mining in Social Network
PDF
Research in the Computing Industry
PDF
Semantic matchmaking Local Closed-World Reasoning
PDF
Survey on real media paint simulation in Computer Graphics
PDF
Seminal works on watercolor painting simulation
PDF
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
PDF
Project Presentation: Graph-based Analysis and Opinion Mining in Social Network
PDF
A Survey on Sentiment Mining Techniques
PPTX
The Career (CSE)
PPTX
RDF by Structured Reference to Semantics, the RS2 framework
PDF
Study Tour (KUET CSE 2k5) Poster
PDF
Traffic Jam Detection System by Ratul, Sadh, Shams
PPTX
Open Document Format
PPTX
An Approach To Emerge Web 3.0
Graph-based Analysis and Opinion Mining in Social Network
Research in the Computing Industry
Semantic matchmaking Local Closed-World Reasoning
Survey on real media paint simulation in Computer Graphics
Seminal works on watercolor painting simulation
Reaction Paper Discussing Articles in Fields of Outlier Detection & Sentiment...
Project Presentation: Graph-based Analysis and Opinion Mining in Social Network
A Survey on Sentiment Mining Techniques
The Career (CSE)
RDF by Structured Reference to Semantics, the RS2 framework
Study Tour (KUET CSE 2k5) Poster
Traffic Jam Detection System by Ratul, Sadh, Shams
Open Document Format
An Approach To Emerge Web 3.0

Recently uploaded (20)

PDF
Technical Debt in the AI Coding Era - By Antonio Bianco
PDF
TicketRoot: Event Tech Solutions Deck 2025
PPTX
From XAI to XEE through Influence and Provenance.Controlling model fairness o...
PDF
【AI論文解説】高速・高品質な生成を実現するFlow Map Models(Part 1~3)
PPTX
AQUEEL MUSHTAQUE FAKIH COMPUTER CENTER .
PDF
Introduction to c language from lecture slides
PDF
Child-friendly e-learning for artificial intelligence education in Indonesia:...
PDF
“Introduction to Designing with AI Agents,” a Presentation from Amazon Web Se...
PDF
Optimizing bioinformatics applications: a novel approach with human protein d...
PPTX
Information-Technology-in-Human-Society.pptx
PDF
Decision Optimization - From Theory to Practice
PDF
Advancements in abstractive text summarization: a deep learning approach
PDF
Fitaura: AI & Machine Learning Powered Fitness Tracker
PDF
Human Computer Interaction Miterm Lesson
PPTX
Rise of the Digital Control Grid Zeee Media and Hope and Tivon FTWProject.com
PDF
Examining Bias in AI Generated News Content.pdf
PDF
Peak of Data & AI Encore: Scalable Design & Infrastructure
PDF
FASHION-DRIVEN TEXTILES AS A CRYSTAL OF A NEW STREAM FOR STAKEHOLDER CAPITALI...
PDF
ELLIE29.pdfWETWETAWTAWETAETAETERTRTERTER
PPTX
maintenance powerrpoint for adaprive and preventive
Technical Debt in the AI Coding Era - By Antonio Bianco
TicketRoot: Event Tech Solutions Deck 2025
From XAI to XEE through Influence and Provenance.Controlling model fairness o...
【AI論文解説】高速・高品質な生成を実現するFlow Map Models(Part 1~3)
AQUEEL MUSHTAQUE FAKIH COMPUTER CENTER .
Introduction to c language from lecture slides
Child-friendly e-learning for artificial intelligence education in Indonesia:...
“Introduction to Designing with AI Agents,” a Presentation from Amazon Web Se...
Optimizing bioinformatics applications: a novel approach with human protein d...
Information-Technology-in-Human-Society.pptx
Decision Optimization - From Theory to Practice
Advancements in abstractive text summarization: a deep learning approach
Fitaura: AI & Machine Learning Powered Fitness Tracker
Human Computer Interaction Miterm Lesson
Rise of the Digital Control Grid Zeee Media and Hope and Tivon FTWProject.com
Examining Bias in AI Generated News Content.pdf
Peak of Data & AI Encore: Scalable Design & Infrastructure
FASHION-DRIVEN TEXTILES AS A CRYSTAL OF A NEW STREAM FOR STAKEHOLDER CAPITALI...
ELLIE29.pdfWETWETAWTAWETAETAETERTRTERTER
maintenance powerrpoint for adaprive and preventive

GPU Computing

  • 1. Graphics processing units - powerful, programmable, and highly parallel - are increasingly targeting general-purpose computing applications. GPU ComputingPresented By:Khan Muhammad Nafee Mostafa0507007, Dept of CSE, KUET
  • 2. GPU ComputingJ. D. OwensM. HoustonD. LuebkeS. GreenJ. E. StoneJ. C. PhillipsProceedings of the IEEE | Vol 96, No. 5 | May 2008We would be concentrating on,What is GPU ComputingWhy GPU ComputingGPU Architecture and EvolutionGPU Computing ModelSoftware Environment Future
  • 3. GPU for General Purpose ComputingWhat is GPU Computing ?
  • 4. What is GPU Computing ?GPU computing is the use of a GPU to do general purpose scientific and engineering computingCPU and GPU together in a heterogeneous computing model.Sequential part of the application runs on the CPU and the computationally-intensive part runs on the GPU. From the user’s perspective, the application just runs faster because it is using the high-performance of the GPU to boost performance.
  • 5. Over the past few years, the GPU has evolved from a fixed-function special-purpose processor into a full-fledged parallel programmable processor with additional fixed-function special-purpose functionalityWhy GPU Computing…
  • 6. GPU for Non-Graphic AppsThe GPU is designed for a particular class of applications with the following characteristics,Computational requirements are largeParallelism is substantialThroughput is more important than latencya growing community has identified other applications with similar characteristics and successfully mapped these applications onto the GPU
  • 7. GPU extends its hand towards CPU for performanceParallelism is the future of computingMany applications have to process huge set of data following same functionsSeveral stream processors can execute same set of instructions on different data sets and give a higher throughput If GPU take some share of computation load from CPU, many applications can be benefitted in speed-up
  • 8. GPU is now turned into a programmable engineGPU Architecture and Evolution
  • 9. GPU PipelineAvailable operations are configurable but not programmable
  • 11. All GPU programs must be structured in this way: many parallel elements, each processed in parallel by a single programGPU Computing Model
  • 12. Computing on the GPUProgramming a GPU for Graphicsprogrammer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
  • 13. Computing on the GPUProgramming a GPU for GraphicsProgramming a GPU for General-Purpose Programs (Old)programmer specifies geometric primitive covering computation domain of interest; rasterizer generates fragmentEach fragment is shaded by an SPMD general purpose FPFP computes the fragment by a combination of math operations and ‘gather’ accesses from global memory. resulting buffer can be used as an input on future passes. programmer specifies geometry covering a screen region; rasterizer generates a fragment at each pixel locationEach fragment is shaded by the fragment program (FP).FP computes the fragment by a combination of math operations and global memory readsresulting image can be used as texture on future passes.
  • 14. Computing on the GPUProgramming a GPU for General-Purpose Programs (New)programmer directly defines the computation domain of interest as a structured grid of threadsSPMD general-purpose program computes each threadeach thread is computed by a combination of math operations and both ‘gather’ (read) accesses from and ‘scatter’ (write) accesses to global memory; (same buffer can be used for both allowing more flexible algorithms)resulting buffer in global memory can then be used as an input in future computation
  • 16. Software EnvironmentsBrookGPUMicrosoft’s AcceleratorVendor Specific GPGPU systemsAMD ATI’s CTM (Close to the Metal)NVIDIA’s CUDA (Compute Unified Device Architecture)
  • 17. Scan performance on CPU, graphics-based GPU (using OpenGL), and direct-compute GPU (using CUDA). Results obtained on a GeForce 8800 GTX GPU and Intel Core2-Duo Extreme 2.93 GHz CPU. (Figure adapted from Harris et al.)Scan performance on CPU, OpenGL and CUDA
  • 19. Concluding for bright Future…support for double-precision floating-pointhigher bandwidth path between CPU and GPU (like ATI’s HyperTransport)more tightly coupled CPU and GPU (AMD’s fusion or nVidianForce)NVIDIA Quadro for Multiple GPU CollaborationFinally, let us wait for new era when GPU Computing will rule
  • 20. Thank YouI would also like to thank,