Adjusting primitives for graph : SHORT REPORT / NOTES

0 likes14 views

The document discusses performance evaluations of various graph algorithms, specifically focusing on vector operations such as multiplication and summation using different execution modes including sequential, OpenMP, and CUDA. It compares the performance of using different storage types like float and bfloat16, as well as various CUDA launch configurations. Additionally, it explores strategies for in-place operations and their impact on performance metrics.

Data & Analytics

Adjusting primitives for graph
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list
based graph representation that is
Multiply with different modes (map)
Sequential OpenMP CUDA
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
float bfloat16
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
Sequential OpenMP CUDA (memcpy, in-place)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
sum-loop sum-reduce
one-loop atomic-add
block-loop template, next-pow2 launch one-reduce, next-pow2 launch
block-loop template, prev. pow2 launch one-reduce, prev-pow2 launch
grid-loop
1. Comparing various launch configs for CUDA based vector element sum (in-place).

More Related Content

Similar to Adjusting primitives for graph : SHORT REPORT / NOTES (7)

PDF

Massive parallelism with gpus for centrality ranking in complex networksijcsit

PDF

[2D3]TurboGraph- Ultrafast graph analystics engine for billion-scale graphs i...NAVER D2

PPTX

Semantic Data Management in Graph Databases: ESWC 2014 TutorialMaribel Acosta Deibe

PDF

“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...Edge AI and Vision Alliance

PDF

Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu

PDF

Advances in GPU ComputingFrédéric Parienté

PDF

Bryan Thompson, Chief Scientist and Founder at SYSTAP, LLC at MLconf NYCMLconf

Massive parallelism with gpus for centrality ranking in complex networksijcsit

[2D3]TurboGraph- Ultrafast graph analystics engine for billion-scale graphs i...NAVER D2

Semantic Data Management in Graph Databases: ESWC 2014 TutorialMaribel Acosta Deibe

“ONNX and Python to C++: State-of-the-art Graph Compilation,” a Presentation ...Edge AI and Vision Alliance

Adjusting OpenMP PageRank : SHORT REPORT / NOTESSubhajit Sahu

Advances in GPU ComputingFrédéric Parienté

Bryan Thompson, Chief Scientist and Founder at SYSTAP, LLC at MLconf NYCMLconf

More from Subhajit Sahu (20)

PDF

About TrueTime, Spanner, Clock synchronization, CAP theorem, Two-phase lockin...Subhajit Sahu

PDF

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu

PDF

Adjusting Bitset for graph : SHORT REPORT / NOTESSubhajit Sahu

PDF

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Subhajit Sahu

PDF

Algorithmic optimizations for Dynamic Monolithic PageRank (from STICD) : SHOR...Subhajit Sahu

PDF

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

PDF

DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTESSubhajit Sahu

PDF

Shared memory Parallelism (NOTES)Subhajit Sahu

PDF

A Dynamic Algorithm for Local Community Detection in Graphs : NOTESSubhajit Sahu

PDF

Scalable Static and Dynamic Community Detection Using Grappolo : NOTESSubhajit Sahu

PDF

Application Areas of Community Detection: A Review : NOTESSubhajit Sahu

PDF

Community Detection on the GPU : NOTESSubhajit Sahu

PDF

Survey for extra-child-process package : NOTESSubhajit Sahu

PDF

Dynamic Batch Parallel Algorithms for Updating PageRank : POSTERSubhajit Sahu

PDF

Abstract for IPDPS 2022 PhD Forum on Dynamic Batch Parallel Algorithms for Up...Subhajit Sahu

PDF

Fast Incremental Community Detection on Dynamic Graphs : NOTESSubhajit Sahu

PDF

Can you ﬁx farming by going back 8000 years : NOTESSubhajit Sahu

PDF

HITS algorithm : NOTESSubhajit Sahu

PDF

Basic Computer Architecture and the Case for GPUs : NOTESSubhajit Sahu

PDF

Dynamic Batch Parallel Algorithms for Updating Pagerank : SLIDESSubhajit Sahu

About TrueTime, Spanner, Clock synchronization, CAP theorem, Two-phase lockin...Subhajit Sahu

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu

Adjusting Bitset for graph : SHORT REPORT / NOTESSubhajit Sahu

Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Subhajit Sahu

Algorithmic optimizations for Dynamic Monolithic PageRank (from STICD) : SHOR...Subhajit Sahu

word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings o...Subhajit Sahu

DyGraph: A Dynamic Graph Generator and Benchmark Suite : NOTESSubhajit Sahu

Shared memory Parallelism (NOTES)Subhajit Sahu

A Dynamic Algorithm for Local Community Detection in Graphs : NOTESSubhajit Sahu

Scalable Static and Dynamic Community Detection Using Grappolo : NOTESSubhajit Sahu

Application Areas of Community Detection: A Review : NOTESSubhajit Sahu

Community Detection on the GPU : NOTESSubhajit Sahu

Survey for extra-child-process package : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating PageRank : POSTERSubhajit Sahu

Abstract for IPDPS 2022 PhD Forum on Dynamic Batch Parallel Algorithms for Up...Subhajit Sahu

Fast Incremental Community Detection on Dynamic Graphs : NOTESSubhajit Sahu

Can you ﬁx farming by going back 8000 years : NOTESSubhajit Sahu

HITS algorithm : NOTESSubhajit Sahu

Basic Computer Architecture and the Case for GPUs : NOTESSubhajit Sahu

Dynamic Batch Parallel Algorithms for Updating Pagerank : SLIDESSubhajit Sahu

Recently uploaded (20)

PDF

What does good look like - CRAP Brighton 8 July 2025Jan Kierzyk

PDF

Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMSMD RIZWAN MOLLA

PDF

Web Scraping with Google Gemini 2.0 .pdfTamanna

PDF

Data Chunking Strategies for RAG in 2025.pdfTamanna

PPTX

apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...apidays

PDF

Building Production-Ready AI Agents with LangGraph.pdfTamanna

PPTX

Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptxlacsonjhoma0407

PPT

tuberculosiship-2106031cyyfuftufufufivifvivivAkshaiRam

PDF

Product Management in HealthTech (Case Studies from SnappDoctor)Hamed Shams

PPTX

apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...apidays

PPTX

apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...apidays

PPTX

apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)apidays

PDF

The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...Lal Chandran

PPTX

Numbers of a nation: how we estimate population statistics | Accessible slidesOffice for National Statistics

PDF

apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...apidays

PDF

Context Engineering for AI Agents, approaches, memories.pdfTamanna

PDF

How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdfTamanna

PPTX

Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...Sease

PDF

Choosing the Right Database for Indexing.pdfTamanna

PDF

Driving Employee Engagement in a Hybrid World.pdfMia scott