


30th HiPC 2023: Goa, India
- 30th IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023, Goa, India, December 18-21, 2023. IEEE 2023, ISBN 979-8-3503-8322-5

- Sunita Sarawagi:

Modern AI for Analyzing Large Structured Databases: Opportunities and Challenges. xxii - Priyanka Sharma:

High Performance and Energy Efficient Processor for Next Generation Data Centres: FUJITSU - MONAKA. xxiii - Manish Parashar:

Computing Everywhere, All at Once: Harnessing the Computing Continuum for Science. xxiv - Vittal Setty:

Addressing Exponential Scale Problems at Infosys. xxv - Bahareh Khabbazan, Marc Riera, Antonio González:

DNA-TEQ: An Adaptive Exponential Quantization of Tensors for DNN Inference. 1-10 - Gian Singh, Sanmukh R. Kuppannagari, Sarma B. K. Vrudhula:

PARAG: PIM Architecture for Real-Time Acceleration of GCNs. 11-20 - Jake Choi, Jaejin Lee, Sunchul Jung, Heon Young Yeom:

Hybrid CUDA Unified Memory Management in Fully Homomorphic Encryption Workloads. 21-30 - Shaik Jani Basha, Sandani Shaik, Nazrinbanu Nurmohammad Nagori, Veerendra Shetty:

Mobile Gaming Experience: An Approach Based on Thread Scheduler & Thread Priority Manager. 31-40 - Shulei Xu, Goutham Kalikrishna Reddy Kuncham, Mustafa Abduljabbar, Hari Subramoni, Dhabaleswar K. D. K. Panda:

Optimized All-to-All Connection Establishment for High-Performance MPI Libraries Over InfiniBand. 41-50 - Sirui Qi, Dejan S. Milojicic, Cullen E. Bash, Sudeep Pasricha:

MOSAIC: A Multi-Objective Optimization Framework for Sustainable Datacenter Management. 51-60 - Mengtian Yang, Yipeng Wang, Jaydeep P. Kulkarni:

A 118 GOPS/mm² 3D eDRAM TensorCore Architecture for Large-scale Matrix Multiplication. 61-65 - Zhihui Du, Oliver Alvarado Rodriguez, Fuhuan Li, Mohammad Dindoost, David A. Bader:

Contour Algorithm for Connectivity. 66-75 - Henk Dreuning, Kees Verstoep, Henri E. Bal, Rob V. van Nieuwpoort:

CAPTURE: Memory-Centric Partitioning for Distributed DNN Training with Hybrid Parallelism. 76-86 - Daegun Yoon, Sangyoon Oh:

MiCRO: Near-Zero Cost Gradient Sparsification for Scaling and Accelerating Distributed DNN Training. 87-96 - Robert Underwood, Meghana Madhyastha, Randal C. Burns, Bogdan Nicolae:

Understanding Patterns of Deep Learning Model Evolution in Network Architecture Search. 97-106 - Jinghan Yao, Nawras Alnaasan, Tian Chen, Aamir Shafi, Hari Subramoni, Dhabaleswar K. D. K. Panda:

Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference. 107-116 - Pu Jiao, Sheng Di, Jinyang Liu, Xin Liang, Franck Cappello:

Characterization and Detection of Artifacts for Error-Controlled Lossy Compressors. 117-126 - Prashanthi S. K, Vinayaka Hegde, Keerthana Patchava, Ankita Das, Yogesh Simmhan:

Performance Characterization of Containerized DNN Training and Inference on Edge Accelerators. 127-131 - Arham Khan, Sheng Di, Kai Zhao, Jinyang Liu, Kyle Chard, Ian T. Foster, Franck Cappello:

SECRE: Surrogate-Based Error-Controlled Lossy Compression Ratio Estimation Framework. 132-142 - Tania Banerjee, Jaemoon Lee, Jong Choi, Qian Gong, Jieyang Chen, Scott Klasky, Anand Rangarajan, Sanjay Ranka:

Fast Algorithms for Scientific Data Compression. 143-152 - Alberto Riccardo Martinelli, Massimo Torquati, Marco Aldinucci, Iacopo Colonnelli, Barbara Cantalupo:

CAPIO: a Middleware for Transparent I/O Streaming in Data-Intensive Workflows. 153-163 - Akshin Singh, Smruti R. Sarangi:

JASS: A Tunable Checkpointing System for NVM-Based Systems. 164-173 - Shashank Khobragade, Santi Gopal Mondal, Kalyan Gunda:

Multi-Streamed Metadata-Integrity Verification For Cloud Migration In Deduplication Systems. 174-178 - Mathialakan Thavappiragasam, Vivek Kale:

CPU-GPU Tuning for Modern Scientific Applications using Node-Level Heterogeneity. 179-183 - Hari Sharan, Mythili Vutukuru, Biswabandan Panda:

DDIOSim: A Microarchitecture Simulator for Data Direct I/O Technology. 184-188 - Ankit Choudhary, S. K. Vaibhav Kodavati, B. Mythili, R. V. G. Anjaneyulu, Manju Sarma M:

FPGA Accelerated Bi-Cubic Convolution for Image Interpolation. 189-193 - Shuai Yang, Changyou Zhang, Ji Ma:

DeltaSPARSE: High-Performance Sparse General Matrix-Matrix Multiplication on Multi-GPU Systems. 194-202 - Koushik Sen, Sathish Vadhiyar, P. N. Vinayachandran:

Strategies for Fast I/O Throughput in Large-Scale Climate Modeling Applications. 203-212 - Kyle Marino, Pengmiao Zhang, Viktor K. Prasanna:

ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers. 213-223 - Vinícius Vitor dos Santos Dias, Samuel Ferraz, Aditya Vadlamani, Mahdi Erfanian, Carlos H. C. Teixeira, Dorgival O. Guedes, Wagner Meira Jr., Srinivasan Parthasarathy:

Graph Pattern Mining Paradigms: Consolidation and Renewed Bearing. 224-233 - Arafath Nihar, Thomas G. Ciardi, Rounak Chawla, Olatunde Akanbi, Vipin Chaudhary, Yinghui Wu, Roger H. French:

Accelerating Time to Science using CRADLE: A Framework for Materials Data Science. 234-245 - Kevin Assogba, Bogdan Nicolae, M. Mustafa Rafique:

Optimizing the Training of Co-Located Deep Learning Models Using Cache-Aware Staggering. 246-255 - Avinash Maurya, Bogdan Nicolae, M. Mustafa Rafique, Franck Cappello:

Towards Efficient I/O Pipelines Using Accumulated Compression. 256-265 - Jan-Harm L. F. Betting, Chris I. De Zeeuw, Christos Strydis:

Oikonomos-II: A Reinforcement-Learning, Resource-Recommendation System for Cloud HPC. 266-276 - Zainul Abideen Sayed, Jaroslaw Zola:

SCoOL - Scalable Common Optimization Library. 277-287 - Satanu Maity, Mayank Goel, Manojit Ghose:

Data Locality Aware Computation Offloading in Near Memory Processing Architecture for Big Data Applications. 288-297 - Philip E. Davis, Jacob S. Merson, Pradeep Subedi, Lee F. Ricketson, Cameron W. Smith, Mark S. Shephard, Manish Parashar:

Benesh: a Framework for Choreographic Coordination of In Situ Workflows. 298-308 - Shubhradeep Roy, Suvarthi Sarkar, Aryabartta Sahu:

Profit Maximization Using Collaborative Storage Management in Multi-Tier Edge-Cloud System. 309-318 - Jiwoo Bang, Chungyong Kim, Eun-Kyu Byun, Hanul Sung, Jaehwan Lee, Hyeonsang Eom:

Towards Enhanced I/O Performance of NVM File Systems. 319-323 - Shruti Shivakumar, Ilya Amburg, Sinan G. Aksoy, Jiajia Li, Stephen J. Young, Srinivas Aluru:

Fast Parallel Tensor Times Same Vector for Hypergraphs. 324-334 - Ullas A, Rupesh Nasre, R. Govindarajan:

Reduce, Reuse, and Adapt: Accelerating Graph Processing on GPUs. 335-346 - Zhiyi Zhang, Pengfei Zhang, Zhuopin Xu, Qi Wang:

Reduce Computational Complexity for Convolutional Layers by Skipping Zeros. 347-356 - Lisheng Xie, Jianwei Xue, Liangshun Wu, Faquan Chen, Qingyang Tian, Yifan Zhou, Rendong Ying, Peilin Liu:

SpikeNC: An Accurate and Scalable Simulator for Spiking Neural Network on Multi-Core Neuromorphic Hardware. 357-366 - Anubhav Jana, Purushottam Kulkarni, Umesh Bellur:

DAGit: A Platform For Enabling Serverless Applications. 367-376 - Mohammad Zubair, Desh Ranjan, Aaron Walden, Gabriel Nastac, Eric J. Nielsen, Boris Diskin, Marc F. Paterno, Samuel Jung, Joshua Hoke Davis:

Efficient GPU Implementation of Automatic Differentiation for Computational Fluid Dynamics. 377-386 - Ajeya Bhat, Sai Manasa Chadalavada, Nagakishore Jammula, Chirag Jain, Yogesh Simmhan:

A Lossless Compression Pipeline for Petabyte-Scale Whole Genome Sequencing Data. 387-391















