


LCPC 2016: Rochester, NY, USA
- Chen Ding, John Criswell, Peng Wu:

Languages and Compilers for Parallel Computing - 29th International Workshop, LCPC 2016, Rochester, NY, USA, September 28-30, 2016, Revised Papers. Lecture Notes in Computer Science 10136, Springer 2017, ISBN 978-3-319-52708-6
Large Scale Parallelism
- Diptorup Deb, Robert J. Fowler, Allan Porterfield:

QUARC: An Array Programming Approach to High Performance Computing. 3-17
- Xian-He Sun, Yuhang Liu:

Utilizing Concurrency: A New Theory for Memory Wall. 18-23
- Sriram Aananthakrishnan, Greg Bronevetsky, Mark Baranowski, Ganesh Gopalakrishnan:

ParFuse: Parallel and Compositional Analysis of Message Passing Programs. 24-39
- Adam Fidel, Francisco Coral-Sabido, Colton Riedel, Nancy M. Amato, Lawrence Rauchwerger:

Fast Approximate Distance Queries in Unweighted Graphs Using Bounded Asynchrony. 40-54
- Kelly Livingston, Aaron Myles Landwehr, José Monsalve Diaz, Stéphane Zuckerman, Benoît Meister, Guang R. Gao:

Energy Avoiding Matrix Multiply. 55-70
Resilience and Persistence
- Saurabh Hukerikar, Christian Engelmann:

Language Support for Reliable Memory Regions. 73-87
- Aurangzeb, Rudolf Eigenmann:

Harnessing Parallelism in Multicore Systems to Expedite and Improve Function Approximation. 88-92
- Pengcheng Li, Dhruva R. Chakrabarti:

Adaptive Software Caching for Efficient NVRAM Data Persistence. 93-97
Compiler Analysis and Optimization
- Mary W. Hall, Protonu Basu:

Polyhedral Compiler Technology in Collaboration with Autotuning Important to Domain-Specific Frameworks for HPC. 101-105
- Prasanth Chatarasi, Jun Shirako, Martin Kong, Vivek Sarkar:

An Extended Polyhedral Model for SPMD Programs and Its Use in Static Data Race Detection. 106-120
- Aniket Shivam, Alexandru Nicolau, Alexander V. Veidenbaum, Mario Mango Furnari, Rosario Cammarota:

Polygonal Iteration Space Partitioning. 121-136
- Pei-Hung Lin, Qing Yi, Daniel J. Quinlan, Chunhua Liao, Yongqing Yan:

Automatically Optimizing Stencil Computations on Many-Core NUMA Architectures. 137-152
- Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann:

Formalizing Structured Control Flow Graphs. 153-168
Dynamic Computation and Languages
- Hanfeng Chen, Alexander Krolik, Erick Lavoie, Laurie J. Hendren:

Automatic Vectorization for MATLAB. 171-187
- Forest Danford, Eric Welch, Julio Cárdenas-Rodríguez, Michelle Mills Strout:

Analyzing Parallel Programming Models for Magnetic Resonance Imaging. 188-202
- Tongsheng Geng, Stéphane Zuckerman, José Monsalve Diaz, Alfredo Goldman, Sami Habib, Jean-Luc Gaudiot, Guang R. Gao:

The Importance of Efficient Fine-Grain Synchronization for Many-Core Systems. 203-217
- Khalid Ahmad, Anand Venkat, Mary W. Hall:

Optimizing LOBPCG: Sparse Matrix Loop and Data Transformations in Action. 218-232
GPUs and Private Memory
- G. Shashidhar, Rupesh Nasre:

LightHouse: An Automatic Code Generator for Graph Algorithms on GPUs. 235-249
- Jad Hbeika, Milind Kulkarni:

Locality-Aware Task-Parallel Execution on GPUs. 250-264
- Tong Chen, Zehra Sura, Hyojin Sung:

Automatic Copying of Pointer-Based Data Structures. 265-281
- Kouhei Yamamoto, Tomoya Shirakawa, Yoshitake Oki, Akimasa Yoshida, Keiji Kimura, Hironori Kasahara:

Automatic Local Memory Management for Multicores Having Global Address Space. 282-296
Run-time and Performance Analysis
- Murali Krishna Emani:

Mapping Medley: Adaptive Parallelism Mapping with Varying Optimization Goals. 299-313
- Konstantinos Sagonas, Kjell Winblad:

The Contention Avoiding Concurrent Priority Queue. 314-330
- Chenyang Liu, Milind Kulkarni:

Evaluating Performance of Task and Data Coarsening in Concurrent Collections. 331-345















