


ASPLOS 2022: Lausanne, Switzerland
- Babak Falsafi, Michael Ferdman, Shan Lu, Thomas F. Wenisch:
  ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022. ACM 2022, ISBN 978-1-4503-9205-1
Session 1A: Accelerators
- Vidushi Dadu, Tony Nowatzki:
  TaskStream: accelerating task-parallel workloads by recovering program structure. 1-13
- Zheng Qu, Liu Liu, Fengbin Tu, Zhaodong Chen, Yufei Ding, Yuan Xie:
  DOTA: detect and omit weak attentions for scalable transformer acceleration. 14-26
- Dan Zhang, Safeen Huda, Ebrahim M. Songhori, Kartik Prabhu, Quoc V. Le, Anna Goldie, Azalia Mirhoseini:
  A full-stack search technique for domain optimized deep learning accelerators. 27-42
- Qihang Chen, Boyu Tian, Mingyu Gao:
  FINGERS: exploiting fine-grained parallelism in graph mining accelerators. 43-55
- Enrico Reggiani, Cristóbal Ramírez Lazo, Roger Figueras Bagué, Adrián Cristal, Mauro Olivieri, Osman Sabri Unsal:
  BiSon-e: a lightweight and high-performance accelerator for narrow integer linear algebra computing on the edge. 56-69
Session 1B: Address and Memory
- Jialiang Zhang, Michael M. Swift, Jing Jane Li:
  Software-defined address mapping: a case on 3D memory. 70-83
- Jovan Stojkovic, Dimitrios Skarlatos, Apostolos Kokolis, Tianyin Xu, Josep Torrellas:
  Parallel virtualized memory translation with nested elastic cuckoo page tables. 84-97
- Brian Suchy, Souradip Ghosh, Drew Kersnar, Siyuan Chai, Zhen Huang, Aaron Nelson, Michael Cuevas, Alex Bernat, Gaurav Chaudhary, Nikos Hardavellas, Simone Campanoni, Peter A. Dinda:
  CARAT CAKE: replacing paging via compiler/kernel cooperation. 98-114
- Zheng Dang, Shuibing He, Peiyi Hong, Zhenxin Li, Xuechen Zhang, Xian-He Sun, Gang Chen:
  NVAlloc: rethinking heap metadata management in persistent memory allocators. 115-127
- Chang Hyun Park, Ilias Vougioukas, Andreas Sandberg, David Black-Schaffer:
  Every walk's a hit: making page walks single-access cache hits. 128-141
Session 2A: GPU and Data Analytics
- Shweta Pandey, Aditya K. Kamath, Arkaprava Basu:
  GPM: leveraging persistent memory from a GPU. 142-156
- Heejin Park, Felix Xiaozhu Lin:
  GPUReplay: a 50-KB GPU stack for client ML. 157-170
- Keren Zhou, Yueming Hao, John M. Mellor-Crummey, Xiaozhu Meng, Xu Liu:
  ValueExpert: exploring value patterns in GPU-accelerated applications. 171-185
- Gengyu Rao, Jingji Chen, Jason Yik, Xuehai Qian:
  SparseCore: stream ISA and processor specialization for sparse computation. 186-199
- Lin Jiang, Zhijia Zhao:
  JSONSki: streaming semi-structured data with bit-parallel fast-forwarding. 200-211
Session 2B: Privacy and Software Security
- Márton Erdos, Sam Ainsworth, Timothy M. Jones:
  MineSweeper: a "clean sweep" for drop-in use-after-free prevention. 212-225
- Oleksii Oleksenko, Christof Fetzer, Boris Köpf, Mark Silberstein:
  Revizor: testing black-box CPUs against speculation contracts. 226-239
- Tejas Kannan, Henry Hoffmann:
  Protecting adaptive sampling from information leakage on low-power sensors. 240-254
- Haotian Zhang, Mengfei Ren, Yu Lei, Jiang Ming:
  One size does not fit all: security hardening of MIPS embedded systems via static binary debloating for shared libraries. 255-270
- Haehyun Cho, Jinbum Park, Adam Oest, Tiffany Bao, Ruoyu Wang, Yan Shoshitaishvili, Adam Doupé, Gail-Joon Ahn:
  ViK: practical mitigation of temporal memory safety violations through object ID inspection. 271-284
Session 3A: Hardware Security (1)
- Boyuan Yang, Ruirong Chen, Kai Huang, Jun Yang, Wei Gao:
  Eavesdropping user credentials via GPU side channels on smartphones. 285-299
- Heiner Litz, Grant Ayers, Parthasarathy Ranganathan:
  CRISP: critical slice prefetching. 300-313
Session 3B: Misc.
- Zirui Neil Zhao, Houxiang Ji, Adam Morrison, Darko Marinov, Josep Torrellas:
  Pinned loads: taming speculative loads in secure processors. 314-328
- Peter W. Deutsch, Yuheng Yang, Thomas Bourgeat, Jules Drean, Joel S. Emer, Mengjia Yan:
  DAGguise: mitigating memory timing side channels. 329-343
Session 4A: Systems for Machine Learning
- Geet Sethi, Bilge Acun, Niket Agarwal, Christos Kozyrakis, Caroline Trippel, Carole-Jean Wu:
  RecShard: statistical feature-based memory optimization for industry-scale neural recommendation. 344-358
- Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin:
  AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures. 359-373
- Shixiong Zhao, Fanxin Li, Xusheng Chen, Tianxiang Shen, Li Chen, Sen Wang, Nicholas Zhang, Cheng Li, Heming Cui:
  NASPipe: high performance and reproducible pipeline parallel supernet training via causal synchronous parallelism. 374-387
- Zihan Liu, Jingwen Leng, Zhihui Zhang, Quan Chen, Chao Li, Minyi Guo:
  VELTAIR: towards high-performance multi-tenant deep learning services via adaptive compilation and scheduling. 388-401
- Abhinav Jangda, Jun Huang, Guodong Liu, Amir Hossein Nodehi Sabet, Saeed Maleki, Youshan Miao, Madanlal Musuvathi, Todd Mytkowicz, Olli Saarikivi:
  Breaking the computation and communication abstraction barrier in distributed machine learning workloads. 402-416
Session 4B: Operating System
- Zhiyuan Guo, Yizhou Shan, Xuhao Luo, Yutong Huang, Yiying Zhang:
  Clio: a hardware-software co-designed disaggregated memory system. 417-433
- David A. Cock, Abishek Ramdas, Daniel Schwyn, Michael Giardino, Adam Turowski, Zhenhao He, Nora Hossle, Dario Korolija, Melissa Licciardello, Kristina Martsenko, Reto Achermann, Gustavo Alonso, Timothy Roscoe:
  Enzian: an open, general, CPU/FPGA platform for systems software research. 434-451
- Nils Asmussen, Sebastian Haas, Carsten Weinhold, Till Miemietz, Michael Roitzsch:
  Efficient and scalable core multiplexing with M³v. 452-466
- Hugo Lefeuvre, Vlad-Andrei Badoiu, Alexander Jung, Stefan Lucian Teodorescu, Sebastian Rauch, Felipe Huici, Costin Raiciu, Pierre Olivier:
  FlexOS: towards flexible OS isolation. 467-482
- Ruslan Nikolaev, Hassan Nadeem, Cathlyn Stone, Binoy Ravindran:
  Adelie: continuous address space layout re-randomization for Linux drivers. 483-498
Session 5A: Quantum Computing
- Lei Xie, Jidong Zhai, Zhenxing Zhang, Jonathan Allcock, Shengyu Zhang, Yicong Zheng:
  Suppressing ZZ crosstalk of Quantum computers through pulse and scheduling co-optimization. 499-513
- Tirthak Patel, Ed Younis, Costin Iancu, Wibe de Jong, Devesh Tiwari:
  QUEST: systematically approximating Quantum circuits for higher output fidelity. 514-528
- Swamit S. Tannu, Poulami Das, Ramin Ayanzadeh, Moinuddin K. Qureshi:
  HAMMER: boosting fidelity of noisy Quantum circuits by exploiting Hamming behavior of erroneous outcomes. 529-540
- Poulami Das, Aditya Locharla, Cody Jones:
  LILLIPUT: a lightweight low-latency lookup-table decoder for near-term Quantum error correction. 541-553
- Gushu Li, Anbang Wu, Yunong Shi, Ali Javadi-Abhari, Yufei Ding, Yuan Xie:
  Paulihedral: a generalized block-wise compiler optimization framework for Quantum simulation kernels. 554-569
Session 5B: Data Center and Cloud Services
- Wei Zhang, Quan Chen, Kaihua Fu, Ningxin Zheng, Zhiyi Huang, Jingwen Leng, Minyi Guo:
  Astraea: towards QoS-aware and resource-efficient multi-stage GPU services. 570-582
- Alexander Fuerst, Stanko Novakovic, Iñigo Goiri, Gohar Irfan Chaudhry, Prateek Sharma, Kapil Arya, Kevin Broas, Eugene Bak, Mehmet Iyigun, Ricardo Bianchini:
  Memory-harvesting VMs in cloud platforms. 583-594
- Tejun Heo, Dan Schatzberg, Andrew Newell, Song Liu, Saravanan Dhakshinamurthy, Iyswarya Narayanan, Josef Bacik, Chris Mason, Chunqiang Tang, Dimitrios Skarlatos:
  IOCost: block IO control for containers in datacenters. 595-608
- Johannes Weiner, Niket Agarwal, Dan Schatzberg, Leon Yang, Hao Wang, Blaise Sanouillet, Bikash Sharma, Tejun Heo, Mayank Jain, Chunqiang Tang, Dimitrios Skarlatos:
  TMO: transparent memory offloading in datacenters. 609-621
- Yawen Wang, Daniel Crankshaw, Neeraja J. Yadwadkar, Daniel S. Berger, Christos Kozyrakis, Ricardo Bianchini:
  SOL: safe on-node learning in cloud platforms. 622-634
Session 6A: Accelerating Emerging Applications
- Nika Mansouri-Ghiasi, Jisung Park, Harun Mustafa, Jeremie S. Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, Rachata Ausavarungnirun, Nandita Vijaykumar, Mohammed Alser, Onur Mutlu:
  GenStore: a high-performance in-storage processing system for genome sequence analysis. 635-654
- Eyes Robson, Ceyu Xu, Lisa Wu Wills:
  ProSE: the architecture and design of a protein discovery engine. 655-668
- Bangyan Wang, Lei Deng, Fei Sun, Guohao Dai, Liu Liu, Yu Wang, Yuan Xie:
  A one-for-all and o(v log(v))-cost solution for parallel merge style operations on sorted key-value arrays. 669-682
- McKenzie van der Hagen, Brandon Lucia:
  Client-optimized algorithms and acceleration for encrypted compute offloading. 683-696
Session 6B: Bugs (1)
- Theodoros Theodoridis, Manuel Rigger, Zhendong Su:
  Finding missed optimizations through the lens of dead code elimination. 697-709
- Umang Mathur, Andreas Pavlogiannis, Hünkar Can Tunç, Mahesh Viswanathan:
  A tree clock data structure for causal orderings in concurrent executions. 710-725
- Benjamin Reidys, Peng Liu, Jian Huang:
  RSSD: defend against ransomware with hardware-isolated network-storage codesign and post-attack analysis. 726-739
- Xiang Cheng, David Devecsery:
  Creating concise and efficient dynamic analyses with ALDA. 740-752
Session 7A: Serverless
- Rohan Basu Roy, Tirthak Patel, Devesh Tiwari:
  IceBreaker: warming serverless functions better with heterogeneity. 753-767
- Yanan Yang, Laiping Zhao, Yiming Li, Huanyu Zhang, Jie Li, Mingyang Zhao, Xingzhen Chen, Keqiu Li:
  INFless: a native serverless system for low-latency, high-throughput inference. 768-781
- Zijun Li, Yushi Liu, Linsong Guo, Quan Chen, Jiagan Cheng, Wenli Zheng, Minyi Guo:
  FaaSFlow: enable efficient workflow execution for function-as-a-service. 782-796
- Dong Du, Qingyuan Liu, Xueqiang Jiang, Yubin Xia, Binyu Zang, Haibo Chen:
  Serverless computing on heterogeneous computers. 797-813
- Qiangyu Pei, Shutong Chen, Qixia Zhang, Xinhui Zhu, Fangming Liu, Ziyang Jia, Yishuo Wang, Yongjie Yuan:
  CoolEdge: hotspot-relievable warm water cooling for energy-efficient edge datacenters. 814-829
Session 7B: Bugs (2)
- Hamed Gorjiara, Guoqing Harry Xu, Brian Demsky:
  Yashme: detecting persistency races. 830-845
- Muhui Jiang, Tianyi Xu, Yajin Zhou, Yufeng Hu, Ming Zhong, Lei Wu, Xiapu Luo, Kui Ren:
  EXAMINER: automatically locating inconsistent instructions between real devices and CPU emulators for ARM. 846-858
- Tuo Li, Jia-Ju Bai, Yulei Sui, Shi-Min Hu:
  Path-sensitive and alias-aware typestate analysis for detecting OS bugs. 859-872
- Zhangyu Chen, Yu Hua, Yongle Zhang, Luochangqi Ding:
  Efficiently detecting concurrency bugs in persistent memory programs. 873-887
- Ziheng Liu, Shihao Xia, Yu Liang, Linhai Song, Hong Hu:
  Who goes first? detecting go concurrency bugs via message reordering. 888-902
Session 8A: Non-traditional Computing and Reconfigurable Hardware
- Dongmoon Min, Yujin Chung, Ilkwon Byun, Junpyo Kim, Jangwoo Kim:
  CryoWire: wire-driven microarchitecture designs for cryogenic computing. 903-917
- Thilini Kaushalya Bandara, Dhananjaya Wijerathne, Tulika Mitra, Li-Shiuan Peh:
  REVAMP: a systematic framework for heterogeneous CGRA realization. 918-932
- Yuanlong Xiao, Eric Micallef, Andrew Butt, Matthew Hofmann, Marc Alston, Matthew Goldsmith, Andrew Merczynski-Hait, André DeHon:
  PLD: fast FPGA compilation to make reconfigurable acceleration compatible with modern incremental refinement software development. 933-945
- Jiacheng Ma, Gefei Zuo, Kevin Loughlin, Haoyang Zhang, Andrew Quinn, Baris Kasikci:
  Debugging in the brave new world of reconfigurable hardware. 946-962
- Patricia Gonzalez-Guerrero, Meriam Gay Bautista, Darren Lyles, George Michelogiannakis:
  Temporal and SFQ pulse-streams encoding for area-efficient superconducting accelerators. 963-976
Session 8B: Synthesis and Compilation
- Theodoros Theodoridis, Tobias Grosser, Zhendong Su:
  Understanding and exploiting optimal function inlining. 977-989
- Hammad Ahmad, Yu Huang, Westley Weimer:
  CirFix: automatically repairing defects in hardware design code. 990-1003
- Maaz Bin Safeer Ahmad, Alexander J. Root, Andrew Adams, Shoaib Kamil, Alvin Cheung:
  Vector instruction selection for digital signal processors using program synthesis. 1004-1016
- Qian Zhang, Jiyuan Wang, Guoqing Harry Xu, Miryung Kim:
  HeteroGen: transpiling C to heterogeneous HLS code with automated test generation and program repair. 1017-1029
- Yanju Chen, Junrui Liu, Yu Feng, Rastislav Bodík:
  Tree traversal synthesis using domain-specific symbolic compilation. 1030-1042
Session 9A: Hardware Security (2)
- Jubayer Mahmod, Matthew Hicks:
  SRAM has no chill: exploiting power domain separation to steal on-chip secrets. 1043-1055
- Gururaj Saileshwar, Bolin Wang, Moinuddin K. Qureshi, Prashant J. Nair:
  Randomized row-swap: mitigating Row Hammer by breaking spatial correlation between aggressor and victim rows. 1056-1069
- Mark Zhao, Mingyu Gao, Christos Kozyrakis:
  ShEF: shielded enclaves for cloud FPGAs. 1070-1085
- Jubayer Mahmod, Matthew Hicks:
  Invisible bits: hiding secret messages in SRAM's analog domain. 1086-1098
Session 9B: Smart Networking
- Tushar Swamy, Alexander Rucker, Muhammad Shahbaz, Ishan Gaur, Kunle Olukotun:
  Taurus: a data plane architecture for per-packet ML. 1099-1114
- Haggai Eran, Maxim Fudim, Gabi Malka, Gal Shalom, Noam Cohen, Amit Hermony, Dotan Levi, Liran Liss, Mark Silberstein:
  FlexDriver: a network driver for your accelerator. 1115-1129
- Boris Pismenny, Liran Liss, Adam Morrison, Dan Tsafrir:
  The benefits of general-purpose on-NIC memory. 1130-1147
- Sebastiano Miano, Alireza Sanaee, Fulvio Risso, Gábor Rétvári, Gianni Antichi:
  Domain specific run time optimization for software data planes. 1148-1164
