


default search action
ACM SIGOPS: Operating Systems Review, Volume 59
Volume 59, Number 1, July 2025
- Zhenning Yang, Archit Bhatnagar, Yiming Qiu, Tongyuan Miao

, Patrick Tser Jern Kon, Yunming Xiao, Yibo Huang
, Martin Casado, Ang Chen:
Cloud Infrastructure Management in the Age of AI Agents. 1-8 - Arney Agrawal, Nitin Kedia, Ashish Panwar, Jayashree Mohan, Nipun Kwatra, Bhargav S. Gulavani, Alexey Tumanov, Ramachandran Ramjee:

Efficient LLM Inference via Chunked Prefills. 9-16 - Yuan Wang, Zhenyuan Yang

, Zhanbo Wang, Mingyu Li, Zhilin Wu, Haibo Chen:
Towards Large Language Model-Friendly APls. 17-23 - Rui Yang, Rajiv Gupta:

DREAM: Distributed Regional Efficient Agent Management with LLMs for Online Multi-Agent Pathfinding. 24-33 - Payman Behnam, Alind Khare, Dhruv Garg, Alexey Tumanov:

Toward Weight Sharing Paradigm for Efficient AI: Training and Inference Serving. 34-45 - Payman Behnam, Yaosheng Fu, Ritchie Zhao, Po-An Tsai, Zhiding Yu, Alexey Tumanov:

EMPIRIC: Exploring Missing Pieces in KV Cache Compression for Reducing Computation, Storage, and Latency in Long-Context LLM Inference. 46-54
- Robin Vonk, Joost Hoozemans, Zaid Al-Ars:

GSST: Parallel string decompression at 191 GB/s on GPU. 55-61 - Shadi Ibrahim, Jad Darrous:

Erasure Coding Aware Block Placement for Data-Intensive Applications. 62-69

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














