


default search action
Haocheng Xi
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c8]Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Haotian Tang, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Jinyi Hu, Sifei Liu, Ranjay Krishna, Pavlo Molchanov, Jan Kautz, Hongxu Yin, Song Han, Yao Lu:
NVILA: Efficient Frontier Visual Language Models. CVPR 2025: 4122-4134
[c7]Haocheng Xi, Han Cai, Ligeng Zhu, Yao Lu, Kurt Keutzer, Jianfei Chen, Song Han:
COAT: Compressing Optimizer states and Activations for Memory-Efficient FP8 Training. ICLR 2025
[c6]Yuxiang Chen, Haocheng Xi, Jun Zhu, Jianfei Chen:
Oscillation-Reduced MXFP4 Training for Vision Transformers. ICML 2025
[c5]Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Richard Charles Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache. ICML 2025
[c4]Haocheng Xi, Shuo Yang, Yilong Zhao, Chenfeng Xu, Muyang Li, Xiuyu Li, Yujun Lin, Han Cai, Jintao Zhang, Dacheng Li, Jianfei Chen, Ion Stoica, Kurt Keutzer, Song Han:
Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity. ICML 2025
[c3]Jintao Zhang, Chendong Xiang, Haofeng Huang, Jia Wei, Haocheng Xi, Jun Zhu, Jianfei Chen:
SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference. ICML 2025
[i18]Haocheng Xi, Shuo Yang, Yilong Zhao, Chenfeng Xu, Muyang Li, Xiuyu Li, Yujun Lin, Han Cai, Jintao Zhang, Dacheng Li, Jianfei Chen, Ion Stoica, Kurt Keutzer, Song Han:
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity. CoRR abs/2502.01776 (2025)
[i17]Rishabh Tiwari, Haocheng Xi, Aditya Tomar, Coleman Hooper, Sehoon Kim, Maxwell Horton, Mahyar Najibi, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache. CoRR abs/2502.10424 (2025)
[i16]Jintao Zhang, Chendong Xiang, Haofeng Huang, Jia Wei, Haocheng Xi, Jun Zhu, Jianfei Chen:
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference. CoRR abs/2502.18137 (2025)
[i15]Yuxiang Chen, Haocheng Xi, Jun Zhu, Jianfei Chen:
Oscillation-Reduced MXFP4 Training for Vision Transformers. CoRR abs/2502.20853 (2025)
[i14]Shuo Yang, Haocheng Xi, Yilong Zhao, Muyang Li, Jintao Zhang, Han Cai, Yujun Lin, Xiuyu Li, Chenfeng Xu, Kelly Peng, Jianfei Chen, Song Han, Kurt Keutzer, Ion Stoica:
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation. CoRR abs/2505.18875 (2025)
[i13]Xingyang Li, Muyang Li, Tianle Cai, Haocheng Xi, Shuo Yang, Yujun Lin, Lvmin Zhang, Songlin Yang, Jinbo Hu, Kelly Peng, Maneesh Agrawala, Ion Stoica, Kurt Keutzer, Song Han:
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation. CoRR abs/2506.19852 (2025)
[i12]Aditya Tomar, Coleman Hooper, Minjae Lee, Haocheng Xi, Rishabh Tiwari, Wonjun Kang, Luca Manolache, Michael W. Mahoney, Kurt Keutzer, Amir Gholami:
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization. CoRR abs/2508.10395 (2025)
[i11]Yuxian Gu, Qinghao Hu, Shang Yang, Haocheng Xi, Junyu Chen, Song Han, Han Cai:
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search. CoRR abs/2508.15884 (2025)
[i10]Saarth Gaonkar, Xiang Zheng, Haocheng Xi, Rishabh Tiwari, Kurt Keutzer, Dmitriy Morozov, Michael W. Mahoney, Amir Gholami:
SciML Agents: Write the Solver, Not the Solution. CoRR abs/2509.09936 (2025)
[i9]Jintao Zhang, Haoxu Wang, Kai Jiang, Shuo Yang, Kaiwen Zheng, Haocheng Xi, Ziteng Wang, Hongzhou Zhu, Min Zhao, Ion Stoica, Joseph E. Gonzalez, Jun Zhu, Jianfei Chen:
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention. CoRR abs/2509.24006 (2025)
[i8]Wenkun He, Yuchao Gu, Junyu Chen, Dongyun Zou, Yujun Lin, Zhekai Zhang, Haocheng Xi, Muyang Li, Ligeng Zhu, Jincheng Yu, Junsong Chen, Enze Xie, Song Han, Han Cai:
DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space. CoRR abs/2509.25180 (2025)
[i7]Junyu Chen, Wenkun He, Yuchao Gu, Yuyang Zhao, Jincheng Yu, Junsong Chen, Dongyun Zou, Yujun Lin, Zhekai Zhang, Muyang Li, Haocheng Xi, Ligeng Zhu, Enze Xie, Song Han, Han Cai:
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder. CoRR abs/2509.25182 (2025)
[i6]Tianrui Feng, Zhi Li, Shuo Yang, Haocheng Xi, Muyang Li, Xiuyu Li, Lvmin Zhang, Keting Yang, Kelly Peng, Song Han, Maneesh Agrawala, Kurt Keutzer, Akio Kodaira, Chenfeng Xu:
StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation. CoRR abs/2511.07399 (2025)- 2024
[c2]Haocheng Xi, Yuxiang Chen, Kang Zhao, Kai Jun Teh, Jianfei Chen, Jun Zhu:
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization. ICML 2024
[i5]Yifeng Liu, Hanwen Xu, Tangqi Fang, Haocheng Xi, Zixuan Liu, Sheng Zhang, Hoifung Poon, Sheng Wang:
T-Rex: Text-assisted Retrosynthesis Prediction. CoRR abs/2401.14637 (2024)
[i4]Haocheng Xi, Yuxiang Chen, Kang Zhao, Kaijun Zheng, Jianfei Chen, Jun Zhu:
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization. CoRR abs/2403.12422 (2024)
[i3]Haocheng Xi, Han Cai, Ligeng Zhu, Yao Lu, Kurt Keutzer, Jianfei Chen, Song Han:
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training. CoRR abs/2410.19313 (2024)
[i2]Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Vishwesh Nath, Jinyi Hu, Sifei Liu, Ranjay Krishna, Daguang Xu, Xiaolong Wang, Pavlo Molchanov, Jan Kautz, Hongxu Yin, Song Han, Yao Lu:
NVILA: Efficient Frontier Visual Language Models. CoRR abs/2412.04468 (2024)- 2023
[c1]Haocheng Xi, Changhao Li, Jianfei Chen, Jun Zhu:
Training Transformers with 4-bit Integers. NeurIPS 2023
[i1]Haocheng Xi, Changhao Li, Jianfei Chen, Jun Zhu:
Training Transformers with 4-bit Integers. CoRR abs/2306.11987 (2023)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-01-09 00:16 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







