


default search action
Xiaobin Zhuang
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j2]Xiaofeng Zou
, Cen Chen
, Hongen Shao
, Qinyu Wang
, Xiaobin Zhuang
, Yangfan Li
, Keqin Li
:
ReViT: Vision Transformer Accelerator With Reconfigurable Semantic-Aware Differential Attention. IEEE Trans. Computers 74(3): 1079-1093 (2025) - [c7]Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:
Sound-VECaps: Improving Audio Generation with Visually Enhanced Captions. ICASSP 2025: 1-5 - [i8]Dongya Jia, Zhuo Chen, Jiawei Chen, Chenpeng Du, Jian Wu, Jian Cong, Xiaobin Zhuang, Chumin Li, Zhen Wei, Yuping Wang, Yuxuan Wang:
DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation. CoRR abs/2502.03930 (2025) - [i7]Team Seawead, Ceyuan Yang, Zhijie Lin, Yang Zhao, Shanchuan Lin, Zhibei Ma, Haoyuan Guo, Hao Chen, Lu Qi, Sen Wang, Feng Cheng, Feilong Zuo, Xuejiao Zeng, Ziyan Yang, Fangyuan Kong, Zhiwu Qing, Fei Xiao, Meng Wei, Tuyen Hoang, Siyu Zhang, Peihao Zhu, Qi Zhao, Jiangqiao Yan, Liangke Gui, Sheng Bi, Jiashi Li, Yuxi Ren, Rui Wang, Huixia Li, Xuefeng Xiao, Shu Liu, Feng Ling, Heng Zhang, Houmin Wei, Huafeng Kuang, Jerry Duncan, Junda Zhang, Junru Zheng, Li Sun, Manlin Zhang, Renfei Sun, Xiaobin Zhuang, Xiaojie Li, Xin Xia, Xuyan Chi, Yanghua Peng, Yuping Wang, Yuxuan Wang, Zhongkai Zhao, Zhuo Chen, Zuquan Song, Zhenheng Yang, Jiashi Feng, Jianchao Yang, Lu Jiang:
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model. CoRR abs/2504.08685 (2025) - [i6]Kai Li, Can Shen, Yile Liu, Jirui Han, Kelong Zheng, Xuechao Zou, Zhe Wang, Xingjian Du, Shun Zhang, Hanjun Luo, Yingbin Jin, Xinxin Xing, Ziyang Ma, Yue Liu, Xiaojun Jia, Yifan Zhang, Junfeng Fang, Kun Wang, Yibo Yan, Haoyang Li, Yiming Li, Xiaobin Zhuang, Yang Liu, Haibo Hu, Zhuo Chen, Zhizheng Wu, Xiaolin Hu, Eng-Siong Chng, XiaoFeng Wang, Wenyuan Xu, Wei Dong, Xinfeng Li:
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models. CoRR abs/2505.16211 (2025) - [i5]Yakun Song, Jiawei Chen, Xiaobin Zhuang, Chenpeng Du, Ziyang Ma, Jian Wu, Jian Cong, Dongya Jia, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xie Chen:
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation. CoRR abs/2506.00385 (2025) - [i4]Tingle Li, Baihe Huang, Xiaobin Zhuang, Dongya Jia, Jiawei Chen, Yuping Wang, Zhuo Chen, Gopala Anumanchipalli, Yuxuan Wang:
Sounding that Object: Interactive Object-Aware Image to Audio Generation. CoRR abs/2506.04214 (2025) - 2024
- [i3]Philip Anastassiou
, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, Xudong Liu, Yuchen Liu, Zhengxi Liu, Lu Lu, Junjie Pan, Xin Wang, Yuping Wang, Yuxuan Wang, Zhen Wei, Jian Wu, Chao Yao, Yifeng Yang, Yuanhao Yi, Junteng Zhang, Qidi Zhang, Shuo Zhang, Wenjie Zhang, Yang Zhang, Zilin Zhao, Dejian Zhong, Xiaobin Zhuang:
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models. CoRR abs/2406.02430 (2024) - [i2]Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Improving Audio Generation with Visual Enhanced Caption. CoRR abs/2407.04416 (2024) - 2023
- [c6]Chenquan Dai
, Xiaobin Zhuang
, Jiaxin Cai
:
A Survey on Deep Learning for Chinese Medical Named Entity Recognition. ICCAI 2023: 472-476 - 2022
- [c5]Chenquan Dai
, Xiaobin Zhuang
, Jiaxin Cai
:
Chinese Electronic Medical Record Named Entity Recognition Based on Bi-RNN-LSTM-RNN-CRF. ICCPR 2022: 577-583 - [c4]Xiaobin Zhuang, Huiran Yu, Weifeng Zhao, Tao Jiang, Peng Hu:
KaraTuner: Towards End-to-End Natural Pitch Correction for Singing Voice in Karaoke. INTERSPEECH 2022: 4262-4266 - 2021
- [c3]Xiaobin Zhuang, Tao Jiang, Szu-Yu Chou, Bin Wu, Peng Hu, Simon Lui:
Litesing: Towards Fast, Lightweight and Expressive Singing Voice Synthesis. ICASSP 2021: 7078-7082 - [i1]Xiaobin Zhuang, Huiran Yu, Weifeng Zhao, Tao Jiang, Peng Hu, Simon Lui, Wenjiang Zhou:
KaraTuner: Towards end to end natural pitch correction for singing voice in karaoke. CoRR abs/2110.09121 (2021)
2010 – 2019
- 2016
- [j1]Xiaobin Zhuang, Wenxiong Kang, Qiuxia Wu:
Real-time vehicle detection with foreground-based cascade classifier. IET Image Process. 10(4): 289-296 (2016) - [c2]Nengneng Peng, Rui Zhang, Haihua Zeng, Fei Wang, Kai Li, Yuanqing Li, Xiaobin Zhuang:
Control of a nursing bed based on a hybrid brain-computer interface. EMBC 2016: 1556-1559 - 2013
- [c1]Youpan Hu, Qing He, Xiaobin Zhuang, Haibin Wang, Baopu Li, Zhenfu Wen, Bin Leng, Guan Guan, Dongjie Chen:
Algorithm for vision-based vehicle detection and classification. ROBIO 2013: 568-572
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-08-21 01:51 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint