


default search action
BibTeX record journals/corr/abs-2502-19557
@article{DBLP:journals/corr/abs-2502-19557,
author = {Yudi Zhang and
Lu Wang and
Meng Fang and
Yali Du and
Chenghua Huang and
Jun Wang and
Qingwei Lin and
Mykola Pechenizkiy and
Dongmei Zhang and
Saravan Rajmohan and
Qi Zhang},
title = {Distill Not Only Data but Also Rewards: Can Smaller Language Models
Surpass Larger Ones?},
journal = {CoRR},
volume = {abs/2502.19557},
year = {2025},
url = {https://doi.org/10.48550/arXiv.2502.19557},
doi = {10.48550/ARXIV.2502.19557},
eprinttype = {arXiv},
eprint = {2502.19557},
timestamp = {Sun, 21 Sep 2025 12:32:57 +0200},
biburl = {https://dblp.org/rec/journals/corr/abs-2502-19557.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID













