References
MSMARCO
: https://microsoft.github.io/msmarco/
HotpotQA
: https://hotpotqa.github.io/
CQADupStack
: http://nlp.cis.unimelb.edu.au/resources/cqadupstack/
Chatbot Arena: https://chat.lmsys.org/?leaderboard
MMLU: https://arxiv.org/abs/2009.03300
MT Bench: https://arxiv.org/pdf/2402.14762