阿里大模型：Qwen2.5

最新推荐文章于 2025-06-10 14:26:03 发布

诸神缄默不语

最新推荐文章于 2025-06-10 14:26:03 发布

阅读量1.5k

点赞数 3

CC 4.0 BY-SA版权

分类专栏：人工智能学习笔记文章标签： Qwen 通义千问阿里大模型大规模预训练语言模型 NLP 自然语言处理

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/PolarisRisingWar/article/details/145509234

人工智能学习笔记专栏收录该内容

271 篇文章

订阅专栏

诸神缄默不语-个人CSDN博文目录

还没有上过手，等我上过手了会来补更多内容。

中文文档：https://qwen.readthedocs.io/zh-cn/latest/

Qwen现在最新出到2.5版。由于Qwen 2+和Qwen之间并不兼容，所以本文仅包括Qwen2.5的相关实操内容，对于Qwen的技术报告解读等相关内容会另写博文。

Qwen是类GPT的decoder-only架构，自回归模型，分词方式也是BPE。
提供 0.5B 、1.5B 、3B 、7B 、14B 、32B 和 72B 共7种参数规模的模型，并且有基模型和指令微调模型两种变体。
可以处理 32K 或 128K token 长的文本，其中 8K 长度可作为输出。（可以用YaRN进行扩展）
有纯文本、视觉、音频模型，有专门针对代码和数学进行优化的模型。

没开源的模型可以用API：https://www.alibabacloud.com/help/zh/model-studio/developer-reference/use-qwen-by-calling-api
或者从网页端访问：https://chat.qwenlm.ai/

已开源的模型可以本地部署，也可以架API，transformers格式和OpenAI API格式都支持，教程：
https://qwen.readthedocs.io/zh-cn/latest/getting_started/quickstart.html
transformers版详细教程：https://qwen.readthedocs.io/zh-cn/latest/inference/chat.html
Ollama：https://qwen.readthedocs.io/zh-cn/latest/run_locally/ollama.html
llama.cpp：https://qwen.readthedocs.io/zh-cn/latest/run_locally/llama.cpp.html
LM Studio
Web UI：https://qwen.readthedocs.io/zh-cn/latest/web_ui/text_generation_webui.html
vLLM：https://qwen.readthedocs.io/zh-cn/latest/deployment/vllm.html
TGI：https://qwen.readthedocs.io/zh-cn/latest/deployment/tgi.html
SkiPilot：https://qwen.readthedocs.io/zh-cn/latest/deployment/skypilot.html
OpenLLM：https://qwen.readthedocs.io/zh-cn/latest/deployment/openllm.html（这个还会显示required GPU RAM 和支持的操作系统，看起来挺智能的）
（方便使用RAG）LlamaIndex：https://qwen.readthedocs.io/zh-cn/latest/framework/LlamaIndex.html
（方便使用RAG）Langchain：https://qwen.readthedocs.io/zh-cn/latest/framework/Langchain.html
具体的我自己还没试，等试了再写。

官方文档还给出了推荐量化方式，略。

用LLaMA Factory进行SFT：https://qwen.readthedocs.io/zh-cn/latest/training/SFT/llama_factory.html

函数调用：https://qwen.readthedocs.io/zh-cn/latest/framework/function_call.html

官方智能体开发库：https://qwen.readthedocs.io/zh-cn/latest/framework/qwen_agent.html

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

诸神缄默不语 你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。