IT之家 1 月 4 日消息,阿里通义千问 Qwen 最新推出 CodeElo 基准测试,通过和人类程序员对比的 Elo 评级系统,来评估大语言模型(LLM)的编程水平。
The latest Open LLM Leaderboard by Hugging Face showed that all of the top-ranked models were trained and developed on the updated open-source versions of Qwen, part of the Tongyi Qianwen LLM ...