请问是用什么评分模型测 AlignBench 的? #1555
linbeyoung
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
AlignBench原有的GPT4-0613已经下架,用GPT4o测试了Qwen3-4B AlignBench发现分数大概是7.53,和报告中的8.10还有差距,请问您这边用的是什么模型在做评测呢?
Beta Was this translation helpful? Give feedback.
All reactions