Add default score-based verdict mode for fastchat

5411ff8
Open

Changes related to running benchmark experiments for the paper: Support Qwen3.5 and thinking models, Skywork, truncation tracking, benchmark changes etc #32

