Inquiry Regarding Pairwise Model Comparison in Multi-Modality Arena

Thank you for your remarkable contributions!

I've explored the multi-modality arena and noticed that it actually differs from the Chatbot Arena, where two anonymous models are compared side-by-side. 
![image](https://github.com/user-attachments/assets/78006df8-e67c-46a6-a706-d645b89a0265)
After playing with the demo in the README (as shown above), I observed that only one model is provided for evaluation by the third-party crowd: 
![image](https://github.com/user-attachments/assets/429ebd7a-d491-45a9-9902-562bfdaf72cf)
I cannot find any arena-related keywords in the demo as well:
![image](https://github.com/user-attachments/assets/3618ba98-7084-4bfe-9b2c-abc0e2367085)

This leads me to inquire: where can we find the second model for conducting a pairwise comparison?
@shepnerd @wqshao126 @zzyfd @orashi 



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Inquiry Regarding Pairwise Model Comparison in Multi-Modality Arena #25

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Inquiry Regarding Pairwise Model Comparison in Multi-Modality Arena #25

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions