Hi,
I mentioned this on the Twitter post. I recently released a preprint on very similar work, in which I also used part of LLM-Aggrefact for my benchmark.
Do you think it would be possible to add my models to your leaderboard for comparison? I can run them on the updated LLM-Aggrefact benchmark and upload my predictions/results here.
Thanks!