[ACL 2026 Oral] Official implementation of LaMI: Augmenting Large Language Models via Late Multi-Image Fusion
ai deep-learning image-generation language-model large vision-and-language multimodal-deep-learning visual-commonsense-reasoning visual-commonsense
Updated May 18, 2026 - Python