Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks
arXiv:2603.11689v1 Announce Type: new Abstract: Frontier Multimodal Large Language Models (MLLMs) exhibit remarkable capabilities in Visual-Language Comprehension (VLC) tasks. However, they are often deployed as …
Mei Chee Leong, Ying Gu, Hui Li Tan, Liyuan Li, Nancy Chen
9 views