All Articles

Articles

Academic · 1 min

ProbeLLM: Automating Principled Diagnosis of LLM Failures

arXiv:2602.12966v1 Announce Type: new Abstract: Understanding how and why large language models (LLMs) fail is becoming a central challenge as models rapidly evolve and static …

Yue Huang, Zhengzhe Jiang, Yuchen Ma, Yu Jiang, Xiangqi Wang, Yujun Zhou, Yuexing Hao, Kehan Guo, Pin-Yu Chen, Stefan Feuerriegel, Xiangliang Zhang
32 views
Academic · 1 min

TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution

arXiv:2602.13059v1 Announce Type: new Abstract: Question answering (QA) over structured tables requires not only accurate answers but also transparency about which cells support them. Existing …

Tejas Anvekar, Junha Park, Rajat Jha, Devanshu Gupta, Poojah Ganesan, Puneeth Mathur, Vivek Gupta
5 views