Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition
arXiv:2603.18557v1 Announce Type: new Abstract: As large language models are increasingly deployed across diverse real-world applications, extending automated evaluation beyond English has become a critical …
Ivaxi Sheth, Zeno Jonke, Amin Mantrach, Saab Mansour
9 views