Academic

Academic

Academic · 1 min

CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context

arXiv:2603.22576v1 Announce Type: new Abstract: We introduce CAPITU, a benchmark for evaluating instruction-following capabilities of Large Language Models (LLMs) in Brazilian Portuguese. Unlike existing benchmarks …

Giovana Kerche Bon\'as, Roseval Malaquias Junior, Marcos Piau, Thiago Laitz, Thales Sales Almeida, Hugo Abonizio, Celio Larcher, Ramon Pires, Rodrigo Nogueira
10 views
Academic · 1 min

Can Large Language Models Reason and Optimize Under Constraints?

arXiv:2603.23004v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated great capabilities across diverse natural language tasks; yet their ability to solve abstraction and …

Fabien Bernier, Salah Ghamizi, Pantelis Dogoulis, Maxime Cordy
8 views