Academic

Academic

Academic · 1 min

An Agentic Evaluation Framework for AI-Generated Scientific Code in PETSc

arXiv:2603.15976v1 Announce Type: new Abstract: While large language models have significantly accelerated scientific code generation, comprehensively evaluating the generated code remains a major challenge. Traditional …

Hong Zhang, Barry Smith, Satish Balay, Le Chen, Murat Keceli, Lois Curfman McInnes, Junchao Zhang
17 views
Academic · 1 min

Adaptive Theory of Mind for LLM-based Multi-Agent Coordination

arXiv:2603.16264v1 Announce Type: new Abstract: Theory of Mind (ToM) refers to the ability to reason about others' mental states, and higher-order ToM involves considering that …

Chunjiang Mu, Ya Zeng, Qiaosheng Zhang, Kun Shao, Chen Chu, Hao Guo, Danyang Jia, Zhen Wang, Shuyue Hu
232 views
Academic · 1 min

Form Follows Function: Recursive Stem Model

arXiv:2603.15641v1 Announce Type: new Abstract: Recursive reasoning models such as Hierarchical Reasoning Model (HRM) and Tiny Recursive Model (TRM) show that small, weight-shared networks can …

Navid Hakimi
34 views
Academic · 1 min

Protein Design with Agent Rosetta: A Case Study for Specialized Scientific Agents

arXiv:2603.15952v1 Announce Type: new Abstract: Large language models (LLMs) are capable of emulating reasoning and using tools, creating opportunities for autonomous agents that execute complex …

Jacopo Teneggi, S. M. Bargeen A. Turzo, Tanya Marwah, Alberto Bietti, P. Douglas Renfrew, Vikram Khipple Mulligan, Siavash Golkar
12 views
Academic · 1 min

Are Large Language Models Truly Smarter Than Humans?

arXiv:2603.16197v1 Announce Type: new Abstract: Public leaderboards increasingly suggest that large language models (LLMs) surpass human experts on benchmarks spanning academic knowledge, law, and programming. …

Eshwar Reddy M, Sourav Karmakar
17 views
Academic · 1 min

Robust Language Identification for Romansh Varieties

arXiv:2603.15969v1 Announce Type: new Abstract: The Romansh language has several regional varieties, called idioms, which sometimes have limited mutual intelligibility. Despite this linguistic diversity, there …

Charlotte Model, Sina Ahmadi, Jannis Vamvas
29 views