All Articles

Articles

Academic · 1 min

[Re] FairDICE: A Gap Between Theory And Practice

arXiv:2603.03454v1 Announce Type: new Abstract: Offline Reinforcement Learning (RL) is an emerging field of RL in which policies are learned solely from demonstrations. Within offline …

Peter Adema, Karim Galliamov, Aleksey Evstratovskiy, Ross Geurts
5 views
Academic · 1 min

Biased Generalization in Diffusion Models

arXiv:2603.03469v1 Announce Type: new Abstract: Generalization in generative modeling is defined as the ability to learn an underlying distribution from a finite dataset and produce …

Jerome Garnier-Brun, Luca Biggio, Davide Beltrame, Marc M\'ezard, Luca Saglietti
32 views
Academic · 1 min

Test-Time Meta-Adaptation with Self-Synthesis

arXiv:2603.03524v1 Announce Type: new Abstract: As strong general reasoners, large language models (LLMs) encounter diverse domains and tasks, where the ability to adapt and self-improve …

Zeyneb N. Kaya, Nick Rui
13 views