Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Buffer Matters: Unleashing the Power of Off-Policy Reinforcement Learning in Large Language Model Reasoning

arXiv:2602.20722v1 Announce Type: new Abstract: Traditional on-policy Reinforcement Learning with Verifiable Rewards (RLVR) frameworks suffer from experience waste and reward homogeneity, which directly hinders learning …

Xu Wan, Yansheng Wang, Wenqi Huang, Mingyang Sun

40 views Mar 2

Academic · 1 min

Modality-Guided Mixture of Graph Experts with Entropy-Triggered Routing for Multimodal Recommendation

arXiv:2602.20723v1 Announce Type: new Abstract: Multimodal recommendation enhances ranking by integrating user-item interactions with item content, which is particularly effective under sparse feedback and long-tail …

Ji Dai, Quan Fang, Dengsheng Cai

34 views Mar 2

Academic · 1 min

Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback

arXiv:2602.20728v1 Announce Type: new Abstract: Reward design has been one of the central challenges for real world reinforcement learning (RL) deployment, especially in settings with …

Chenyang Zhao, Vinny Cahill, Ivana Dusparic

32 views Mar 2

Academic · 1 min

CHESS: Context-aware Hierarchical Efficient Semantic Selection for Long-Context LLM Inference

arXiv:2602.20732v1 Announce Type: new Abstract: Long-context LLMs demand accurate inference at low latency, yet decoding becomes primarily constrained by KV cache as context grows. Prior …

Chao Fei, Guozhong Li, Chenxi Liu, Panos Kalnis

38 views Mar 2

Academic · 1 min

PyVision-RL: Forging Open Agentic Vision Models via RL

arXiv:2602.20739v1 Announce Type: new Abstract: Reinforcement learning for agentic multimodal models often suffers from interaction collapse, where models learn to reduce tool usage and multi-turn …

Shitian Zhao, Shaoheng Lin, Ming Li, Haoquan Zhang, Wenshuo Peng, Kaipeng Zhang, Chen Wei

24 views Mar 2

Academic · 1 min

Pipeline for Verifying LLM-Generated Mathematical Solutions

arXiv:2602.20770v1 Announce Type: new Abstract: With the growing popularity of Large Reasoning Models and their results in solving mathematical problems, it becomes crucial to measure …

Varvara Sazonova, Dmitri Shmelkin, Stanislav Kikot, Vasily Motolygin

34 views Mar 2

Academic · 1 min

POMDPPlanners: Open-Source Package for POMDP Planning

arXiv:2602.20810v1 Announce Type: new Abstract: We present POMDPPlanners, an open-source Python package for empirical evaluation of Partially Observable Markov Decision Process (POMDP) planning algorithms. The …

Yaacov Pariente, Vadim Indelman

35 views Mar 2

Academic · 1 min

Qwen-BIM: developing large language model for BIM-based design with domain-specific benchmark and dataset

arXiv:2602.20812v1 Announce Type: new Abstract: As the construction industry advances toward digital transformation, BIM (Building Information Modeling)-based design has become a key driver supporting intelligent …

Jia-Rui Lin, Yun-Hong Cai, Xiang-Rui Ni, Shaojie Zhou, Peng Pan

40 views Mar 2

Academic · 1 min

Pressure Reveals Character: Behavioural Alignment Evaluation at Depth

arXiv:2602.20813v1 Announce Type: new Abstract: Evaluating alignment in language models requires testing how they behave under realistic pressure, not just what they claim they would …

Nora Petrova, John Burden

37 views Mar 2

Academic · 1 min

Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs

arXiv:2602.20878v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) achieve strong performance on visual question answering benchmarks, yet often rely on spurious correlations rather than …

Dhita Putri Pratama, Soyeon Caren Han, Yihao Ding

43 views Mar 2

Academic · 1 min

Predicting Sentence Acceptability Judgments in Multimodal Contexts

arXiv:2602.20918v1 Announce Type: new Abstract: Previous work has examined the capacity of deep neural networks (DNNs), particularly transformers, to predict human sentence acceptability judgments, both …

Hyewon Jang, Nikolai Ilinykh, Sharid Lo\'aiciga, Jey Han Lau, Shalom Lappin

38 views Mar 2

Academic · 1 min

HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG

arXiv:2602.20926v1 Announce Type: new Abstract: Large Language Models (LLMs) often struggle with inherent knowledge boundaries and hallucinations, limiting their reliability in knowledge-intensive tasks. While Retrieval-Augmented …

Yuqi Huang, Ning Liao, Kai Yang, Anning Hu, Shengchao Hu, Xiaoxing Wang, Junchi Yan

35 views Mar 2

← Previous

328 329 330 331 332

Academic

Buffer Matters: Unleashing the Power of Off-Policy Reinforcement Learning in Large Language Model Reasoning

Modality-Guided Mixture of Graph Experts with Entropy-Triggered Routing for Multimodal Recommendation

Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback

CHESS: Context-aware Hierarchical Efficient Semantic Selection for Long-Context LLM Inference

PyVision-RL: Forging Open Agentic Vision Models via RL

Pipeline for Verifying LLM-Generated Mathematical Solutions

POMDPPlanners: Open-Source Package for POMDP Planning

Qwen-BIM: developing large language model for BIM-based design with domain-specific benchmark and dataset

Pressure Reveals Character: Behavioural Alignment Evaluation at Depth

Diagnosing Causal Reasoning in Vision-Language Models via Structured Relevance Graphs

Predicting Sentence Acceptability Judgments in Multimodal Contexts

HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG

JCG, PC

HSOLLC Co., Ltd.