Category
Academic
Training Is Everything: Artificial Intelligence, Copyright, and Fair Training
To learn how to behave, the current revolutionary generation of AIs must be trained on vast quantities of published images, written works, and sounds, many …
LLM-Assisted Causal Structure Disambiguation and Factor Extraction for Legal Judgment Prediction
arXiv:2603.11446v1 Announce Type: new Abstract: Mainstream methods for Legal Judgment Prediction (LJP) based on Pre-trained Language Models (PLMs) heavily rely on the statistical correlation between …
BLooP: Zero-Shot Abstractive Summarization using Large Language Models with Bigram Lookahead Promotion
arXiv:2603.11415v1 Announce Type: new Abstract: Abstractive summarization requires models to generate summaries that convey information in the source document. While large language models can generate …
MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large …
arXiv:2603.11414v1 Announce Type: new Abstract: We present MaterialFigBench, a benchmark dataset designed to evaluate the ability of multimodal large language models (LLMs) to solve university-level …
Algorithmic Consequences of Particle Filters for Sentence Processing: Amplified Garden-Paths and Digging-In Effects
arXiv:2603.11412v1 Announce Type: new Abstract: Under surprisal theory, linguistic representations affect processing difficulty only through the bottleneck of surprisal. Our best estimates of surprisal come …
Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning
arXiv:2603.11394v1 Announce Type: new Abstract: Patients and clinicians are increasingly using chatbots powered by large language models (LLMs) for healthcare inquiries. While state-of-the-art LLMs exhibit …
Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation
arXiv:2603.11342v1 Announce Type: new Abstract: The study of the attribution of input features to the output of neural network models is an active area of …
Temporal Text Classification with Large Language Models
arXiv:2603.11295v1 Announce Type: new Abstract: Languages change over time. Computational models can be trained to recognize such changes enabling them to estimate the publication date …
ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions
arXiv:2603.11281v1 Announce Type: new Abstract: Medical question-answering benchmarks predominantly evaluate single-turn exchanges, failing to capture the iterative, clarification-seeking nature of real patient consultations. We introduce …
Artificial Intelligence for Sentiment Analysis of Persian Poetry
arXiv:2603.11254v1 Announce Type: new Abstract: Recent advancements of the Artificial Intelligence (AI) have led to the development of large language models (LLMs) that are capable …