Can Structural Cues Save LLMs? Evaluating Language Models in Massive Document Streams
arXiv:2603.19250v1
Abstract: Evaluating language models in streaming environments is critical, yet underexplored. Existing benchmarks either focus on single complex events or provide …
Yukyung Lee, Yebin Lim, Woojun Jung, Wonjun Choi, Susik Yoon