Large Language Models and Book Summarization: Reading or Remembering, Which Is Better?
arXiv:2603.09981v1 Announce Type: new Abstract: Summarization is a core task in Natural Language Processing (NLP). Recent advances in Large Language Models (LLMs) and the introduction of large context windows reaching millions of tokens make it possible to process entire books in a single prompt. At the same time, for well-known books, LLMs can generate summaries based only on internal knowledge acquired during training. This raises several important questions: How do summaries generated from internal memory compare to those derived from the full text? Does prior knowledge influence summaries even when the model is given the book as input? In this work, we conduct an experimental evaluation of book summarization with state-of-the-art LLMs. We compare summaries of well-known books produced using (i) only the internal knowledge of the model and (ii) the full text of the book. The results show that having the full text provides more detailed summaries in general, but some books have better scores for the internal knowledge summaries. This puts into question the capabilities of models to perform summarization of long texts, as information learned during training can outperform summarization of the full text in some cases.
Executive Summary
This article investigates the efficacy of Large Language Models (LLMs) in book summarization, comparing summaries generated from internal memory with those derived from the full text. The study found that while full-text summaries generally outperform internal-knowledge summaries, the latter can occasionally surpass the former. This unexpected result raises questions about the models' ability to summarize long texts effectively.
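The two conditions being compared can be illustrated with a minimal sketch of the prompt construction. Note that the exact prompt wording here is an assumption for illustration; the paper does not publish its prompts, and any LLM client used to send these prompts is likewise hypothetical.

```python
# Illustrative sketch of the two prompting conditions compared in the study.
# The wording is assumed, not taken from the authors' setup.

def memory_prompt(title: str, author: str) -> str:
    """Condition (i): summarize from internal (parametric) knowledge only."""
    return (
        f"Summarize the book '{title}' by {author} using only what you "
        "already know about it. The text of the book is NOT provided."
    )

def full_text_prompt(title: str, author: str, book_text: str) -> str:
    """Condition (ii): summarize the full text supplied in the context window."""
    return (
        f"Summarize the following book, '{title}' by {author}, "
        "based strictly on the text below.\n\n"
        f"{book_text}"
    )
```

In condition (ii), the entire book is placed in the prompt, which is feasible only with the million-token context windows the abstract mentions; the comparison then scores the two resulting summaries against each other.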
Key Points
- ▸ LLMs can generate summaries of well-known books based solely on internal knowledge acquired during training, and these in some cases score higher than full-text summaries.
- ▸ An experimental evaluation of book summarization with state-of-the-art LLMs reveals limitations in the models' long-text summarization capabilities.
- ▸ Internal-knowledge summaries can excel in certain cases, challenging the assumption that full-text summaries are inherently superior.
Merits
Insightful Analysis
The study offers a nuanced understanding of the strengths and limitations of LLMs in book summarization, providing valuable insights for the development of more effective NLP models.
Methodological Rigor
The experimental evaluation employed a well-designed approach, comparing summaries generated from internal memory with those derived from full text, and providing a comprehensive analysis of the results.
Demerits
Limited Generalizability
The study's findings may not be directly applicable to other domains or tasks, as the performance of LLMs can vary significantly depending on the specific context and requirements.
Lack of Explanatory Power
The article does not explain why internal-knowledge summaries outperform full-text summaries in certain instances, leaving the underlying mechanism open to further investigation.
Expert Commentary
This study contributes significantly to our understanding of the strengths and limitations of LLMs in book summarization, highlighting the potential for internal knowledge to outperform full-text summaries in certain cases. However, the findings also underscore the need for further research on the mechanisms underlying model performance, particularly on how models reconcile memorized knowledge with text provided in context. Ultimately, the article suggests that the field of NLP must continue to prioritize the development of more explainable and interpretable models, particularly in high-stakes applications.
Recommendations
- ✓ Future research should investigate how LLMs combine memorized knowledge with in-context text, and how internal knowledge can be leveraged to improve performance on long-document summarization.
- ✓ Developers should prioritize the incorporation of explainability and interpretability techniques into LLMs, providing a more detailed understanding of their performance and limitations.