More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG
Abstract
Evaluating retrieval-augmented generation with varying document counts while keeping context length constant reveals significant challenges for language models.
Retrieval-augmented generation (RAG) provides LLMs with relevant documents. Although previous studies noted that retrieving many documents can degrade performance, they did not isolate how the quantity of documents affects performance while controlling for context length. We evaluate various language models on custom datasets derived from a multi-hop QA task. We keep the context length and position of relevant information constant while varying the number of documents, and find that increasing the document count in RAG settings poses significant challenges for LLMs. Additionally, our results indicate that processing multiple documents is a separate challenge from handling long contexts. We also make the datasets and code available: https://github.com/shaharl6000/MoreDocsSameLen .
Community
Retrieval-augmented generation (RAG) provides LLMs with relevant documents. Although previous studies noted that retrieving many documents can degrade performance, they did not isolate how the quantity of documents affects performance while controlling for context length. We evaluate various language models on custom datasets derived from a multi-hop QA task. We keep the context length and position of relevant information constant while varying the number of documents, and find that increasing the document count in RAG settings poses significant challenges for LLMs. Additionally, our results indicate that processing multiple documents is a separate challenge from handling long contexts. We also make the datasets and code available: https://github.com/shaharl6000/MoreDocsSameLen .
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Parametric Retrieval Augmented Generation (2025)
- PISCO: Pretty Simple Compression for Retrieval-Augmented Generation (2025)
- On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems (2025)
- RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning (2025)
- A RAG-Based Institutional Assistant (2025)
- Does RAG Really Perform Bad For Long-Context Processing? (2025)
- Diversity Enhances an LLM's Performance in RAG and Long-context Task (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
 You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: 
@librarian-bot
	 recommend
Cool!
Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper
 
					 
						 
						 
					