A company wants to build an AI application that can summarize very long research papers (e.g., 50-100 pages) into a few paragraphs. When selecting a foundation model, which model characteristic is most critical to ensure it can process the entire document effectively to produce a coherent summary?