An educational institute deploys an AI summarization model to generate concise summaries of lengthy research papers. The institute wants to ensure these summaries maintain coherence and accurately represent the original content. What evaluation method is most suitable?