This is a dedicated watch page for a single video.
You are evaluating a machine translation model by comparing its output to a reference translation. The goal is to measure how closely the model’s output matches the reference in terms of overlapping words and phrases. Which metric should you use?