Watch this video on YouTube
A Generative AI Engineer must clean a dataset of scanned financial documents containing extraneous information like watermarks and logos. What preprocessing step is most effective?