Watch this video on YouTube
An organization is using Google Cloud to analyze customer data and needs to assess data quality to ensure accurate results. Which of the following practices is the most effective approach for identifying duplicate records in a large dataset?