This is a dedicated watch page for a single video.
The Machine Learning team at an e-commerce company is analyzing the sales data. The data is stored in a highly optimized data compression format and the daily volume of data is around 1TB. The team would like to reduce this volume to one-tenth of its original size without significantly compromising on the quality of data, so that they can complete the classification model training in a much shorter time-span. As an ML Specialist, which of the following solutions would you recommend to the team?