AWS Exams GCP Exams Azure Exams GitHub Exams Jira Exams ISC2 Exams

Video: A company has a massive customers’ dataset stored aws video

Question 1 Be Honest
« Back   Next ml-specialty Certification Question »
Answer

Full Certification Question

A company has a massive customers’ dataset stored in a single csv file. The company is building an XGBoost classification model for churn prediction, in other words, predicting whether the customer will stop dealing with the company. The data currently resides in an S3 bucket in a single folder called “Dataset”. A data engineer should split the data into training and validation for future training on Amazon Sagemaker. What is the correct sequence of events to perform this action?