You are a data scientist at a healthcare startup (AWS video)

1 h 46 min 27 s · EN

Full Certification Question

You are a data scientist at a healthcare startup tasked with developing a machine learning model to predict the likelihood of patients developing a specific chronic disease within the next five years. The dataset includes patient demographics, medical history, lab results, and lifestyle factors but contains only 1,000 records. The dataset has missing values in some critical features, and the class distribution is highly imbalanced, with only 5% of patients labeled as having developed the disease. Given these constraints, which approach is MOST LIKELY to determine the feasibility of an ML solution and guide your next steps?
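
The figures in the question can be turned into concrete counts before weighing any approach. The following is a minimal sketch on synthetic data, not the real patient dataset: the 1,000 records and 5% positive rate come from the question, while `MISSING_RATE`, `N_FOLDS`, and the `lab_result` field are hypothetical values chosen only for illustration.

```python
import random

N_RECORDS = 1000       # from the question
POSITIVE_RATE = 0.05   # from the question
MISSING_RATE = 0.15    # hypothetical: the question gives no missingness figure
N_FOLDS = 5            # hypothetical: a common cross-validation choice


def make_synthetic(n, positive_rate, missing_rate, seed=0):
    """Build stand-in records with one lab feature and a binary label."""
    rng = random.Random(seed)
    n_pos = round(n * positive_rate)
    labels = [1] * n_pos + [0] * (n - n_pos)
    rng.shuffle(labels)
    rows = []
    for y in labels:
        # None marks a missing value in the hypothetical lab_result field.
        value = None if rng.random() < missing_rate else rng.gauss(5.0 + y, 1.0)
        rows.append({"lab_result": value, "label": y})
    return rows


def summarize(rows, n_folds):
    """Report class counts, missingness, and positives available per fold."""
    n = len(rows)
    n_pos = sum(r["label"] for r in rows)
    n_missing = sum(r["lab_result"] is None for r in rows)
    return {
        "records": n,
        "positives": n_pos,
        "negatives": n - n_pos,
        "missing_fraction": n_missing / n,
        # With stratified folds, each fold holds about this many positives.
        "positives_per_fold": n_pos // n_folds,
        # Accuracy of always predicting "no disease".
        "majority_baseline_accuracy": (n - n_pos) / n,
    }


rows = make_synthetic(N_RECORDS, POSITIVE_RATE, MISSING_RATE)
summary = summarize(rows, N_FOLDS)
for key, value in summary.items():
    print(f"{key}: {value}")
```

Under these assumptions there are only 50 positive cases, about 10 per fold in a stratified 5-fold split, and a model that always predicts "no disease" already reaches 95% accuracy. That is why accuracy alone says little here and why the missing values in critical features matter so much at this sample size.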