You are managing a machine learning pipeline on AWS to classify customer sentiment based on product reviews. The input data varies significantly across regions, causing inconsistent model accuracy. To address this, you need to standardize and preprocess the data before feeding it into the model for inference. Which two steps should you include in your preprocessing pipeline to handle this issue? (Select two)