A data scientist is building a logistic regression model to classify emails as spam or not spam based on a set of features. While training the model with the “log loss” cost function, the scientist noticed that it tends to get stuck in a local minimum. What should the scientist do to overcome this problem during training?
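
For concreteness, the setup described in the question can be sketched as follows. This is a minimal illustration, assuming NumPy and plain batch gradient descent. The function names (`sigmoid`, `log_loss`, `gradient_step`), the toy data, and the learning rate are hypothetical and are not taken from the scenario.

```python
import numpy as np

def sigmoid(z):
    # Map raw scores to probabilities in (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def log_loss(w, b, X, y, eps=1e-12):
    # Binary cross-entropy ("log loss") averaged over all examples.
    p = sigmoid(X @ w + b)
    p = np.clip(p, eps, 1 - eps)  # avoid log(0)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def gradient_step(w, b, X, y, lr=0.1):
    # One batch gradient descent update on the log loss.
    err = sigmoid(X @ w + b) - y
    grad_w = X.T @ err / len(y)
    grad_b = err.mean()
    return w - lr * grad_w, b - lr * grad_b

# Toy data (assumed): one feature, label 1 = spam, 0 = not spam.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])

w0, b0 = np.zeros(1), 0.0
loss0 = log_loss(w0, b0, X, y)   # ln(2) ≈ 0.6931, since every prediction is 0.5
w1, b1 = gradient_step(w0, b0, X, y)
loss1 = log_loss(w1, b1, X, y)   # lower than loss0 after one step
```

Tracking the loss value across updates in this way is one simple means of observing whether training is still making progress or has stalled.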