A Machine Learning Specialist is developing a Natural Language Processing (NLP) model that powers a paraphrasing tool. This tool aims to generate closely associated words that a user can pick to help him/her construct sentences. The Specialist wants to boost the model performance by feeding word features to a downstream task. How can the Specialist achieve his goal?