data-engineer video for a data engineering team at an e-commerce company needs to build a data pipeline to ingest and transform data from various sources for
A data engineering team at an e-commerce company needs to build a data pipeline to ingest and transform data from various sources for reporting and analysis. The raw data is stored in Amazon S3 as JSON files. The company wants to transform the data into a columnar format to optimize query performance. After the transformation, the data should be stored back in S3 and queried using Amazon Athena for reporting purposes. The company also wants to automate schema discovery and ensure that the pipeline is cost-effective, scalable, and easily maintainable. Which TWO of the following strategies should the team implement to meet these requirements? (Choose Two)