An organization is building a machine learning aws video
machine-learning video for an organization is building a machine learning pipeline that requires preprocessing and transforming large-scale, distributed
Answer
          Full Certification Question
An organization is building a machine learning pipeline that requires preprocessing and transforming large-scale, distributed datasets stored in Amazon S3. The data originates from various sources, including IoT devices and application logs. The organization needs a scalable solution that integrates seamlessly with the pipeline and supports distributed data processing frameworks. Which approach should the organization use to preprocess and transform the data at scale?