AWS – Amazon SageMaker Data Wrangler launches new advanced settings for Amazon Athena data sources
Amazon SageMaker Data Wrangler reduces the time that it takes to aggregate and prepare data for machine learning (ML) from weeks to minutes in Amazon SageMaker Studio, the first fully integrated development environment (IDE) for ML. With SageMaker Data Wrangler, you can simplify the process of data preparation and feature engineering, and complete each step of the data preparation workflow, including data selection, cleansing, exploration, and visualization, from a single visual interface. You can import data from multiple data sources such as Amazon Simple Storage Service (Amazon S3), Amazon Redshift, Snowflake, and 26 Federated Query data sources supported by Amazon Athena. Starting today, customers importing data from Athena data sources can configure S3 query output location and data retention period to control where and how long Athena stores the intermediary data.
Read More for the details.