2021 02 11

AWS – AWS Glue DataBrew now allows you to configure the size of the dataset when auto-generating data quality statistics

When running profile jobs in AWS Glue DataBrew to auto-generate 40+ data quality statistics like column-level cardinality, numerical correlations, unique values, standard deviation, and other statistics, you can now configure the size of the dataset you want analyzed. This allows you to customize your profile to run on x% of the dataset for really large datasets or focus on a sub-sample of the dataset for faster results.

AWS – AWS Glue DataBrew now allows you to configure the size of the dataset when auto-generating data quality statistics

Related Posts

AWS – Amazon VPC Route Server now available in new regions

GCP – Palo Alto Networks automates customer intelligence document creation with agentic design

GCP – Vibe querying: Write SQL queries faster with Comments to SQL in BigQuery