Welcome to above the clouds

AWS – AWS Glue Studio now offers a no code data preparation authoring experience
Today, AWS Glue Studio Visual ETL announces general availability of data preparation authoring, a new no code data preparation user experience for business users and data analysts with a spreadsheet-style UI that runs data integration jobs at scale on AWS Glue for Spark. The new visual data preparation experience makes it easier for data analysts […]

AWS – Amazon SageMaker introduces a new generative AI inference optimization capability
Today, Amazon SageMaker announced general availability of a new inference capability that delivers up to ~2x higher throughput while reducing costs by up to ~50% for generative AI models such as Llama 3, Mistral, and Mixtral. For example, with a Llama 3-70B model, you can achieve up to ~2400 tokens/sec on an ml.p5.48xlarge instance […]

AWS – Amazon FSx for NetApp ONTAP now allows you to read data during backup restores
Amazon FSx for NetApp ONTAP, a fully managed shared storage service built on NetApp’s popular ONTAP file system, now allows you to read data from a volume while it is being restored from a backup. The feature “read-access during backup restores” allows you to improve Recovery Time Objectives by up to 17x for read-only workloads […]

AWS – AWS Glue Data Catalog now supports generating statistics for Apache Iceberg tables
AWS Glue Data Catalog now supports generating column-level aggregated statistics for Apache Iceberg tables. These statistics are now integrated with the cost-based optimizer (CBO) in Amazon Redshift Spectrum, resulting in improved query performance and potential cost savings. Apache Iceberg supports statistics such as nulls, min, and max, but lacks support for generating aggregation statistics such as number […]

AWS – Amazon FSx for NetApp ONTAP now supports NVMe-over-TCP for simpler, lower-latency shared block storage
Amazon FSx for NetApp ONTAP, a service that provides fully managed shared storage built on NetApp’s popular ONTAP file system, today announced support for the NVMe-over-TCP (NVMe/TCP) block storage protocol. Using NVMe/TCP, you can accelerate your block storage workloads such as databases and Virtual Desktop Infrastructure (VDI) with lower latency compared to traditional iSCSI block […]

AWS – Announcing the next generation of Amazon FSx for NetApp ONTAP file systems
Today, we’re announcing next-generation Amazon FSx for NetApp ONTAP file systems that provide higher scalability and enhanced flexibility compared to previous-generation file systems. Previous-generation file systems consisted of a single highly available (HA) pair of file servers with up to 4 GBps of throughput. Next-gen file systems can be created or expanded with up to 12 […]

AWS – Amazon RDS Data API for Aurora PostgreSQL is now available in 10 additional AWS regions
RDS Data API for Aurora Serverless v2 and Aurora provisioned PostgreSQL-Compatible database instances is now available in Asia Pacific (Sydney), Asia Pacific (Mumbai), Asia Pacific (Seoul), Asia Pacific (Singapore), Europe (Ireland), Europe (London), Europe (Paris), US West (N. California), US East (Ohio), and Canada (Central). RDS Data API allows you to access these Aurora clusters via […]

AWS – Amazon MWAA now supports Apache Airflow version 2.9
You can now create Apache Airflow version 2.9 environments on Amazon Managed Workflows for Apache Airflow (MWAA). Apache Airflow 2.9 is the latest minor release of the popular open-source tool that helps customers author, schedule, and monitor workflows. Amazon MWAA is a managed orchestration service for Apache Airflow that makes it easier to set up […]

AWS – Amazon EC2 R8g instances powered by AWS Graviton4 now generally available
AWS announces the general availability of Amazon Elastic Compute Cloud (Amazon EC2) R8g instances. These instances are powered by AWS Graviton4 processors and deliver up to 30% better performance compared to AWS Graviton3-based instances. Amazon EC2 R8g instances are ideal for memory-intensive workloads such as databases, in-memory caches, and real-time big data analytics. These instances are […]

AWS – Amazon OpenSearch Service announces Natural Language Query Generation for log analysis
Amazon OpenSearch Service has added support for AI-powered Natural Language Query Generation in OpenSearch Dashboards Log Explorer. With Natural Language Query Generation, you can accelerate analysis by asking log exploration questions in plain English, which are then automatically translated into the relevant Piped Processing Language (PPL) queries and executed to fetch the requested data. […]