Welcome to above the clouds
AWS – Application Signals now supports burn rate for application performance goals
Amazon CloudWatch Application Signals, an application performance monitoring (APM) feature in CloudWatch, makes it easy to automatically instrument and track application performance against their most important business or service level objectives (SLOs). Customers can now receive alerts when these SLOs reach a critical burn rate. This new feature allows you to calculate how quickly your […]
AWS – Amazon GameLift adds containers for faster dev iteration and simplified management
We are excited to announce Amazon GameLift now supports containers for building, deploying, and running game server packages. Amazon GameLift is a fully managed service that allows developers to quickly manage and scale dedicated game servers for multiplayer games. With this new capability, Amazon GameLift supports end-to-end development of containerized workloads, including deployment and scaling […]
AWS – AWS Client VPN now supports the latest Ubuntu OS versions – 22.04 LTS and 24.04 LTS
AWS Client VPN now supports Linux desktop client with Ubuntu versions 22.04 LTS and 24.04 LTS. You can now run the AWS supplied VPN client on the latest Ubuntu OS versions. AWS Client VPN desktop clients are available free of charge, and can be downloaded here. AWS Client VPN is a managed service that securely […]
AWS – Amazon Timestream for InfluxDB is now available in China regions
You can now use Amazon Timestream for InfluxDB in the Amazon Web Services China (Beijing) Region, operated by Sinnet and Amazon Web Services China (Ningxia) Region, operated by NWC. Timestream for InfluxDB makes it easy for application developers and DevOps teams to run fully managed InfluxDB databases on Amazon Web Services for real-time time-series applications […]
AWS – AWS Directory Service is available in the AWS Asia Pacific (Malaysia) Region
AWS Directory Service for Microsoft Active Directory, also known as AWS Managed Microsoft AD, and AD Connector are now available in the AWS Asia Pacific (Malaysia) Region. Built on actual Microsoft Active Directory (AD), AWS Managed Microsoft AD enables you to migrate AD-aware applications while reducing the work of managing AD infrastructure in the AWS […]
AWS – Amazon EC2 Mac instances now available in AWS Canada (Central) Region
Starting today, Amazon Elastic Compute Cloud (Amazon EC2) M2 Mac instances are now generally available (GA) in the AWS Canada (Central) region. This marks the first time we are introducing Mac instances to an AWS Canadian region, providing customers with even greater global accessibility to Apple silicon hardware. Customers can now run their macOS workloads […]
AWS – AWS Control Tower launches the ability to resolve drift for optional controls
AWS Control Tower customers can now use the ResetEnabledControl API to programmatically resolve the control drift or re-deploy the control to its intended configuration. A control drift occurs when the AWS Control Tower managed control is modified outside the AWS Control Tower governance. Resolving drift helps you to adhere to your governance and compliance requirements. […]
GCP – Data loading best practices for AI/ML inference on GKE
As AI models increase in sophistication, there’s increasingly large model data needed to serve them. Loading the models and weights along with necessary frameworks to serve them for inference can add seconds or even minutes of scaling delay, impacting both costs and the end-user’s experience. For example, inference servers such as Triton, Text Generation Inference […]
GCP – Empower your teams with self-service Kubernetes using GKE fleets and Argo CD
Managing applications across multiple Kubernetes clusters is complex, especially when those clusters span different environments or even cloud providers. One powerful and secure solution combines Google Kubernetes Engine (GKE) fleets and, Argo CD, a declarative, GitOps continuous delivery tool for Kubernetes. The solution is further enhanced with Connect Gateway and Workload Identity. This blog post […]
GCP – Unlocking LLM training efficiency with Trillium — a performance analysis
Rapidly evolving generative AI models place unprecedented demands on the performance and efficiency of hardware accelerators. Last month, we launched our sixth-generation Tensor Processing Unit (TPU), Trillium, to address the demands of next-generation models. Trillium is purpose-built for performance at scale, from the chip to the system to our Google data center deployments, to power […]