Welcome to above the clouds

GCP – Build and Deploy a Remote MCP Server to Google Cloud Run in Under 10 Minutes
Integrating context from tools and data sources into LLMs can be challenging, which impacts ease-of-use in the development of AI agents. To address this challenge, Anthropic introduced the Model Context Protocol (MCP), which standardizes how applications provide context to LLMs. Imagine you want to build an MCP server for your API to make it available […]

AWS – AWS Network Firewall now supports AWS Transit Gateway native integration
AWS Network Firewall now supports native integration with AWS Transit Gateway for simplified deployment and management of network security across your global AWS infrastructure. This capability is available in 5 AWS Regions, allowing customers to implement security controls more efficiently. AWS Transit Gateway interconnects your Amazon Virtual Private Clouds (VPCs) and on-premises networks, while AWS […]

AWS – AWS Compute Optimizer now identifies idle EC2 Auto Scaling groups with GPU instances
AWS Compute Optimizer now detects idle EC2 Auto Scaling groups using G and P instance types, enabling you to identify additional savings opportunities in your AWS spend. As AI development accelerates, organizations are creating more Auto Scaling groups with G and P instance types for training and inference workloads. Once you enable the NVIDIA CloudWatch […]

AWS – Amazon RDS for MySQL announces Innovation Release 9.3 in Amazon RDS Database Preview Environment
Amazon RDS for MySQL now supports community MySQL Innovation Release 9.3 in the Amazon RDS Database Preview Environment, allowing you to evaluate the latest Innovation Release on Amazon RDS for MySQL. You can deploy MySQL 9.3 in the Amazon RDS Database Preview Environment which provides the benefits of a fully managed database, making it simpler […]

AWS – Amazon S3 extends additional context for HTTP 403 Access Denied error messages to AWS Organizations
Amazon S3 now includes additional context in HTTP 403 Access Denied errors for requests made to resources in accounts within the same AWS Organization. This context includes the type of policy that denied access, the reason for denial, and information on the AWS Identity and Access Management (IAM) user or role that requested access to […]

GCP – Save early and often with multi-tier checkpointing to optimize large AI training jobs
As foundation model training infrastructure scales to tens of thousands of accelerators, efficient utilization of those high-value resources becomes paramount. In particular, as the cluster gets larger, hardware failures become more frequent (~ few hours) and recovery from previously saved checkpoints becomes slower (up to 30 minutes), significantly slowing down training progress. A checkpoint represents […]

GCP – How Google Cloud is securing open-source credentials at scale
Credentials are an essential part of modern software development and deployment, granting bearers privileged access to systems, applications, and data. However, credential-related vulnerabilities remain the predominant entry point exploited by threat actors in the cloud. Stolen credentials “are now the second-highest initial infection vector, making up 16% of our investigations,” said Jurgen Kutscher, vice-president, Mandiant […]

GCP – Build a multi-agent KYC workflow in three steps using Google’s Agent Development Kit and Gemini
Know Your Customer (KYC) processes are foundational to any Financial Services Institution’s (FSI) regulatory compliance practices and risk mitigation strategies. KYC is how financial institutions verify the identity of their customers and assess associated risks. But as customers expect instant approvals, FSIs face pressure to streamline their manual, time-consuming and error-prone KYC processes. The good […]

GCP – Simplify your multi-cloud strategy with Cloud Location Finder, now in preview
As cloud environments expand beyond traditional architectures to include multiple clouds, managing your infrastructure effectively becomes more complex. Imagine easily accessing consistent and up-to-date location information across different cloud providers, so your multi-cloud applications are designed and optimized with performance, security, and regulatory compliance in mind. Today, we are making this a reality with Cloud […]
GCP – C4D now GA: up to 80% higher performance for your business critical workloads
We’re excited to announce the general availability of our next-generation C4D virtual machine family. Powered by 5th Gen AMD EPYC processors (Turin) paired with Google Titanium’s latest advancements, C4D provides customers with meaningful performance improvements — up to 80% higher throughput for web serving and 30% better performance for general computing workloads compared to the […]