Welcome to above the clouds
GCP – Announcing Ironwood TPUs General Availability and new Axion VMs to power the age of inference
Today’s frontier models, including Google’s Gemini, Veo, Imagen, and Anthropic’s Claude train and serve on Tensor Processing Units (TPUs). For many organizations, the focus is shifting from training these models to powering useful, responsive interactions with them. Constantly shifting model architectures, the rise of agentic workflows, plus near-exponential growth in demand for compute, define this […]
GCP – Announcing Axion C4A metal: Arm-based Axion VMs for specialized use cases
Today, we are thrilled to announce C4A metal, our first bare metal instance running on Google Axion processors, available in preview soon. C4A metal is designed for specialized workloads that require direct hardware access and Arm®-native compatibility. Now, organizations running environments such as Android development, automotive simulation, CI/CD pipelines, security workloads, and custom hypervisors can […]
GCP – Unlock 2x better price-performance with Axion-based N4A VMs, now in preview
Decision makers and builders today face a constant challenge: managing rising cloud costs while delivering the performance their customers demand. As applications evolve to use scale-out microservices and handle ever-growing data volumes, organizations need maximum efficiency from their underlying infrastructure to support their growing general-purpose workloads. To meet this need, we’re excited to announce our […]
GCP – From silicon to softmax: Inside the Ironwood AI stack
As machine learning models continue to scale, a specialized, co-designed hardware and software stack is no longer optional, it’s critical. Ironwood, our latest generation Tensor Processing Unit (TPU), is the cutting-edge hardware behind advanced models like Gemini and Nano Banana, from massive-scale training to high-throughput, low-latency inference. This blog details the core components of Google’s […]
GCP – Your First AI Application is Easier Than You Think
If you’re a developer, you’ve seen generative AI everywhere. It can feel like a complex world of models and advanced concepts. It can be difficult to know where to actually start. The good news is that building your first AI-powered application is more accessible than you might imagine. You don’t need to be an AI expert […]
AWS – AWS End User Messaging SMS launches Carrier Lookup
Starting today, AWS End User Messaging customers can now lookup carrier information related to a phone number including the country, number type, dialing code, and mobile network and carrier codes. With Carrier Lookup, you can increase deliverability by checking important information about a phone number before you start sending messages, avoiding sending messages to the […]
AWS – Amazon CloudFront announces cross-account support for VPC origins
Amazon CloudFront announces cross-account support for Virtual Private Cloud (VPC) origins, enabling customers to access VPC origins that reside in different AWS accounts from their CloudFront distributions. With VPC origins, customers can have their Application Load Balancers (ALB), Network Load Balancers (NLB), and EC2 Instances in a private subnet that is accessible only through their […]
AWS – Amazon CloudWatch Database Insights expands anomaly detection in on-demand analysis
Amazon CloudWatch Database Insights now detects anomalies on additional metrics through its on-demand analysis experience. Database Insights is a monitoring and diagnostics solution that helps database administrators and application developers optimize database performance by providing comprehensive visibility into database metrics, query performance, and resource utilization patterns. The on-demand analysis feature utilizes machine learning to help […]
AWS – Amazon FSx now integrates with AWS Secrets Manager for enhanced management of Active Directory credentials
Amazon FSx now integrates with AWS Secrets Manager, enabling enhanced protection and management of the Active Directory domain service account credentials for your FSx for Windows File Server file systems and FSx for NetApp ONTAP Storage Virtual Machines (SVMs). Previously, if you wanted to join your FSx for Windows file system or FSx for ONTAP […]
GCP – More ways to build, scale, and govern AI agents with Vertex AI Agent Builder
Many developers are prototyping AI agents, but moving to a scalable, secure, and well-managed production agent is far more complex. Vertex AI Agent Builder is Google Cloud’s comprehensive and open platform to build, scale, and govern reliable agents. As a suite of products, it provides the choice builders need to create powerful agentic systems at […]
