Welcome to above the clouds
AWS – Amazon IVS now supports private ingest via interface VPC endpoints
Amazon Interactive Video Service (Amazon IVS) now supports media ingest via interface VPC endpoints powered by AWS PrivateLink. With this launch, you can securely broadcast RTMP(S) streams to IVS Low-Latency channels or IVS Real-Time stages without sending traffic over the public internet. You can create interface VPC endpoints to privately connect your applications to Amazon […]
GCP – Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer
As generative AI becomes more widespread, it’s important for developers and ML engineers to be able to easily configure infrastructure that supports efficient AI inference, i.e., using a trained AI model to make predictions or decisions based on new, unseen data. While great at training models, traditional GPU-based serving architectures struggle with the “multi-turn” nature […]
GCP – Scaling high-performance inference cost-effectively
At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache. Our inference solution is based on AI Hypercomputer, a system built on our experience running models like Gemini and Veo 3, which serve over 980 trillion tokens a month to […]
GCP – Deliver intuitive shopping experiences with Conversational Commerce agent
Consumer search behavior is shifting, with users now entering longer, more complex questions into search bars in pursuit of more relevant results. For instance, instead of a simple “best kids snacks,” queries have evolved to “What are some nutritious snack options for a 7-year-old’s birthday party?” However, many digital platforms have yet to adapt to […]
GCP – Our approach to carbon-aware data centers: Central data center fleet management
Data centers are the engines of the cloud, processing and storing the information that powers our daily lives. As digital services grow, so do our data centers, and we are working to manage them responsibly. Google thinks of infrastructure at the full-stack level, not just as hardware but as hardware abstracted through software, allowing […]
AWS – Amazon Bedrock AgentCore Gateway supports AWS PrivateLink invocation and invocation logging
Amazon Bedrock AgentCore Gateway now supports AWS PrivateLink invocation and invocation logging through Amazon CloudWatch, Amazon S3 and Amazon Data Firehose. Amazon Bedrock AgentCore Gateway provides an easy and secure way for developers to build, deploy, discover, and connect to agent tools at scale. With the PrivateLink support and invocation logging, you can apply network […]
GCP – Automate app deployment and security analysis with new Gemini CLI extensions
Find and fix security vulnerabilities. Deploy your app to the cloud. All without leaving your command line. Today, we’re closing the gap between your terminal and the cloud with a first look at the future of Gemini CLI, delivered through two new extensions: a security extension and a Cloud Run extension. These extensions are designed to handle critical […]
AWS – Amazon EC2 C6in instances are now available in Asia Pacific (Thailand)
Starting today, Amazon Elastic Compute Cloud (Amazon EC2) C6in instances are available in the AWS Asia Pacific (Thailand) Region. These sixth-generation network-optimized instances, powered by 3rd Generation Intel Xeon Scalable processors and built on the AWS Nitro System, deliver up to 200 Gbps of network bandwidth, 2x the network bandwidth of comparable fifth-generation instances. Customers can […]
AWS – Fault Injection Service is now available in the Europe (Zurich) Region
AWS Fault Injection Service (FIS) is a fully managed service for running controlled fault injection experiments to improve application performance, observability, and resilience. Customers can test how their applications and people respond to real-world scenarios, including AZ Availability: Power Interruption and Cross-Region: Connectivity. Customers can create experiment templates in FIS to integrate experiments with continuous […]
AWS – Amazon Managed Service for Prometheus now available in the AWS GovCloud (US) Regions
Amazon Managed Service for Prometheus is now available in the AWS GovCloud (US) Regions. Amazon Managed Service for Prometheus is a fully managed Prometheus-compatible monitoring service that makes it easy to monitor and alarm on operational metrics at scale. The list of all supported regions where Amazon Managed Service for Prometheus is generally available can […]
