Welcome to above the clouds

AWS – Amazon Athena launches single sign-on support for drivers
Amazon Athena announces single sign-on support for its JDBC and ODBC drivers through AWS IAM Identity Center’s trusted identity propagation. This makes it simpler for organizations to manage end-user’s access to data when using 3rd party tools and implement identity-based data governance policies with a seamless sign-on experience. With this new capability, data teams can seamlessly […]

AWS – Introducing AWS CDK Refactor (Preview)
AWS Cloud Development Kit (CDK) CLI now enables safe infrastructure refactoring through the new ‘cdk refactor’ command in preview. This feature allows developers to rename constructs, move resources between stacks, and reorganize CDK applications while preserving the state of deployed resources. By leveraging AWS CloudFormation’s refactor capabilities with automated mapping computation, CDK Refactor eliminates the […]

AWS – AWS IoT SiteWise now supports retraining of anomaly detection models
Today, AWS announced new capabilities for native anomaly detection in AWS IoT SiteWise. This release includes automated model retraining, flexible promotion modes, and exposed model metrics, all designed to enhance the anomaly detection feature. The automated retraining capability allows models to be automatically retrained on a schedule ranging from a minimum of 30 days to […]

AWS – Amazon IVS now supports private ingest via interface VPC endpoints
Amazon Interactive Video Service (Amazon IVS) now supports media ingest via interface VPC endpoints powered by AWS PrivateLink. With this launch, you can securely broadcast RTMP(S) streams to IVS Low-Latency channels or IVS Real-Time stages without sending traffic over the public internet. You can create interface VPC endpoints to privately connect your applications to Amazon […]

GCP – Fast and efficient AI inference with new NVIDIA Dynamo recipe on AI Hypercomputer
As generative AI becomes more widespread, it’s important for developers and ML engineers to be able to easily configure infrastructure that supports efficient AI inference, i.e., using a trained AI model to make predictions or decisions based on new, unseen data. While great at training models, traditional GPU-based serving architectures struggle with the “multi-turn” nature […]

GCP – Scaling high-performance inference cost-effectively
At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache. Our inference solution is based on AI Hypercomputer, a system built on our experience running models like Gemini and Veo 3, which serve over 980 trillion tokens a month to […]

GCP – Deliver intuitive shopping experiences with Conversational Commerce agent
Consumer search behavior is shifting, with users now entering longer, more complex questions into search bars in pursuit of more relevant results. For instance, instead of a simple “best kids snacks,” queries have evolved to “What are some nutritious snack options for a 7-year-old’s birthday party?” However, many digital platforms have yet to adapt to […]

GCP – Our approach to carbon-aware data centers: Central data center fleet management
Data centers are the engines of the cloud, processing and storing the information that powers our daily lives. As digital services grow, so do our data centers and we are working to responsibly manage them. Google thinks of infrastructure at the full stack level, not just as hardware but as hardware abstracted through software, allowing […]

AWS – Amazon Bedrock AgentCore Gateway supports AWS PrivateLink invocation and invocation logging
Amazon Bedrock AgentCore Gateway now supports AWS PrivateLink invocation and invocation logging through Amazon CloudWatch, Amazon S3 and Amazon Data Firehose. Amazon Bedrock AgentCore Gateway provides an easy and secure way for developers to build, deploy, discover, and connect to agent tools at scale. With the PrivateLink support and invocation logging, you can apply network […]
GCP – Automate app deployment and security analysis with new Gemini CLI extensions
Find and fix security vulnerabilities. Deploy your app to the cloud. All without leaving your command-line. Today, we’re closing the gap between your terminal and the cloud with a first look at the future of Gemini CLI, delivered through two new extensions: security extension and Cloud Run extension. These extensions are designed to handle critical […]