Welcome to above the clouds

GCP – Announcing Terraform Google Provider 6.0.0: More Flexibility, Better Control
The Terraform Google Provider v6.0.0 is now GA. Since the last major Terraform provider release in September 2023, we have been listening closely to the community’s feedback. Discussed below are the primary enhancements and bug fixes that this major release focuses on. Support for earlier versions of Terraform will not change as a result of […]

GCP – A multimodal search solution using NLP, BigQuery and embeddings
Today’s digital landscape offers a vast sea of information, encompassing not only text, but also images and videos. Traditional enterprise search engines were primarily designed for text-based queries, and often fall short when it comes to analyzing visual content. However, with a combination of natural language processing (NLP) and multimodal embeddings, a new era of […]

GCP – The modern marketer’s strategic advantage: AI-powered data clean rooms
Businesses across all industries crave data to better understand their customers and drive sales. Imagine a major consumer packaged goods brand that primarily sells through a large retailer. This brand could gain valuable insights by understanding the key actions / high-value assets (HVAs) customers take on the retailer’s website before making a purchase. Although this […]

AWS – Knowledge Bases for Amazon Bedrock supports Anthropic’s Claude 3.5 Sonnet
Knowledge Bases for Amazon Bedrock securely connects foundation models (FMs) to internal company data sources for Retrieval Augmented Generation (RAG) to deliver relevant, context-specific, and accurate responses. Anthropic’s Claude 3.5 Sonnet foundation model is now generally available on Knowledge Bases. Anthropic’s Claude 3.5 Sonnet—the first model in the forthcoming Claude 3.5 model family—has a 200,000 […]

AWS – Amazon Connect provides new ways to configure callbacks
Amazon Connect now allows you to configure flows to take actions on callbacks prior their creation and while they are in queue. For example, you can now automate sending a notification to a customer via SMS before calling them back, update callback attributes based on latest customer data for agents to reference, or even terminate […]

GCP – Choosing between self-hosted GKE and managed Vertex AI to host AI models
In today’s technology landscape, building or modernizing applications demands a clear understanding of your business goals and use cases. This insight is crucial for leveraging emerging tools effectively, especially generative AI foundation models such as large language models (LLMs). LLMs offer significant competitive advantages, but implementing them successfully hinges on a thorough grasp of your […]

GCP – Maximize your LLM serving throughput for GPUs on GKE — a practical guide
Let’s face it: Serving AI foundation models such as large language models (LLMs) can be expensive. Between the need for hardware accelerators to achieve lower latency and the fact that these accelerators are typically not efficiently utilized, organizations need an AI platform that can serve LLMs at scale while minimizing the cost per token. Through […]

AWS – CloudFormation simplifies resource discovery and template review in the IaC Generator
Today, AWS CloudFormation announces two new enhancements to the IaC generator, which customers use to create infrastructure-as-code (IaC) from existing resources. Now, after the IaC generator finishes scanning the resources in an account, it presents a graphical summary of the different resource types to help customers more quickly find the resources they want to include […]

AWS – Amazon Q now provides more details about user subscriptions and associated resources
The Amazon Q Console now provides administrators with greater visibility into how users are utilizing Amazon Q Developer Pro, Amazon Q Business Pro, and Amazon Q Business Lite subscriptions. This new feature enables administrators to view a list of subscribed users, their subscription status (e.g., active, pending, under free trial, canceled), and their corresponding associations. […]
AWS – Amazon DocumentDB (with MongoDB Compatibility) Global Clusters introduces Failover
Amazon DocumentDB now supports Global Cluster Failover, a fully managed experience for performing a cross-region failover to respond to unplanned events such as a regional outage. With Global Cluster Failover, you can convert a secondary region into the new primary region in typically a minute and also maintain the multi-region Global Cluster configuration. An Amazon […]