Welcome to above the clouds
AWS – Knowledge Bases for Amazon Bedrock supports Anthropic’s Claude 3.5 Sonnet
Knowledge Bases for Amazon Bedrock securely connects foundation models (FMs) to internal company data sources for Retrieval Augmented Generation (RAG) to deliver relevant, context-specific, and accurate responses. Anthropic’s Claude 3.5 Sonnet foundation model is now generally available on Knowledge Bases. Anthropic’s Claude 3.5 Sonnet—the first model in the forthcoming Claude 3.5 model family—has a 200,000 […]
AWS – Amazon Connect provides new ways to configure callbacks
Amazon Connect now allows you to configure flows to take actions on callbacks prior their creation and while they are in queue. For example, you can now automate sending a notification to a customer via SMS before calling them back, update callback attributes based on latest customer data for agents to reference, or even terminate […]
GCP – Choosing between self-hosted GKE and managed Vertex AI to host AI models
In today’s technology landscape, building or modernizing applications demands a clear understanding of your business goals and use cases. This insight is crucial for leveraging emerging tools effectively, especially generative AI foundation models such as large language models (LLMs). LLMs offer significant competitive advantages, but implementing them successfully hinges on a thorough grasp of your […]
GCP – Maximize your LLM serving throughput for GPUs on GKE — a practical guide
Let’s face it: Serving AI foundation models such as large language models (LLMs) can be expensive. Between the need for hardware accelerators to achieve lower latency and the fact that these accelerators are typically not efficiently utilized, organizations need an AI platform that can serve LLMs at scale while minimizing the cost per token. Through […]
AWS – CloudFormation simplifies resource discovery and template review in the IaC Generator
Today, AWS CloudFormation announces two new enhancements to the IaC generator, which customers use to create infrastructure-as-code (IaC) from existing resources. Now, after the IaC generator finishes scanning the resources in an account, it presents a graphical summary of the different resource types to help customers more quickly find the resources they want to include […]
AWS – Amazon Q now provides more details about user subscriptions and associated resources
The Amazon Q Console now provides administrators with greater visibility into how users are utilizing Amazon Q Developer Pro, Amazon Q Business Pro, and Amazon Q Business Lite subscriptions. This new feature enables administrators to view a list of subscribed users, their subscription status (e.g., active, pending, under free trial, canceled), and their corresponding associations. […]
AWS – Amazon DocumentDB (with MongoDB Compatibility) Global Clusters introduces Failover
Amazon DocumentDB now supports Global Cluster Failover, a fully managed experience for performing a cross-region failover to respond to unplanned events such as a regional outage. With Global Cluster Failover, you can convert a secondary region into the new primary region in typically a minute and also maintain the multi-region Global Cluster configuration. An Amazon […]
AWS – AWS Identity and Access Management now supports AWS PrivateLink in all commercial Regions
AWS Identity and Access Management (IAM) now supports AWS PrivateLink in all commercial AWS Regions. With IAM, you can specify who or what can access services and resources in AWS by creating and managing resources such as IAM roles and policies. You can now establish private connection between your virtual private cloud (VPC) and IAM […]
AWS – Amazon EC2 C7i-flex instances are now available in US East (N. Virginia) Region
Starting today, Amazon Elastic Compute Cloud (Amazon EC2) C7i-flex instances that deliver up to 19% better price performance compared to C6i instances, are available in US East (N. Virginia) region. C7i-flex instances expand the EC2 Flex instances portfolio to provide the easiest way for you to get price performance benefits for a majority of compute […]
AWS – AWS Resource Access Manager is now available in the AWS Asia Pacific (Malaysia) Region
You can now use AWS Resource Access Manager (AWS RAM) in the AWS Asia Pacific (Malaysia) Region. AWS RAM helps you securely share your resources across your organization, with specific organizational units (OUs), or with individual AWS accounts. You can centrally create a resource and then share that resource using AWS RAM to reduce the […]