Welcome to above the clouds

AWS – Amazon OpenSearch Service now supports Custom Plugins
Amazon OpenSearch Service introduces Custom Plugins, a new plugin management option that allows you to extend OpenSearch functionality and deliver personalized experiences for applications such as website search, log analytics, application monitoring, and observability. OpenSearch provides a rich set of search and analysis capabilities, and with custom plugins, you can extend these further to meet […]

AWS – Introducing an AWS Management Console Visual Update (Preview)
Now available in Preview, the visual update in the AWS Management Console helps customers scan content, focus on the key information, and find what they are looking for more effectively, while preserving the familiar and consistent experience. The new, modern layout also provides easy access to contextual tools. Customers now benefit from optimized information density […]

AWS – Amazon RDS for PostgreSQL supports pgvector 0.8.0
Amazon Relational Database Service (RDS) for PostgreSQL now supports pgvector 0.8.0, an open-source extension for PostgreSQL for storing and efficiently querying vector embeddings in your database, letting you use retrieval-augmented generation (RAG) when building your generative AI applications. The pgvector 0.8.0 release includes improvements to the PostgreSQL query planner’s index selection when filters are present, which […]

AWS – Amazon CloudWatch Logs launches the ability to transform and enrich logs
Amazon CloudWatch Logs announces log transformation and enrichment to improve log analytics at scale with a consistent, context-rich format. Customers can add structure to their logs using pre-configured templates for common AWS services such as AWS Web Application Firewall (WAF) and Route 53, or build custom transformers with native parsers such as Grok. Customers can also rename […]

GCP – Announcing Mistral AI’s Large-Instruct-2411 on Vertex AI
In July, we announced the availability of Mistral AI’s models on Vertex AI: Codestral for code generation tasks, Mistral Large 2 for high-complexity tasks, and the lightweight Mistral Nemo for reasoning tasks like creative writing. Today, we’re announcing the availability of Mistral AI’s newest model on Vertex AI Model Garden: Mistral-Large-Instruct-2411 is now generally available […]

GCP – Make IAM for GKE easier to use with Workload Identity Federation
At Google Cloud, we work to continually improve our platform’s security capabilities to deliver the most trusted cloud. As part of this goal, we’re helping our users move away from less secure authentication methods such as long-lived, unauditable, service account keys towards more secure alternatives when authenticating to Google Cloud APIs and services. In the […]

GCP – Announcing Mistral AI’s Large-Instruct-2411 and Codestral-2411 on Vertex AI
In July, we announced the availability of Mistral AI’s models on Vertex AI: Codestral for code generation tasks, Mistral Large 2 for high-complexity tasks, and the lightweight Mistral Nemo for reasoning tasks like creative writing. Today, we’re announcing the availability of Mistral AI’s newest models on Vertex AI Model Garden: Mistral-Large-Instruct-2411 is now generally available, […]

GCP – Don’t let resource exhaustion leave your users hanging: A guide to handling 429 errors
Large language models (LLMs) give developers immense power and scalability, but managing resource consumption is key to delivering a smooth user experience. LLMs demand significant computational resources, which means it’s essential to anticipate and handle potential resource exhaustion. If not, you might encounter 429 “resource exhaustion” errors, which can disrupt how users interact with your […]
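
One common way to handle 429 errors on the client side is to retry with exponential backoff and jitter. The sketch below is illustrative only and is not taken from the linked guide: `ResourceExhaustedError` and `call_with_backoff` are hypothetical names, and the exception stands in for whatever error your client library raises on a 429 response.

```python
import random
import time


class ResourceExhaustedError(Exception):
    """Hypothetical stand-in for the error a client raises on HTTP 429."""


def call_with_backoff(request, max_attempts=5, base_delay=1.0,
                      max_delay=32.0, sleep=time.sleep):
    """Call request() and retry on ResourceExhaustedError.

    The wait before each retry is a random value between 0 and an
    exponentially growing cap (base_delay * 2**attempt, limited to
    max_delay), which spreads retries from many clients over time.
    """
    for attempt in range(max_attempts):
        try:
            return request()
        except ResourceExhaustedError:
            # Out of attempts: surface the error to the caller.
            if attempt == max_attempts - 1:
                raise
            cap = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, cap))
```

The `sleep` parameter is injectable so the retry logic can be exercised without real waiting; in production code the default `time.sleep` applies.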

AWS – Amazon Connect now supports nine additional languages for forecasting, capacity planning, and scheduling
Amazon Connect now supports nine additional languages for forecasting, capacity planning, and scheduling. New languages now supported include: Canadian French, Chinese (Simplified and Traditional), French, German, Italian, Japanese, Korean, Portuguese (Brazilian), and Spanish. These new languages are available in all AWS Regions where Amazon Connect forecasting, capacity planning, and scheduling are available. To learn more […]

AWS – OpenSearch’s vector engine adds support for UltraWarm on Amazon OpenSearch Service
UltraWarm is a fully managed, warm storage tier that’s designed to deliver cost savings on the Amazon OpenSearch Service. With OpenSearch 2.17+ domains, you can now store k-NN (vector) indexes on UltraWarm storage, reducing the cost of serving infrequently accessed k-NN indexes through warm and cold storage tiers. With UltraWarm storage, you can further cost […]