Welcome to above the clouds
AWS – Amazon RDS for PostgreSQL supports pgvector 0.8.0
Amazon Relational Database Service (RDS) for PostgreSQL now supports pgvector 0.8.0, an open-source PostgreSQL extension for storing and efficiently querying vector embeddings in your database, letting you use retrieval-augmented generation (RAG) when building generative AI applications. The pgvector 0.8.0 release improves the PostgreSQL query planner’s index selection when filters are present, which […]
AWS – Amazon CloudWatch Logs launches the ability to transform and enrich logs
Amazon CloudWatch Logs announces log transformation and enrichment to improve log analytics at scale with a consistent, context-rich format. Customers can add structure to their logs using pre-configured templates for common AWS services such as AWS Web Application Firewall (WAF) and Route53, or build custom transformers with native parsers such as Grok. Customers can also rename […]
GCP – Announcing Mistral AI’s Large-Instruct-2411 on Vertex AI
In July, we announced the availability of Mistral AI’s models on Vertex AI: Codestral for code generation tasks, Mistral Large 2 for high-complexity tasks, and the lightweight Mistral Nemo for reasoning tasks like creative writing. Today, we’re announcing the availability of Mistral AI’s newest model on Vertex AI Model Garden: Mistral-Large-Instruct-2411 is now generally available […]
GCP – Make IAM for GKE easier to use with Workload Identity Federation
At Google Cloud, we work to continually improve our platform’s security capabilities to deliver the most trusted cloud. As part of this goal, we’re helping our users move away from less secure authentication methods, such as long-lived, unauditable service account keys, towards more secure alternatives when authenticating to Google Cloud APIs and services. In the […]
GCP – Announcing Mistral AI’s Large-Instruct-2411 and Codestral-2411 on Vertex AI
In July, we announced the availability of Mistral AI’s models on Vertex AI: Codestral for code generation tasks, Mistral Large 2 for high-complexity tasks, and the lightweight Mistral Nemo for reasoning tasks like creative writing. Today, we’re announcing the availability of Mistral AI’s newest models on Vertex AI Model Garden: Mistral-Large-Instruct-2411 is now generally available, […]
GCP – Don’t let resource exhaustion leave your users hanging: A guide to handling 429 errors
Large language models (LLMs) give developers immense power and scalability, but managing resource consumption is key to delivering a smooth user experience. LLMs demand significant computational resources, which means it’s essential to anticipate and handle potential resource exhaustion. If not, you might encounter 429 “resource exhaustion” errors, which can disrupt how users interact with your […]
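One common way to absorb 429 responses is to retry with exponential backoff and jitter. The sketch below is a minimal illustration under stated assumptions, not the approach described in the full post: `ResourceExhaustedError`, `call_with_backoff`, and all parameter names and defaults are hypothetical, and `request_fn` stands in for whatever client call your application makes.

```python
import random
import time


class ResourceExhaustedError(Exception):
    """Hypothetical stand-in for an HTTP 429 "resource exhausted" response."""


def call_with_backoff(request_fn, max_attempts=5, base_delay=1.0,
                      max_delay=32.0, sleep=time.sleep):
    """Call request_fn, retrying on ResourceExhaustedError.

    The wait ceiling doubles after each failed attempt (capped at
    max_delay), and the actual wait is drawn uniformly from
    [0, ceiling] so that many clients do not retry in lockstep.
    """
    for attempt in range(max_attempts):
        try:
            return request_fn()
        except ResourceExhaustedError:
            # Out of attempts: surface the error to the caller.
            if attempt == max_attempts - 1:
                raise
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            sleep(random.uniform(0, ceiling))
```

In practice you would map your client library's own 429 exception onto the `except` clause and tune `max_attempts` and `max_delay` to the latency your users can tolerate.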
AWS – Amazon Connect now supports nine additional languages for forecasting, capacity planning, and scheduling
Amazon Connect now supports nine additional languages for forecasting, capacity planning, and scheduling. New languages now supported include: Canadian French, Chinese (Simplified and Traditional), French, German, Italian, Japanese, Korean, Portuguese (Brazilian), and Spanish. These new languages are available in all AWS Regions where Amazon Connect forecasting, capacity planning, and scheduling are available. To learn more […]
AWS – OpenSearch’s vector engine adds support for UltraWarm on Amazon OpenSearch Service
UltraWarm is a fully managed, warm storage tier that’s designed to deliver cost savings on the Amazon OpenSearch Service. With OpenSearch 2.17+ domains, you can now store k-NN (vector) indexes on UltraWarm storage, reducing the cost of serving infrequently accessed k-NN indexes through warm and cold storage tiers. With UltraWarm storage, you can further cost […]
AWS – AWS Compute Optimizer now supports rightsizing recommendations for Amazon Aurora
AWS Compute Optimizer now provides recommendations for Amazon Aurora DB instances. These recommendations help you identify idle database instances and choose the optimal DB instance class, so you can reduce costs for unused resources and increase the performance of under-provisioned workloads. AWS Compute Optimizer automatically analyzes Amazon CloudWatch metrics such as CPU utilization, network throughput, […]
AWS – Amazon CloudFront now supports Anycast Static IPs
Amazon CloudFront introduces Anycast Static IPs, providing customers with a dedicated list of IP addresses for connecting to all CloudFront edge locations worldwide. Typically, CloudFront uses rotating IP addresses to serve traffic. Customers implementing Anycast Static IPs will receive a dedicated list of static IP addresses for their workloads. CloudFront Anycast Static IPs enables customers […]