Welcome to above the clouds
GCP – Toyota shifts into overdrive: Developing an AI platform for enhanced manufacturing efficiency
The automotive industry is facing a profound transformation, driven by the rise of CASE: connected cars, autonomous and automated driving, shared mobility, and electrification. Simultaneously, manufacturers face the imperative to further increase efficiency, automate manufacturing, and improve quality. AI has emerged as a critical enabler of this evolution. In this dynamic landscape, Toyota turned […]
GCP – Understand how your users are using Gemini for Google Cloud with Cloud Logging and Monitoring
From helping your developers write better code faster with Code Assist, to helping cloud operators more efficiently manage usage with Cloud Assist, Gemini for Google Cloud is your personal AI-powered assistant. However, understanding exactly how your internal users are using Gemini has been a challenge — until today. Today we are announcing that Cloud Logging […]
GCP – How Fullstory uncovers user insights with Vertex AI serving Gemini 1.5 Pro
Mapping the user experience is one of the most persistent challenges a business can face. Fullstory, a leading behavioral data analytics platform, helps organizations identify pain points and optimize digital experiences by reproducing user sessions and sharing strong analytics highlighting areas for improvement in the customer’s journey. This boosts conversion rates, reduces churn, and enhances […]
GCP – To avoid “bill shocks,” Palo Alto Networks deploys custom AI-powered cost anomaly detection
In today’s fast-paced digital world, businesses are constantly seeking innovative ways to leverage cutting-edge technologies to gain a competitive edge. AI has emerged as a transformative force, empowering organizations to automate complex processes, gain valuable insights from data, and deliver exceptional customer experiences. However, with the rapid adoption of AI comes a significant challenge: managing […]
AWS – Amazon EC2 Hpc7a instances are now available in Europe (Paris) region
Starting today, Amazon EC2 Hpc7a instances are available in an additional AWS Region, Europe (Paris). EC2 Hpc7a instances are powered by 4th generation AMD EPYC processors with up to 192 cores and 300 Gbps of Elastic Fabric Adapter (EFA) network bandwidth for fast and low-latency internode communications. Hpc7a instances feature Double Data Rate 5 (DDR5) memory, […]
AWS – Amazon EC2 Hpc6id instances are now available in Europe (Paris) region
Starting today, Amazon EC2 Hpc6id instances are available in an additional AWS Region, Europe (Paris). These instances are optimized to efficiently run memory bandwidth-bound, data-intensive high performance computing (HPC) workloads, such as finite element analysis and seismic reservoir simulations. With EC2 Hpc6id instances, you can lower the cost of your HPC workloads while taking advantage of […]
AWS – Amazon Aurora now available as a quick create vector store in Amazon Bedrock Knowledge Bases
Amazon Aurora PostgreSQL is now available as a quick create vector store in Amazon Bedrock Knowledge Bases. With the new Aurora quick create option, developers and data scientists building generative AI applications can select Aurora PostgreSQL as their vector store with one click to deploy an Aurora Serverless cluster preconfigured with pgvector in minutes. Aurora […]
AWS – SageMaker SDK enhances training and inference workflows
Today, we are introducing the new ModelTrainer class and enhancing the ModelBuilder class in the SageMaker Python SDK. These updates streamline training workflows and simplify inference deployments. The ModelTrainer class enables customers to easily set up and customize distributed training strategies on Amazon SageMaker. This new feature accelerates model training times, optimizes resource utilization, and […]
AWS – Amazon SageMaker introduces new capabilities to accelerate scaling of Generative AI Inference
We are excited to announce two new capabilities in SageMaker Inference that significantly enhance the deployment and scaling of generative AI models: Container Caching and Fast Model Loader. These innovations address critical challenges in scaling large language models (LLMs) efficiently, enabling faster response times to traffic spikes and more cost-effective scaling. By reducing model loading […]
AWS – Amazon RDS Performance Insights extends On-demand Analysis to new regions
Amazon RDS (Relational Database Service) Performance Insights expands the availability of its on-demand analysis experience to 15 new regions. This feature is available for Aurora MySQL, Aurora PostgreSQL, and RDS for PostgreSQL engines. This on-demand analysis experience, which was previously available in only 15 regions, is now available in all commercial regions. This feature allows […]