Welcome to above the clouds

GCP – Introducing stronger default Org Policies for our customers
Google Cloud strives to make good security outcomes easier for customers to achieve. As part of this continued effort, we are releasing an updated and stronger set of security defaults that are automatically implemented for new customers. Google Cloud customers with a verified domain receive an organization resource, which is used as the root of […]

GCP – Anthropic’s Claude 3 Sonnet and Claude 3 Haiku are now generally available on Vertex AI
Earlier this month, we shared the news that Anthropic’s Claude 3 family of models would soon be available to Google Cloud customers on Vertex AI Model Garden. Today, we’re announcing that Claude 3 Sonnet and Claude 3 Haiku are generally available to all customers on Vertex AI. Claude 3 Opus, Anthropic’s most capable and intelligent […]

GCP – Advanced scheduling for AI/ML with Ray and Kueue
Ray is an open-source unified compute framework gaining popularity among developers for its ability to easily scale AI/ML and Python applications. KubeRay offers a solution to harness the power of Ray within Google Kubernetes Engine (GKE). It serves as an orchestrator for Ray clusters, leveraging Kubernetes APIs as the foundational layer for compute, networking, and […]

GCP – How to improve resilience to DDoS attacks with Cloud Armor Advanced rate limiting capabilities
In recent years, we’ve been seeing an exponential surge in the number, volume, and complexity of distributed denial-of-service (DDoS) attacks. In September 2023, we saw the largest DDoS attack to date, which peaked at 398 million requests per second. As the threat landscape evolves, enterprises face a pressing need for effective measures to mitigate these […]

GCP – Automatic driver installation simplifies using NVIDIA GPUs in GKE
As Artificial Intelligence and Machine Learning (AI/ML) models grow larger, training and inference applications demand accelerated compute such as NVIDIA GPUs. Google Kubernetes Engine (GKE) is a fully managed Kubernetes service that simplifies container orchestration, and has become the platform of choice to deploy, scale, and manage custom ML platforms. GKE can now automatically install […]

GCP – Unify analytics with Spark procedures in BigQuery, now generally available
BigQuery is powered by a highly scalable and capable SQL engine that can handle large data volumes with standard SQL, and that offers advanced capabilities such as BigQuery ML, remote functions, vector search, and more. However, there are cases where you may need to leverage open-source Apache Spark expertise or existing Spark-based business logic to […]

GCP – BigQuery customers save up to 54% in TCO compared to alternative cloud data platforms
Over the last two decades, cloud-based data management and analytics solutions have become more cost effective and flexible than their on-premises counterparts. But even among cloud providers there are differences in cost, flexibility, scalability, and AI readiness that impact a business’s bottom line. Choosing the right enterprise data warehouse (EDW) solution requires designing an effective […]

GCP – Accelerate your generative AI journey with NVIDIA NeMo framework on GKE
Background Ever since generative AI gained prominence in the AI field, organizations ranging from startups to large enterprises have moved to harness its power by making it an integral part of their applications, solutions, and platforms. While the true potential of generative AI lies in creating new content based on learning from existing content, it […]

GCP – Google named a Leader in The Forrester Wave: AI Infrastructure Solutions, Q1 2024
Today, we are excited to announce that Forrester Research has recognized Google as a Leader in The Forrester Wave™: AI Infrastructure Solutions, Q1 2024. We believe this is a testament to our vision and strong track record of delivering continuous innovation and leading AI infrastructure products for our customers. Google received the highest scores of […]
GCP – Why GKE for your Ray AI workloads? Portability, scalability, manageability, cost
The revolution in generative AI (gen AI) and large language models (LLMs) is leading to larger model sizes and increased demands on the compute infrastructure. Organizations looking to integrate these advancements into their applications increasingly require distributed computing solutions that offer minimal scheduling overhead. As the need for scalable gen AI solutions grows, Ray, an […]