Welcome to above the clouds
GCP – Running AI on fully managed GKE, now with new compute options, pricing and resource reservations
Kubernetes is a popular way to run AI workloads like training, and large language model (LLM) serving, including our new open model Gemma. Google Kubernetes Engine (GKE) in Autopilot mode provides a fully managed Kubernetes platform that offers the power and flexibility of Kubernetes but without the need to worry about compute nodes, so you […]
GCP – DZ BANK unlocks 70% toil savings and 90% cost savings with a Cloud Run-first approach
Editor’s note: DZ BANK is the second largest bank by assets in Germany. In this post, Cloud Engineer Tim Harpe from DZ BANK shares how migrating to Google Cloud resulted in spectacular efficiency gains and cost savings. DZ BANK chose Google Cloud to accelerate its digital transformation because Google Cloud offers cutting-edge technology, top-tier expertise, […]
GCP – Build generative AI applications with similarity search in Cloud SQL for MySQL
Generative AI is transforming application development across industries as developers build brand new user experiences that weren’t possible before. We’re already seeing customers like Linear build amazing new AI-powered applications with Google Cloud databases. Recently, we announced that you can now also use Cloud SQL for MySQL to perform similarity searches by indexing and searching […]
GCP – Confluent brings real-time capabilities to Google Cloud generative AI
In 2023, the spotlight was on generative AI (gen AI) and how it is paving the way for a new category of AI that can create and co-innovate with humans to produce new content, such as text, code, images, and music. Gen AI capabilities are not only promising but extremely powerful, given that large language […]
GCP – Dividends from data: Building a lean data stack for a Series C Fintech
It is often said that a journey of a thousand miles begins with a single step. 10 years ago, building a data technology stack felt a lot more like a thousand miles than it does today; technology, automation, and business understanding of the value of data have significantly improved. Instead, the problem today is knowing […]
GCP – Personalized Service Health now in the Google Cloud mobile app
To simplify incident management for businesses, in August 2023 we introduced Personalized Service Health to provide fast, transparent, relevant, and actionable communication about Google Cloud service disruptions. Today, we’re excited to announce that Personalized Service Health is available in the Google Cloud mobile app as well, making it easier for businesses to stay informed, from […]
GCP – Build supercharged gen AI applications with LangChain and Google Cloud databases
Generative AI is empowering developers — even those without experience in machine learning — to build transformative AI applications. In order to get started they need to integrate large language models (LLMs) and other foundation models with operational databases and craft prompts to pull relevant information from various data sources, including their existing enterprise systems. […]
GCP – How Glance improves database operations with Spanner
Editor’s note: Today we hear from Glance, which offers diverse and engaging content directly on the lock screen of Android Phones ranging from breaking news, gaming, entertainment, to personalized shopping recommendations, empowering individuals to effortlessly discover and engage with what they love, all at a single glance. The company recently migrated from Azure Cosmos DB […]
GCP – Regional vs. zonal GKE clusters: making the right choice for your workloads
Google Kubernetes Engine (GKE) empowers businesses to efficiently orchestrate, manage, and scale containerized applications within Google Cloud. When designing your GKE environment, a pivotal decision arises: selecting between a regional or zonal cluster. This choice significantly impacts your application’s availability, scalability, and cost-effectiveness. In this blog post, we delve into the characteristics and considerations associated […]
GCP – Announcing Anthropic’s Claude 3 models in Google Cloud Vertex AI
At Google Cloud, we’re committed to empowering customer choice and innovation through our curated collection of first-party, open-source, and third-party models available in Vertex AI. That’s why we’re thrilled to announce that Claude 3 — Anthropic’s new family of state-of-the-art models — will be generally available in Vertex AI Model Garden over the coming weeks, […]