Amazon Relational Database Service (Amazon RDS) for SQL Server now supports new minor versions for SQL Server 2019 (CU32 – 15.0.4430.1) and SQL Server 2022 (CU18 – 16.0.4185.3). These minor versions include performance improvements and bug fixes, and are available for SQL Server Express, Web, Standard, and Enterprise editions. Review the Microsoft release notes for CU32 and CU18 for details.
We recommend that you upgrade to the latest minor versions to benefit from the performance improvements and bug fixes. You can upgrade with just a few clicks in the Amazon RDS Management Console or by using the AWS SDK or CLI. Learn more about upgrading your database instances from the Amazon RDS User Guide.
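For example, here is a minimal sketch of the upgrade using the AWS SDK for Python (boto3). The instance identifier is a placeholder, and you can confirm the exact target engine version string for your edition with describe_db_engine_versions before applying it:

import boto3

rds = boto3.client("rds")

# List available SQL Server engine versions to confirm the CU18 string
# (Engine value shown is for Standard Edition; adjust for your edition).
versions = rds.describe_db_engine_versions(Engine="sqlserver-se")

# Upgrade an existing instance (placeholder identifier). With
# ApplyImmediately=False, the upgrade runs in the next maintenance window.
rds.modify_db_instance(
    DBInstanceIdentifier="my-sqlserver-instance",
    EngineVersion="16.00.4185.3.v1",  # assumed CU18 version string; verify first
    ApplyImmediately=False,
)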
These minor versions are available in all AWS regions where Amazon RDS for SQL Server is available. See Amazon RDS for SQL Server Pricing for pricing details and regional availability.
At Google Cloud Next, we introduced H4D VMs, our latest machine type for high performance computing (HPC). Building upon existing HPC offerings, H4D VMs are designed to address the evolving needs of demanding workloads in industries such as manufacturing, weather forecasting, EDA, and healthcare and life sciences.
H4D VMs are powered by 5th Generation AMD EPYC™ processors, offering improved whole-node VM performance of more than 12,000 GFLOPS and improved memory bandwidth of more than 950 GB/s. H4D provides low-latency, 200 Gbps network bandwidth using Cloud Remote Direct Memory Access (RDMA) on Titanium, the first of our CPU-based VMs to do so. This powerful combination enables you to efficiently scale your HPC workloads and achieve insights faster.
Figure 1: VM and core performance, as well as memory bandwidth, for H4D vs. C2D and C3D, showing generational improvement
For open-source High-Performance Linpack (OSS-HPL), a widely-used benchmark for measuring the floating-point computing power of supercomputers, H4D offers 1.8x higher performance per VM and 1.6x higher performance per core compared to C3D. Additionally, H4D offers 5.8x higher performance per VM and 1.7x higher performance per core compared to C2D.
For STREAM Triad, a benchmark to measure memory bandwidth, H4D offers 1.3x higher performance per VM and 1.4x higher performance per core compared to C3D. Additionally, H4D offers 3x higher performance per VM and 1.4x higher performance per core compared to C2D.
Improved HPC application performance
H4D VMs deliver strong compute performance and memory bandwidth, significantly outperforming previous generations of AMD-based VMs such as C2D and C3D. This enables faster simulations and analysis, with substantial performance gains (relative to C2D, a prior-generation AMD-based HPC VM) across a variety of HPC applications and benchmarks, as illustrated below:
Manufacturing:
CFD apps like Siemens™ Simcenter STAR-CCM+™/HIMach show up to 3.6x improvement.
CFD apps like Ansys Fluent/f1_racecar_140 show up to 3.6x improvement.
FEA Explicit apps like Altair Radioss/T10m show up to 3.6x improvement.
CFD apps like OpenFoam/Motorbike_20m show up to 2.9x improvement.
FEA Implicit apps like Ansys Mechanical/gearbox show up to 2.7x improvement.
Healthcare and life sciences:
Molecular Dynamics (GROMACS) shows up to 5x improvement.
Weather forecasting:
Industry standard benchmark WRFv4 shows up to 3.6x improvement.
Figure 2: Single VM HPC Application performance (speed-up) of H4D, C3D and C2D relative to C2D. Applications ran on single VMs using all cores.
“Our deep collaboration with Google Cloud powers the next generation of cloud-based HPC with the announcement of the new H4D VMs. Google Cloud has leveraged the architectural advances of our 5th Gen AMD EPYC CPUs to create an offering that delivers impressive performance uplift compared to previous generations across a variety of HPC benchmarks. This will empower customers to achieve fast insights and accelerate their most demanding HPC workloads.” – Ram Peddibhotla, corporate vice president, Cloud Business, AMD
Faster HPC with Cloud RDMA on Titanium
H4D’s performance is made possible with Cloud RDMA, a new Titanium offload that’s available for the first time on these VMs. Cloud RDMA is specifically engineered to support HPC workloads that rely heavily on inter-node communication, such as computational fluid dynamics, weather modeling, molecular dynamics, and more. By offloading network processing, Cloud RDMA provides predictable, low-latency, high-bandwidth communication between compute nodes, thus minimizing host CPU bottlenecks.
Under the hood, Cloud RDMA uses Google’s innovative Falcon hardware transport for reliable, low-latency communication over our Ethernet-based data center networks, effectively resolving the traditional challenges of RDMA over Ethernet while helping to ensure predictable, high performance at scale.
Cloud RDMA over Falcon speeds up simulations by efficiently utilizing more computational resources. For example, for smaller CFD problems like OpenFoam/motorbike_20m and Simcenter Star-CCM+/HIMach10, which have limited inherent parallelism and are typically challenging to accelerate, H4D delivers 3.4x and 1.9x speedups, respectively, on four VMs compared to TCP.
Figure 3: Left: OpenFoam/Motorbike_20m offers a 3.4x improvement with H4D Cloud RDMA over TCP at four VMs. Right: Simcenter STAR-CCM+/HIMach10 offers a 1.9x improvement with H4D Cloud RDMA over TCP at four VMs.
For larger models, Falcon also helps maintain strong scaling. Using 32 VMs, Falcon achieved a 2.8x speedup over TCP for GROMACS/Lignocellulose and a 1.3x speedup for WRFv4/Conus 2.5km.
Figure 4: Left: GROMACS/Lignocellulose offers a 2.8x improvement with H4D Cloud RDMA over TCP at 32 VMs. Right: WRFv4/Conus 2.5km offers a 1.3x improvement with H4D Cloud RDMA over TCP at 32 VMs.
Cluster management and scheduling capabilities
H4D VMs will support both Dynamic Workload Scheduler (DWS) and Cluster Director (formerly known as Hypercompute Cluster).
DWS helps schedule HPC workloads for optimal performance and cost-effectiveness, providing resource availability for time-sensitive simulations and flexible HPC jobs.
Cluster Director, which lets you deploy and scale a large, physically-colocated accelerator cluster as a single unit, is now extending its capabilities to HPC environments. Cluster Director simplifies deploying and managing complex HPC clusters on H4D VMs by allowing researchers to easily set up and run large-scale simulations.
VM sizes and regional availability
We offer H4D VMs in both standard and high-memory configurations to cater to diverse workload requirements. We also provide options with local SSD for workloads that demand high-speed storage, such as CPU-based seismic processing and structural mechanics applications (e.g., Abaqus, NASTRAN, Altair OptiStruct and Ansys Mechanical).
VM                    | Cores | Memory (GB) | Local SSD
h4d-highmem-192-lssd  | 192   | 1,488       | 3.75 TB
h4d-standard-192      | 192   | 720         | N/A
h4d-highmem-192       | 192   | 1,488       | N/A
H4D VMs are currently available in us-central1-a (Iowa) and europe-west4-b (Netherlands), with additional regions in progress.
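As a sketch of what provisioning looks like programmatically, the snippet below creates an H4D instance with the google-cloud-compute client. The project, instance name, and boot image are placeholders, and the machine type string assumes the naming in the table above:

from google.cloud import compute_v1

def create_h4d_instance(project: str, zone: str = "us-central1-a") -> None:
    # Machine type taken from the table above; name and image are placeholders.
    instance = compute_v1.Instance(
        name="hpc-node-0",
        machine_type=f"zones/{zone}/machineTypes/h4d-highmem-192-lssd",
        disks=[
            compute_v1.AttachedDisk(
                boot=True,
                auto_delete=True,
                initialize_params=compute_v1.AttachedDiskInitializeParams(
                    source_image="projects/cloud-hpc-image-public/global/images/family/hpc-rocky-linux-8",
                ),
            )
        ],
        network_interfaces=[
            compute_v1.NetworkInterface(network="global/networks/default")
        ],
    )
    operation = compute_v1.InstancesClient().insert(
        project=project, zone=zone, instance_resource=instance
    )
    operation.result()  # Block until the insert operation completes.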
What our customers and partners are saying
“With the power of Google’s new H4D-based clusters, we are poised to simulate systems approaching a trillion particles, unlocking unprecedented insights into circulatory functions and diseases. This leap in computational capability will dramatically accelerate our pursuit of breakthrough therapeutics, bringing us closer to effective precision therapies for blood vessel damage in heart disease.” – Petros Koumoutsakos, Jr. Professor of Computing in Science and Engineering, Harvard University
“The launch of Google Cloud’s H4D platform marks a significant advancement in engineering simulation. As GCP’s first VM with RDMA over Ethernet, combined with higher memory bandwidth, generous L3 cache, and AVX-512 instruction support, H4D delivers up to 3.6x better performance for Ansys Fluent simulations compared to C2D VMs. This performance boost allows our customers to run simulations faster, explore a wider range of design options, and drive innovation with greater efficiency.” – Wim Slagter, Senior Director of Partner Programs, Ansys
“The generational performance leap achieved with Google H4D VMs, powered by the 5th Generation AMD EPYC™, is truly remarkable. For compute-intensive, highly non-linear simulations, such as car crash analysis, Altair® Radioss® delivers a stunning 3.6x speedup. This breakthrough paves the way for faster and more accurate simulations, which is crucial for our customers in the era of the digital thread!” – Eric Lequiniou, SVP Radioss Development and Altair Solvers HPC
“The latest H4D VMs, powered by 5th Generation AMD EPYC Processors and Cloud RDMA, allow our customers to realize faster time-to-results for their Simcenter STAR-CCM+ simulations. For HIMach10, we’re seeing up to 3.6x performance gains compared to the C2D instance and 1.9x speedup on four H4D Cloud RDMA VMs compared to TCP. Our partnership with Google has been key to achieving these reduced simulation times.” – Lisa Mesaros, Vice President, Simcenter Solution Domains Product Management, Siemens
Want to try it out?
We’re excited to see how H4D VMs will empower you to achieve faster results with your HPC workloads! Sign up for the preview by filling out this form.
For decades, businesses have wrestled with unlocking the true potential of their data for real-time operations. Bigtable, Google Cloud’s pioneering NoSQL database, has been the engine behind massive-scale, low-latency applications that operate at a global scale. It was purpose-built for the challenges of real-time applications and remains a key piece of Google’s infrastructure, powering products like YouTube and Ads.
This week at Google Cloud Next, we announced continuous materialized views, an expansion of Bigtable’s SQL capabilities. Bigtable SQL and continuous materialized views enable users to build fully managed, real-time application backends using familiar SQL syntax, including specialized features that preserve Bigtable’s flexible schema — a vital aspect of real-time applications.
Whether you’re building streaming applications, real-time aggregations, or global AI analysis on a continuous data stream, Bigtable just got a whole lot easier — and much more powerful.
Bigtable’s SQL interface, now generally available
Bigtable recently transformed the developer experience by adding SQL support, now generally available. SQL support makes it easier for development teams to work with Bigtable’s flexibility and speed.
Bigtable SQL interface in Bigtable Studio
The Bigtable SQL interface enhances accessibility and streamlines application development by facilitating rapid troubleshooting and data analysis. This unlocks new use cases, like real-time dashboards utilizing distributed counting for instant metric retrieval and improved product search through K nearest neighbors (KNN) similarity search. A wide range of customers, spanning innovative AI startups to traditional financial institutions, are enthusiastic about Bigtable SQL’s potential to broaden developer access to Bigtable’s capabilities.
“Imagine coding with AI that understands your entire codebase. That’s Augment Code, an AI coding platform that gives you context in every feature. Bigtable’s robustness and scaling enable us to work with large code repositories. Its ease of use allowed us to build security features that safeguard our customers’ valuable intellectual property. As our engineering team grows, Bigtable SQL will make it easier to onboard new engineers who can immediately start to work with Bigtable’s fast access to structured, semi-structured, or unstructured data while using a familiar SQL interface,” said Igor Ostrovsky, co-founder and CTO, Augment.
“Equifax leverages Bigtable within our proprietary data fabric for the high-performance storage of financial journals. Our data pipeline team evaluated Bigtable’s SQL interface and found it to be a valuable tool for directly accessing our enterprise data assets and improved Bigtable’s ease of use for SQL-experienced teams. This means more of our team can work efficiently with Bigtable and we anticipate boosted productivity and better integration capabilities,” said Varadarajan Elangadu Raghunathan and Lakshmi Narayanan Veena Subramaniyam, vice-presidents, Data Fabric Decision Science.
Bigtable SQL has also been praised for offering a smooth migration path from databases with distributed key-value architectures and SQL-based query languages, including Cassandra (CQL) and HBase with Apache Phoenix.
“At Pega, we are building real-time decisioning applications that require very low latency query responses to make sure our clients get real-time data to drive their business. The new SQL interface in Bigtable is a compelling option for us as we look for alternatives to our existing database,” said Arjen van der Broek, principal product manager, Data and Integrations, Pega.
This week, Bigtable is also adding new preview functionalities to its SQL language including GROUP BYs and aggregations, an UNPACK transform for working with timestamped data, and structured row keys for working with data that is stored in a multi-part row key.
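As an illustrative sketch of these capabilities, the query below runs an aggregation over a hypothetical metrics table; it assumes the execute_query method available in recent versions of the python-bigtable data client, and the table, column family, and column names are placeholders:

from google.cloud.bigtable.data import BigtableDataClient

client = BigtableDataClient(project="my-project")

# Hypothetical table and columns: count readings per device type using the
# new GROUP BY / aggregation support. Exact dialect details may differ.
results = client.execute_query(
    """
    SELECT cf['device_type'] AS device_type, COUNT(*) AS readings
    FROM metrics
    GROUP BY device_type
    """,
    instance_id="my-instance",
)
for row in results:
    print(row["device_type"], row["readings"])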
Continuous materialized views, now in preview
Bigtable SQL integrates with Bigtable’s recently introduced continuous materialized views (preview), offering a solution to traditional materialized view limitations like data staleness and maintenance complexity. This allows for real-time aggregation and analysis of data streams across applications such as media streaming, e-commerce, advertising, social media, and industrial monitoring.
Bigtable materialized views are fully managed and updated incrementally, without impacting user queries from applications. Bigtable materialized views also support a rich SQL language, including functions and aggregations.
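The defining query of a materialized view is plain SQL over the source table. As a hypothetical sketch (table, columns, and exact function support are illustrative), a definition like the one below, supplied at view creation time, would keep per-device hourly counts continuously up to date as new rows arrive:

# Hypothetical materialized-view definition supplied when creating the view
# (e.g., in Bigtable Studio); table and column names are illustrative.
VIEW_QUERY = """
SELECT
  device_id,
  TIMESTAMP_TRUNC(event_time, HOUR) AS hour,
  COUNT(*) AS event_count
FROM events
GROUP BY device_id, hour
"""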
“With Bigtable’s new Materialized Views, we’ve unleashed the full potential of low-latency use cases for clients of our Customer Data Platform. By defining SQL-based aggregations/transformations at ingestion, we’ve eliminated the complexities and delays of ETL in our time series use cases. Moreover, using data transformations during ingestion, we’ve unlocked the ability for our AI applications to receive perfectly prepared data with minimal latencies,” said Sathish KS, Chief Technology Officer, Zeotap.
Continuous Materialized Views workflow
Ecosystem integrations
To get useful real-time analytics, you often need to pull data from many sources with very low latency. As Bigtable expands its SQL interface, it is also expanding its ecosystem compatibility, making it easier to build end-to-end applications using simple connectors and SQL.
Open-source Apache Kafka Bigtable Sink
Customers often rely on Google Cloud Managed Service for Apache Kafka to build pipelines that stream data into Bigtable and other analytics systems. To help customers build high-performance data pipelines, the Bigtable team has open-sourced a new Bigtable Sink for Apache Kafka so you can send data from Kafka to Bigtable in milliseconds.
Open-source Apache Flink Connector for Bigtable
Apache Flink is a stream-processing framework that lets you manipulate data in real time. With the recently launched Apache Flink to Bigtable Connector, you can construct a pipeline that transforms streaming data and writes the outputs into Bigtable, using both the high-level Apache Flink Table API and the more granular Datastream API.
“BigQuery continuous queries enables our application to use real-time stream processing and ML predictions by simply writing a SQL statement. It’s a great service that allows us to launch products quickly and easily,” said Shuntaro Kasai and Ryo Ueda, MLOps Engineers, DMM.com.
Real-time Analytics in Bigtable overview
Bigtable CQL Client: Bigtable is now Cassandra-compatible, in preview
The Cassandra Query Language (CQL) is the query language of Apache Cassandra. With the launch of the Bigtable CQL Client, developers can now migrate their applications to Bigtable with minimal to no code changes, and enjoy the familiarity of CQL on enterprise-grade, high-performance Bigtable. Bigtable also supports common tools in the Cassandra ecosystem, like the CQL shell (CQLsh), as well as Cassandra’s own data migration utilities, which enable seamless migrations from Cassandra with no downtime, significantly reducing operational overhead.
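Because standard Cassandra drivers keep working, application code barely changes. Here is a minimal sketch with the DataStax Python driver, assuming the Bigtable CQL Client endpoint is reachable on the default CQL port; the keyspace, table, and column names are illustrative:

from cassandra.cluster import Cluster

# Assumes the Bigtable CQL Client is serving CQL locally on the default
# port (9042); keyspace, table, and column names are illustrative.
cluster = Cluster(["127.0.0.1"], port=9042)
session = cluster.connect("analytics")

row = session.execute(
    "SELECT user_id, last_seen FROM user_activity WHERE user_id = %s",
    ("user-123",),
).one()
print(row.user_id, row.last_seen)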
Get started using the Bigtable CQL Client and migration utilities here.
Convergence: NoSQL’s embrace of SQL power
In this blog, we discussed a significant advancement that empowers developers to use SQL with Bigtable. You can easily get started with the flexible SQL language from any existing Bigtable cluster using Bigtable Studio and start to create materialized views on streams of data coming from Kafka and Flink.
As an object storage service, Google Cloud Storage is popular for its simplicity and scale, a big part of which is due to the stateless REST protocols that you can use to read and write data. But with the rise of AI and as more customers look to run data-intensive workloads, two major obstacles to using object storage are its higher latency and lack of file-oriented semantics. With the launch of Rapid Storage on Google Cloud, we’ve added a stateful gRPC-based streaming protocol that provides sub-millisecond read/write latency and the ability to easily append data to an object, while maintaining the high aggregate throughput and scale of object storage. In this post, we’ll share an architectural perspective into how and why we went with this approach, and the new types of workloads it unlocks.
It all comes back to Colossus, Google’s internal zonal cluster-level file system that underpins most (if not all) of our products. As we discussed in a recent blog post, Colossus supports our most demanding performance-focused products with sophisticated SSD placement techniques that deliver low latency and massive scale.
Another key ingredient in Colossus’s performance is its stateful protocol — and with Rapid Storage, we’re bringing the power of the Colossus stateful protocol directly to Google Cloud customers.
When a Colossus client creates or reads a file, the client first opens the file and gets a handle, a collection of state that includes all the information about how that file is stored, including which disks the file’s data is stored on. Clients can use this handle when reading or writing to talk directly to the disks via an optimized RDMA-like network protocol, as we previously outlined in our Snap networking system paper.
Handles can also be used to support ultra-low latency durable appends, which is extremely useful for demanding database and streaming analytics applications. For example, Spanner and Bigtable both write transactions to a log file that requires durable storage and that is on the critical path for database mutations. Similarly, BigQuery supports streaming to a table while massively parallel batch jobs perform computations over recently ingested data. These applications open Colossus files in append mode, and the Colossus client running in the application uses the handle to write their database mutations and table data directly to disks over the network. To ensure the data is stored durably, Colossus replicates its data across several disks, performing writes in parallel and using a quorum technique to avoid waiting on stragglers.
Figure 1: Steps involved in appending data to a file in Colossus.
The above image shows the steps that are taken to append data to a file.
1. The application opens the file in append mode. The Colossus Curator constructs a handle and sends it to the Colossus Client running in-process, which caches the handle.
2. The application issues a write call for an arbitrary-sized log entry to the Colossus Client.
3. The Colossus Client, using the disk addresses in the handle, writes the log entry in parallel to all the disks.
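In pseudocode, this flow looks roughly like the following. Every name here is hypothetical, since Colossus is internal infrastructure with no public API:

# Hypothetical pseudocode for the append flow above; not a public API.
def append_log_entry(curator, entry: bytes) -> None:
    # Step 1: open in append mode; the Curator returns a handle containing
    # the addresses of the disks that hold the file's data.
    handle = curator.open("/logs/db-mutations", mode="append")

    # Steps 2-3: write the entry in parallel to every replica disk, then
    # wait only for a quorum of acknowledgments to avoid stragglers.
    pending = [disk.write_async(entry) for disk in handle.disks]
    wait_for_quorum(pending)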
Rapid Storage builds on Colossus’s stateful protocol, leveraging gRPC-based streaming for the underlying transport. When performing low-latency reads and writes to Rapid Storage objects, the Cloud Storage client establishes a stream, providing the same request parameters used in Cloud Storage’s REST protocols, such as the bucket and object name. Further, all the time-consuming Cloud Storage operations such as user authorization and metadata accesses are front-loaded and performed at stream creation time, so subsequent read and write operations go directly to Colossus without any additional overhead, allowing for appendable writes and repeated ranged reads with sub-millisecond latency.
This Colossus architecture enables Rapid Storage to support 20 million requests per second in a single bucket — a scale that is extremely useful in a variety of AI/ML applications. For example, when pre-training a model, pre-processed, tokenized training data is fed into GPUs or TPUs, typically in large files that each contain thousands of tokens. But the data is rarely read sequentially; for example, different random samples are read in different orders as training progresses. With Rapid Storage’s stateful protocol, a stream can be established at the start of the training run before executing massively parallel ranged reads at sub-millisecond speeds. This helps to ensure that accelerators aren’t blocked on storage latency.
Likewise, with appends, Rapid Storage takes advantage of Colossus’s stateful protocol to provide durable writes with sub-millisecond latency, and supports unlimited appends to a single object up to the object size limit. A major challenge with stateful append protocols is how to handle cases where the client or server hangs or crashes. With Rapid Storage, the client receives a handle from Cloud Storage when creating the stream. If the stream gets interrupted but the client wants to continue reading or appending to the object, the client can re-establish a new stream using this handle, which streamlines this flow and minimizes any latency hiccups. It gets trickier when there is a problem on the client, and the application wants to continue appending to an object from a new client. To simplify this, Rapid Storage guarantees that only one gRPC stream can write to an object at a time; each new stream takes over ownership of the object, transactionally locking out any prior stream. Finally, each append operation includes the offset that’s being written to, ensuring that data correctness is always preserved even in the face of network partitions and replays.
Figure 2: A new client taking over ownership of an object.
In the above image, a new client takes over ownership of an object, locking out the previous owner.
1. Initially, client 1 appends data to an object stored on three disks.
2. The application decides to fail over to client 2, which opens this object in append mode. The Colossus Curator transactionally locks out client 1 by increasing a version number on each object data replica.
3. Client 1 attempts to append more data to the object, but cannot because its ownership was tied to the old version number.
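The locking scheme can be illustrated with a small, self-contained toy simulation. The real mechanism lives inside Colossus and Cloud Storage, so this is purely conceptual:

# Toy simulation of version-based takeover and offset-checked appends
# (conceptual only; the real protocol is implemented server-side).

class Replica:
    def __init__(self):
        self.version = 0
        self.data = bytearray()

    def take_over(self) -> int:
        self.version += 1          # new stream bumps the version, locking out prior writers
        return self.version

    def append(self, payload: bytes, offset: int, version: int) -> bool:
        if version != self.version or offset != len(self.data):
            return False           # stale writer, or replayed/out-of-order write
        self.data += payload
        return True

replica = Replica()
v1 = replica.take_over()                      # client 1 owns the object
assert replica.append(b"record-1", 0, v1)

v2 = replica.take_over()                      # client 2 takes over ownership
assert replica.append(b"record-2", len(b"record-1"), v2)

assert not replica.append(b"stale", 17, v1)   # client 1 is transactionally locked out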
To make it as easy as possible to integrate Rapid Storage into your applications, we are also updating our SDKs to support gRPC streaming-based appends and expose a simple application-oriented API. Writing data using handles is a familiar concept in the filesystems world, so we’ve integrated Rapid Storage into Cloud Storage FUSE, which provides clients with file-like access to Cloud Storage buckets, for low-latency file-oriented workloads. Rapid Storage also natively enables Hierarchical Namespace as part of its zonal bucket type, providing enhanced performance, consistency, and folder-oriented APIs.
In short, Rapid Storage combines the sub-millisecond latency of block-like storage, the throughput of a parallel filesystem, and the scalability and ease of use of object storage, and it does all this in large part due to Colossus. Here are some interesting workloads we’ve seen our customers explore during the preview:
AI/ML data preparation, training, and checkpointing
Distributed database architecture optimization
Batch and streaming analytics processing
Video live-streaming and transcoding
Logging and monitoring
Interested in trying Rapid Storage? Indicate your interest here or reach out through your Google Cloud representative.
As organizations continue to prioritize cloud-first strategies to accelerate innovation and gain competitive advantage, legacy databases remain a bottleneck by hindering modernization and stifling growth with unfriendly licensing, complex agreements, and rigid infrastructure.
That’s why this week at Google Cloud Next, we’re announcing that Database Migration Service (DMS) is extending its comprehensive database modernization offering to support SQL Server to PostgreSQL migrations, enabling you to unlock the potential of open-source databases in the cloud and build modern, scalable, and cost-effective applications.
While it offers great benefits, migrating from SQL Server to a modern, managed PostgreSQL offering like AlloyDB or Cloud SQL can be a highly complex task. Even though SQL Server and PostgreSQL both adhere to SQL standards, they have fundamental differences in their architectures, data types, and procedural languages, which require deep expertise in both technologies for a successful migration.
For example, SQL Server’s T-SQL syntax and built-in functions often require manual translation to PostgreSQL’s PL/pgSQL. Data type mappings can be intricate, as SQL Server’s DATETIME precision and NVARCHAR handling differ from PostgreSQL’s equivalents.
Furthermore, features like SQL Server’s stored procedures, triggers, and functions often necessitate significant refactoring to align with PostgreSQL’s implementation. This requires deep knowledge in both database systems, as well as specific migration expertise that developers typically don’t possess, and it requires hours of painstaking work, even with the benefit of an automated conversion tool.
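To make these differences concrete, here are a few representative translations of the kind an automated conversion engine must perform (illustrative, not exhaustive):

# A few illustrative T-SQL -> PostgreSQL translations of the kind an
# automated conversion engine performs (not an exhaustive mapping).
TSQL_TO_POSTGRES = {
    "GETDATE()":      "now()",                          # current timestamp
    "ISNULL(x, y)":   "COALESCE(x, y)",                 # null substitution
    "NVARCHAR(100)":  "varchar(100)",                   # PostgreSQL strings are Unicode already
    "DATETIME":       "timestamp(3)",                   # DATETIME has ~3.33 ms precision
    "TOP (10)":       "LIMIT 10",                       # row limiting
    "IDENTITY(1,1)":  "GENERATED ALWAYS AS IDENTITY",   # auto-incrementing keys
}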
Simplifying database modernization with Database Migration Service
DMS is a fully-managed, serverless cloud service that offers a complete set of capabilities to simplify database “lift and shift” migrations and database modernization journeys.
For modernization efforts, DMS offers an interactive experience that includes data migration, as well as schema and resident code conversion, all in the same powerful user interface. For data migration, it offers high-throughput database initial loads followed by low-latency change data capture to reduce downtime and minimize the impact on business critical applications.
Announcing SQL Server to PostgreSQL migration
The new SQL Server to PostgreSQL migration experience supports the migration of both self-managed and cloud-managed SQL Server offerings to Cloud SQL for PostgreSQL and AlloyDB to accelerate your database modernization journey. Similar to the existing database modernization offerings, this new experience features a high-throughput initial load of the database followed by seamless change data capture (CDC) replication to synchronize the SQL Server source and PostgreSQL destination, all while the production application is up and running to ensure minimal business interruption.
Database Migration Service is designed to automate the most difficult SQL Server to PostgreSQL migration steps.
For SQL Server schema and code conversion, DMS offers a fast, customizable algorithmic code conversion engine that automates the conversion of most of the database schema and code to the appropriate PostgreSQL dialect, leaving minimal manual conversion work for the user to complete.
The algorithmic conversion engine maps the source database data types and SQL commands to the most suitable PostgreSQL ones, and even refactors complex source features which have no direct PostgreSQL equivalents to achieve the same functionality using available PostgreSQL capabilities. Algorithmic engines are extremely accurate, by nature, for the scenarios they are programmed for. However, they’re limited to just those scenarios, and in real-life usage some of the database code will consist of scenarios that can’t be anticipated.
For these situations, we’re pushing the boundaries of automated database modernization with the introduction of the Gemini automatic conversion engine. This new engine automatically augments the output of the algorithmic conversion, further automating the conversion tasks and reducing the amount of remaining manual work. It also provides a comprehensive conversion report, highlighting which parts of the code were enhanced, why they were changed, and how they were converted.
Instead of spending time researching suitable PostgreSQL features and fixing conversion issues, you can simply review the Gemini recommendations in the conversion report and mark the conversion as verified. Reviewing the completed conversions instead of having to research and fix issues can significantly reduce the manual migration effort and speed up the conversion process.
To further empower SQL Server DBAs, DMS offers a Gemini conversion assist with targeted yet comprehensive SQL Server to PostgreSQL conversion training. Gemini analyzes both the source and the converted code and explains the conversion rationale, highlighting the chosen PostgreSQL features, why they were used, and how they compare to the SQL Server ones. It can then optimize the migrated code for better performance and automatically generate comprehensive comments, for better long-term maintainability.
Database Migration Service provides detailed explanations of SQL Server to PostgreSQL conversions.
At Google Cloud, we’ve been working closely with customers looking to modernize their database estate. One of them is Wayfair LLC, an American online home store for furniture and decor.
“Google Cloud’s Database Migration Service simplifies the process of modernizing databases. Features like Change Data Capture to reduce downtime and AI-assisted code conversion help evolve our database usage more efficiently. This makes the migration process less manual and time-consuming, allowing teams to spend more time on development and less on infrastructure,” said Shashank Srivastava, software engineering manager, Data Foundations, Wayfair.
How to get started
To start your Gemini-powered SQL Server migration, navigate to the Database Migration page in the Google Cloud console, and follow these simple steps:
1. Create your source and destination connection profiles, which contain information about the source and destination databases. These connection profiles can later be used for additional migrations.
2. Create a conversion workspace that automatically converts your source schema and code to a PostgreSQL schema and compatible SQL. Make sure you choose to enable the new Gemini-powered conversion workspace capabilities.
3. Review the converted schema objects and SQL code, and apply them to your destination Cloud SQL for PostgreSQL or AlloyDB for PostgreSQL instance.
4. Create a migration job and choose the conversion workspace and connection profiles previously created.
5. Test your migration job and get started whenever you’re ready.
To learn more about how Database Migration Service can help you modernize your SQL Server databases, please review our DMS documentation and start your migration journey today.
Supporting customers where they want to be is a core value at Google Cloud, and a big part of the reason that we have partnered with Oracle — so that you can innovate faster with the best of Google and the best of Oracle.
This week at Google Cloud Next, we announced significant expansions to our Oracle Database offerings, including the preview of Oracle Base Database Service for a flexible and controllable way to run Oracle databases in the cloud; general availability of Oracle Exadata X11M, bringing the latest generation of the Oracle Exadata platform to Google Cloud; and additional enterprise-ready capabilities including customer-managed encryption keys (CMEK).
We are continuing to invest in global infrastructure for Oracle, with a total of 20 locations available in the coming months, adding Oracle Database@Google Cloud presence in Australia, Brazil, Canada, India, Italy, and Japan.
These announcements follow our developments with Oracle since last July, when we launched Oracle Database@Google Cloud. This partnership enables customers to migrate and modernize their Oracle workloads and start taking advantage of Google’s industry-leading data and AI capabilities such as BigQuery, Vertex AI platform, and Gemini foundation models.
Additional features provide customers with even more options in their modernization journey, such as the fully managed Oracle Autonomous Database Serverless. They can also benefit from increased reliability and resiliency features, such as cross-region disaster recovery and Oracle Maximum Availability Architecture (MAA) Gold certification.
“Banco Actinver is committed to providing innovative financial solutions to our clients. By combining the security and performance of Oracle Database with Google Cloud’s data analytics and AI tools, we’re gaining deeper insights into market trends, enhancing our services, and delivering personalized experiences to our customers,” said Jorge Fernandez, CIO, Banco Actinver.
Innovative new capabilities
We’re expanding our offerings to empower customers with the flexibility to manage a diverse set of database workloads cost effectively.
Oracle Base Database Service: The new Base Database Service delivers a highly controllable and customizable foundational database platform, built on Oracle Cloud Infrastructure (OCI) virtual machines and general-purpose infrastructure. It can empower businesses with the flexibility to manage a diverse range of database workloads directly.
Enhanced Oracle Database Services: In addition to the availability of Exadata Cloud Service, Autonomous Database Service, Oracle Linux, and Oracle on Google Compute Engine (GCE) and Google Kubernetes Engine (GKE), we are pleased to share general availability of Oracle Exadata X11M. Oracle Database@Google Cloud now offers the latest generation of Oracle Exadata machines, the X11M, with enhanced performance and scalability for demanding database workloads. These new machines provide significant performance gains and increased capacity, enabling customers to run even the most intensive Oracle applications with ease. X11M will be available in all new regions.
Customers are embracing Oracle Database@Google Cloud, and to support their global needs, we’re expanding our footprint while maintaining the highest standards of application performance and reliability.
Expanding to 20 Oracle Database@Google Cloud Locations in the coming months: To further support the growing demand for Oracle workloads on Google Cloud, we are launching in more locations, including U.S. Central 1 (Iowa), North America-Northeast 1 (Montreal), North America-Northeast 2 (Toronto), Asia-Northeast 1 (Tokyo), Asia-Northeast 2 (Osaka), Asia-South 1 (Mumbai), Asia-South 2 (Delhi), South America-East 1 (Sao Paulo), Europe-West (Italy), Australia-Southeast2 (Melbourne), and Australia-Southeast1 (Sydney) — and additional zones in Ashburn, Frankfurt, London, Melbourne, and Italy. The new regions and expanded capacity are in addition to Google Cloud regions across U.S. East (Ashburn), U.S. West (Salt Lake City), U.K. South (London), and Germany Central (Frankfurt) that are available today.
New Partner Cross-Cloud Interconnect availability: We’re pleased to expand our global network offerings with new Partner Cross-Cloud Interconnect multicloud connectivity between Google Cloud and Oracle Cloud Infrastructure in Toronto and Zurich. This complements the 11 regions already served, ensuring the lowest possible latency between both clouds while keeping traffic private and secure.
Cross Region Disaster Recovery: Cross Region Disaster Recovery support for Oracle workloads on Oracle Autonomous Database ensures high availability and resilience, protecting against potential outages and providing continuous operation for critical applications.
Enterprise-grade networking upgrades: Advanced networking upgrades enable enterprises to efficiently deploy their Oracle resources alongside Google Cloud services and share resources between them.
Industry-leading certifications and user experience
Google Cloud is committed to providing a seamless and efficient experience for Oracle customers, ensuring that managing and utilizing Oracle databases is straightforward and effective. We offer a combination of native Google Cloud tools and Oracle Cloud Infrastructure (OCI) interfaces, along with robust support for various applications and systems.
Enhanced user experience: Google Cloud is committed to providing an easy-to-use experience for Oracle customers, offering a Google Cloud integrated user experience for application developers and routine database operations, alongside an OCI-native experience for advanced database management. This includes support for Shared VPC, APIs, SDKs, and Terraform.
Application support: Google Cloud is pleased to announce support for Oracle applications running on Google Cloud, ensuring compatibility and optimal performance, including Oracle E-Business Suite, PeopleSoft Enterprise, JD Edwards EnterpriseOne, Hyperion Financial Management, and Retail Merchandising.
SAP and Oracle Capability: Oracle workloads on Google Compute Engine are now supported by SAP and Oracle, further validating Google Cloud as a trusted platform for running enterprise applications.
Integration with Google Cloud Monitoring: Provides enterprises a unified monitoring and alerting mechanism across all their Google Cloud database services, now including Oracle Database.
New support in Google Cloud Backup and DR: Our backup service now provides central, policy-based management for backup of Oracle workloads along with other Google Cloud services using secure backup vaults for data protection — isolating and protecting data from threats like ransomware and accidental deletion.
Google Cloud’s strengths make it the preferred hyperscaler for running mission-critical Oracle workloads.
Get started right away from your Google Cloud Console or learn more here.
The high-performance storage stack in AI Hypercomputer incorporates learnings from geographic regions, zones, and GPU/TPU architectures, to create an agile, economical, integrated storage architecture. Recently, we’ve made several innovations to improve accelerator utilization with high-performance storage, helping you to optimize costs, resources, and accelerate your AI workloads:
Rapid Storage: A new Cloud Storage zonal bucket that provides industry-leading <1ms random read and write latency, 20x faster data access, 6 TB/s of throughput, and 5x lower latency for random reads and writes compared to other leading hyperscalers.
Anywhere Cache: A new, strongly consistent cache that works with existing regional buckets to cache data within a selected zone. Anywhere Cache reduces latency by up to 70% and delivers up to 2.5 TB/s of throughput, accelerating AI workloads; it also maximizes goodput by keeping data close to your GPUs or TPUs.
Google Cloud Managed Lustre: A new high-performance, fully managed parallel file system built on the DDN EXAScaler Lustre file system. This zonal storage solution provides PB scale at under 1ms latency, millions of IOPS, and TB/s of throughput for AI workloads.
Storage Intelligence: The industry’s first offering for generating storage insights specific to your environment by querying object metadata at scale and using the power of LLMs. Storage Intelligence not only provides insights into vast data estates, it also provides the ability to take action, e.g., using ‘bucket relocation’ to non-disruptively co-locate data with accelerators.
Rapid Storage enables AI workloads with millisecond latency
To train, checkpoint, and serve AI models at peak efficiency, you need to keep your GPU or TPUs saturated with data to minimize wasted compute (as measured by goodput). But traditional object storage suffers from a critical limitation: latency. Using Google’s Colossus cluster-level file system, we are delivering a new approach to colocate storage and AI accelerators in a new zonal bucket. By “sitting on” Colossus, Rapid Storage avoids the typical regional storage latency of having accelerators that reside in one zone and data that resides in another.
Unlike regional Cloud Storage buckets, a Rapid Storage zonal bucket concentrates data within the same zone that your GPUs and TPUs run in, helping to achieve sub-millisecond read/write latencies and high throughput. In fact, Rapid Storage delivers 5x lower latency for random reads and writes compared to other leading hyperscalers. Combined with throughput of up to 6 TB/s per bucket and up to 20 million queries per second (QPS), you can now use Rapid Storage to train AI models with new levels of performance.
And because performance shouldn’t come at the cost of complexity, you can mount a Rapid Storage bucket as a file system leveraging Cloud Storage FUSE. This lets common AI frameworks such as TensorFlow and PyTorch access object storage without having to modify any code.
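For example, once a Rapid Storage bucket is mounted with Cloud Storage FUSE, training code can read samples with ordinary file I/O. A minimal PyTorch dataset over a hypothetical mount point might look like this (the mount path is illustrative):

import os
from torch.utils.data import Dataset

class MountedBucketDataset(Dataset):
    """Reads training samples from a Cloud Storage FUSE mount point.

    Assumes the Rapid Storage bucket is mounted at /mnt/training-data
    (path is illustrative); no storage-specific code is needed.
    """

    def __init__(self, root: str = "/mnt/training-data"):
        self.paths = [
            os.path.join(root, name) for name in sorted(os.listdir(root))
        ]

    def __len__(self) -> int:
        return len(self.paths)

    def __getitem__(self, idx: int) -> bytes:
        # Ordinary file reads; ranged reads against the bucket happen
        # underneath via the FUSE layer.
        with open(self.paths[idx], "rb") as f:
            return f.read()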
Anywhere Cache puts data in your preferred zone
Anywhere Cache is a strongly consistent zonal read cache that works with existing storage buckets (Regional, Multi-regional, or Dual-Region) and intelligently caches data within your selected zone. As a result, Anywhere Cache delivers up to 70% improvement in read-storage latency. By dynamically caching data to the desired zone and close to your GPUs or TPUs, it delivers performance of up to 2.5 TB/s, keeping multiple epoch training times minimized. Should conditions change, e.g., there’s a shift in accelerator availability, Anywhere Cache ensures your data accompanies the AI accelerators. You can enable Anywhere Cache in other regions and other zones with a single click, with no changes to your bucket or application. Moreover, it eliminates egress fees for cached data — among existing Anywhere Cache customers with multi-region buckets, 70% have seen cost benefits.
Anthropic leverages Anywhere Cache to improve the resilience of their cloud workload by co-locating data with TPUs in a single zone and providing dynamically scalable read throughput of up to 6 TB/s. They also use Storage Intelligence to gain deep insight into their 85+ billion objects, allowing them to optimize their storage infrastructure.
Google Cloud Managed Lustre accelerates HPC and AI workloads
AI workloads often access small files with random I/O patterns, while needing the sub-millisecond latency of a parallel file system. The new Google Cloud Managed Lustre is a fully managed parallel file system service that provides full POSIX support and persistent zonal storage that scales from terabytes to petabytes. As a persistent parallel file system, Managed Lustre lets you confidently store your training, checkpoint, and serving data, while delivering high throughput, sub-millisecond latency, and millions of IOPS across multiple jobs — all while maximizing goodput. With its full-duplex network utilization, Managed Lustre can fully saturate VMs at 20 GB/s and deliver up to 1 TB/s in aggregate throughput, while support for the Cloud Storage bulk import/export API makes it easy to move datasets to and from Cloud Storage. Managed Lustre is built in collaboration with DDN and based on EXAScaler.
Analyze and act on data with Storage Intelligence
Your AI models can only be as good as the data you train them on. Today, we announced Storage Intelligence, a new service that can help you find the right set of data by querying the metadata across all of your buckets to be used for AI training, improving your AI cost-optimization efforts. Storage Intelligence queries object metadata at scale using the power of LLMs, helping to generate storage insights specific to an environment. The first such service from a cloud hyperscaler, Storage Intelligence lets you analyze the millions — or billions — of object metadata in your buckets, and projects across your organization. With the insights from this analysis, you can make informed decisions about eliminating duplicate objects, identifying objects that can be deleted or tiered to a lower storage class through Object Lifecycle Management or Autoclass, or identifying objects that violate your company’s security policies, to name a few.
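For example, once object metadata is available for querying, an analysis like “find the largest STANDARD-class objects” is a simple SQL statement. The dataset and view names below are placeholders for whatever your Storage Intelligence configuration exposes:

from google.cloud import bigquery

client = bigquery.Client()

# Example analysis over object metadata in BigQuery. The dataset and view
# names are placeholders; substitute your configured metadata dataset.
query = """
SELECT name, size, storage_class
FROM `my-project.storage_insights.object_metadata`
WHERE size > 1e9 AND storage_class = 'STANDARD'
ORDER BY size DESC
LIMIT 100
"""
for row in client.query(query).result():
    print(row.name, row.size, row.storage_class)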
Google Cloud Storage’s Autoclass and Storage Intelligence features have helped Spotify understand and optimize its storage costs. In 2024, Spotify took advantage of these features to reduce its storage spend by 37%.
High performance storage for your AI workloads
We built Rapid Storage, Anywhere Cache, and Managed Lustre as high-performance storage solutions that deliver availability, high throughput, low latency, and durable architectures. Storage Intelligence adds to that, providing valuable, actionable insights into your storage estate.
Today at Google Cloud Next ‘25, we’re announcing a major step in making Looker the most powerful platform for data analysis and exploration, by enhancing it with powerful AI capabilities and a new reporting experience, all built on our trusted semantic model — the foundation for accurate, reliable insights in the AI era.
Starting today, all platform users can now leverage conversational analytics to analyze their data using natural language and Google’s latest Gemini models. We’re also debuting a brand-new reporting experience within Looker, designed for enhanced data storytelling and streamlined exploration. Both innovations are now available to all Looker-hosted customers.
Modern organizations require more than just accurate insights; they need AI to uncover hidden patterns, predict trends, and drive intelligent action. Gemini in Looker and the introduction of Looker reports makes business intelligence simpler and more accessible for everyone. This empowers users across the organization, reduces the burden on data teams, and frees analysts to focus on higher-impact work.
With Conversational Analytics, ask questions of your data and get AI-driven insights.
Looker’s unique foundation is its semantic layer, which ensures everyone works from a single source of truth. Combined with Google’s AI, Looker now delivers intelligent insights and automates analysis, accelerating data-driven decisions across your organization.
Gemini in Looker now available to all platform users
At Google Cloud Next ’24, we introduced Gemini in Looker to bring intelligent AI-powered BI to everyone, featuring a suite of capabilities, or assistants, that let users ask questions of their data in natural language and simplify tasks and workflows like data modeling, and chart and presentation generation.
Since then, we’ve brought those features to life in preview, and we are now expanding their access to all platform users, given the product’s level of maturity and accuracy. These include:
Conversational Analytics, for gaining insights into your data through natural language queries
Visualization Assistant, for custom visuals initiated by natural language, letting you easily configure charts and visualizations for dashboard creation
Formula Assistant, for powerful on-the-fly calculated fields and instant ad-hoc analysis
Automated Slide Generation, for impactful presentations with insightful and instant text summaries of your data
LookML Code Assistant, to simplify code creation, including guidance and suggestions to create dimensions, groups, measures, and more
Also available in preview is our Code Interpreter for Conversational Analytics, which enables business users to perform complex tasks and advanced analytics, including forecasting and anomaly detection, using natural language without needing deep Python expertise. You can learn more about this new capability and sign up here.
With Automated Slide Generation, you can create colorful and informative slides from Looker reports
Conversational Analytics API
To bring the power of conversational analytics beyond the Looker interface, we are introducing the Conversational Analytics API. Developers can now embed natural language query capabilities directly into custom applications, internal tools, or workflows, backed by trusted data access and scalable, reliable data modeling that can adapt to evolving needs.
This API allows you to build custom BI agent experiences, leveraging Looker’s trusted semantic model for accuracy and Google’s advanced AI models (including NL2SQL, RAG, and VizGen). Developers can embed this functionality easily to create intuitive data experiences, enable complex analysis via natural language, and even share insights generated from these conversations within the Looker platform. (Sign up here for preview access to the Conversational Analytics API.)
Introducing Looker reports
Self-service analysis is key to empowering line-of-business users and fostering collaboration. Building on the success and user-friendliness of Looker Studio, we’re bringing its powerful visualization and reporting capabilities directly into the core Looker platform with the introduction of Looker reports.
Looker reports are now available with Studio in Looker unification
Looker reports bring enhanced data storytelling, streamlined exploration, and broader data connectivity to users, including reports generated from native Looker content, direct connections to Microsoft Excel and Google Sheets data, first-party connectors and ad-hoc access to various data sources.
Creating compelling, interactive reports is now easier than ever. Looker reports feature the intuitive drag-and-drop interface users love, granular design controls, a rich library of visualizations and templates, and real-time collaboration capabilities.
This new reporting environment lives alongside your existing Looker Dashboards and Explores within Looker’s governed framework. Importantly, Looker reports integrate seamlessly with Gemini in Looker, allowing you to leverage conversational analytics within this new reporting experience.
Faster, more reliable development with continuous integration
With Google Cloud’s acquisition of Spectacles.dev, we are enabling developers to automate testing and validation of SQL and LookML changes, leading to faster, more reliable development cycles. Robust CI/CD practices build data trust by ensuring the accuracy and consistency of your semantic model — crucial for dependable AI-powered BI.
Continuous integration in Looker lets developers build and test faster than ever.
These advancements – the expanded availability of Gemini in Looker, the new Conversational Analytics API, the introduction of Looker reports, and native Continuous Integration capabilities – represent a major leap forward in delivering a complete AI-for-BI platform. We’re making it easier than ever to access trusted insights, leverage powerful AI, and foster a truly data-driven culture.
Join us at Google Cloud Next and be sure to watch our What’s new in Looker: AI for BI session on demand after the event to experience the future of BI with Looker, and discover how complete AI for BI can transform your data into a strategic advantage.
When it comes to AI, inference is where today’s generative AI models can solve real-world business problems. Google Kubernetes Engine (GKE) is seeing increasing adoption of gen AI inference. For example, customers like HubX run inference of image-based models to serve over 250k images/day to power gen AI experiences, and Snap runs AI inference on GKE for its ad ranking system.
However, there are challenges when deploying gen AI inference. First, during the evaluation phase of this journey, you have to evaluate all your accelerator options. You need to choose the right one for your use case. While many customers are interested in using Tensor Processing Units (TPU), they are looking for compatibility with popular model servers. Then, once you’re in production, you need to load-balance traffic, manage price-performance with real traffic at scale, monitor performance, and debug any issues that arise.
To help, this week at Google Cloud Next, we introduced new gen AI inference capabilities for GKE:
GKE Inference Quickstart, to help you set up inference environments according to best practices
GKE Inference Gateway, which introduces gen-AI-aware scaling and load balancing techniques
Together, these capabilities help reduce serving costs by over 30%, cut tail latency by 60%, and increase throughput by up to 40% compared to other managed and open-source Kubernetes offerings.
GKE Inference Quickstart helps you select and optimize the best accelerator, model server and scaling configuration for your AI/ML inference applications. It includes information about instance types, their model compatibility across GPU and TPUs, and benchmarks for how a given accelerator can help you meet your performance goals. Then, once your accelerators are configured, GKE Inference Quickstart can help you with Kubernetes scaling, as well as new inference-specific metrics. In future releases, GKE Inference Quickstart will be available as a Gemini Cloud Assist experience.
GKE TPU serving stack
With support for TPUs and vLLM, one of the leading open-source model servers, you get seamless portability across GPUs and TPUs. This means you can use any open model, select the vLLM:TPU container image and just deploy on GKE without any TPU-specific changes. GKE Inference Quickstart also recommends TPU best practices so you can seamlessly run on TPUs without any switching costs. For customers who want to run state-of-the-art models, Pathways, used internally at Google for large models like Gemini, allows you to run multi-host and disaggregated serving.
GKE Inference Gateway
GKE Gateway is an abstraction backed by a load balancer to route incoming requests to your Kubernetes applications, and traditionally, it has been tuned for web serving applications, using load-balancing techniques such as round-robin, whose requests have very predictable patterns. But LLMs have high variability in their request patterns. This can result in high tail latencies and uneven compute utilization, which can negatively impact the end-user experience and unnecessarily increase inference costs. In addition, traditional Gateway does not support routing infrastructure for popular Parameter-Efficient Fine-Tuning (PEFT) techniques like Low-Rank Adaptation (LoRA), which can increase GPU efficiency by model reuse during inference.
For scale-out scenarios, the new GKE Inference Gateway provides gen-AI-aware load balancing, for optimal routing. With GKE Inference Gateway, you can define routing rules for safe rollouts, cross-regional preferences, and performance goals such as priority. Finally, GKE Inference Gateway supports LoRA, which lets you map multiple models to the same underlying service, for better efficiency.
To summarize, the visual below shows customers’ needs during the different stages of the AI inference journey, and how GKE Inference Quickstart, the GKE TPU serving stack, and GKE Inference Gateway help simplify the evaluation, onboarding, and production phases.
What our customers are saying
“Using TPUs on GKE, especially the newer Trillium for inference, particularly for image generation, has reduced latency by up to 66%, leading to a better user experience and increased conversion rates. Users get responses in under 10 seconds instead of waiting up to 30 seconds. This is crucial for user engagement and retention.” – Cem Ortabas, Co-founder, HubX
“Optimizing price-performance for generative AI inference is key for our customers. We are excited to see GKE Inference Gateway with its optimized load balancing and extensibility in open source. The new GKE Inference Gateway capabilities could help us further improve performance for our customers’ inference workloads.” – Chaoyu Yang, CEO & Founder, BentoML
With GKE’s new inference capabilities, you get a powerful set of tools to take the next step with AI. To learn more, join our GKE gen AI inference breakout session at Next 25, and hear how Snap re-architected their inference platform.
Data is the fuel for AI, and organizations are racing to leverage enterprise data to build AI agents, intelligent search, and AI-powered analytics for productivity, deeper insights, and a competitive edge. To power their data clouds, tens of thousands of organizations already choose BigQuery and its integrated AI capabilities.
This decade requires AI-native, multimodal, and agentic data-to-AI platforms, and BigQuery is leading the way as the autonomous data-to-AI platform. Finally, we have a platform that infuses AI, makes unstructured data a first-class citizen, accelerates open lakehouses, and embeds governance.
As an autonomous data-to-AI platform, BigQuery enables a self-managing multimodal data foundation that’s built for processing and activation of all data types, with advanced engines that can be operated on by specialized agents. The platform’s shared catalog and governance layer helps ensure consistent data access, metadata understanding, and security policies across all data and engines, minimizing silos and simplifying management. BigQuery is built on Google’s global infrastructure, leveraging high-bandwidth networks, low-latency storage, and AI-accelerated hardware (TPUs, GPUs), for virtually unlimited scalability. With our commitment to open standards and AI embedded at every layer, this fully integrated architecture accelerates your journey to AI-driven insights at the lowest cost possible.
AI assistance across the entire data lifecycle
Gemini in BigQuery brings a set of AI-powered assistive capabilities to automate data discovery and exploration, data preparation and engineering, analysis and insight generation, covering the entire data journey.
Thousands of organizations are using Gemini in BigQuery. In fact, usage of code assist in BigQuery grew 350% over the last 9 months, with over a 60% code generation acceptance rate across SQL and Python.
Yesterday we announced the general availability of several additional Gemini in BigQuery features and added new capabilities that further enhance and automate your analytics workflows.
Simplify data preparation: Gemini-assisted data preparation in BigQuery (GA) provides intelligent suggestions for data enrichment, easily identifies and rectifies data inconsistencies, provides low-code visual data pipelines, and automates the execution and monitoring of your data pipelines.
Faster time to insights with data canvas: BigQuery data canvas allows you to find, transform, query, and visualize data using natural language prompts and a graphic interface. New dataset-level insights (preview) can surface hidden relationships between tables and generate cross-table queries by integrating query usage analysis and metadata.
Boost productivity with coding assistance for DataFrames: With AI code assistance in BigQuery, you can use natural language prompts to generate or suggest code in SQL or Python, or to explain an existing SQL query. We are now extending these code assist capabilities to BigQuery DataFrames (preview).
Improve data and AI governance: New automated metadata generation (preview) uses profile scans and Gemini to create clear and consistent descriptions for columns, tables, and glossary terms, even with large datasets. This metadata improves governance and helps AI agents find the data they need for exploration and analysis.
Accelerate BigQuery migrations: SQL translation assistance (GA) is an AI-based translator that lets you create Gemini-enhanced rules to customize your SQL translations. You can describe changes to the SQL translation output using natural language prompts or specify SQL patterns to find and replace. This can also help in rapidly increasing familiarity with BigQuery SQL.
A multimodal autonomous data foundation
BigQuery helps you develop an autonomous data foundation by unifying analytics capabilities across diverse data types and enabling the seamless, concurrent analysis of both structured and unstructured data within a single platform. In fact, customer data in BigQuery grew nearly 30% last year, adding to the multiple exabytes already stored. Furthermore, its native, first-party integration with Vertex AI allows you to apply powerful AI models directly to your data, eliminating the requirement for complex data movement or replication.
“BigQuery and Vertex AI bring all our data and AI together into a single platform. This has transformed how we take action on customer feedback from a lengthy manual process, to a simple natural language query in seconds, allowing us to get to customer insights in minutes instead of months.” – TJ Allard, Lead Data Scientist, Mattel
Yesterday we announced several innovations to enhance our unstructured data support and AI processing:
BigQuery tables for Apache Iceberg (preview): Connect your Apache Iceberg data to SQL, Spark, AI, and third-party engines in an open and interoperable manner so you can get the flexibility of an open data lakehouse alongside the performance and integrated tooling of BigQuery. This offering provides adaptive and autonomous table management, high-performance streaming, automatic AI-generated insights, near-infinite serverless scale, and advanced governance.
Native multimodal support for BigQuery tables: Built on object tables, the new ObjectRef data type (preview) enables storage and querying of unstructured and structured data using Python and SQL functions.
Multimodal capabilities for Python users: The BigQuery DataFrames library now has multimodal capabilities for unified structured and unstructured analytics, AI operators for semantic insights, and Gemini code assistance.
Easy capture of unstructured data processing output: New BigQuery ML capabilities in preview include AI.GENERATE_TABLE for capturing the output of LLM inference within SQL clauses (see the first sketch after this list). Additionally, we’ve expanded model choice to include Anthropic’s Claude, Llama, and Mistral models, and open-source models hosted on Vertex AI.
Scalable, faster, and cost-efficient vector search: BigQuery vector search allows you to generate, manage, and search embeddings within a serverless, fully integrated environment for powerful analytics. We are introducing a new index type (GA) based on Google’s ScaNN model coupled with a CPU-optimized distance computation algorithm, enabling scalable, faster, and more cost-efficient processing (see the vector search sketch after this list).
Easier time-series forecasting in BigQuery ML: BigQuery ML simplifies time-series forecasting with the new TimesFM model (preview). This pretrained model, developed by Google Research, is user-friendly, accurate, fast, and scalable (see the forecasting sketch after this list).
Pinpoint the key factors driving changes in your metrics: Organizations constantly need to answer questions like “Why did our sales drop last month?” Answering these “why” questions accurately is vital, but often involves complex manual analysis. BigQuery’s contribution analysis feature (GA) helps you pinpoint the key factors (or combinations of factors) responsible for the most significant changes in a metric (see the contribution analysis sketch after this list).
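To make two of the items above concrete, here is a minimal sketch of AI.GENERATE_TABLE and of ScaNN-backed vector search in BigQuery SQL. The dataset, table, and connection names are illustrative, and the remote Gemini model is assumed to exist already; check the BigQuery ML and vector search documentation for the authoritative syntax.

```sql
-- Sketch: capture structured LLM output with AI.GENERATE_TABLE (preview).
-- Assumes a remote Gemini model was created over an existing connection, e.g.:
--   CREATE OR REPLACE MODEL mydataset.gemini
--     REMOTE WITH CONNECTION `us.my_connection`  -- illustrative names
--     OPTIONS (endpoint = 'gemini-2.0-flash-001');
SELECT *
FROM AI.GENERATE_TABLE(
  MODEL `mydataset.gemini`,
  (
    SELECT CONCAT('Extract the product name and sentiment from: ', review_text) AS prompt
    FROM `mydataset.reviews`
  ),
  STRUCT('product_name STRING, sentiment STRING' AS output_schema)
);

-- Sketch: ScaNN-based vector search with the TREE_AH index type (GA).
CREATE VECTOR INDEX IF NOT EXISTS reviews_embedding_idx
ON `mydataset.review_embeddings`(embedding)
OPTIONS (index_type = 'TREE_AH', distance_type = 'COSINE');

SELECT query.id AS query_id, base.id AS match_id, distance
FROM VECTOR_SEARCH(
  TABLE `mydataset.review_embeddings`, 'embedding',
  (SELECT id, embedding FROM `mydataset.query_embeddings`),
  top_k => 10, distance_type => 'COSINE');
```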
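The forecasting and contribution analysis items lend themselves to similar sketches. Table and column names below are placeholders, and the exact argument lists may differ by release, so treat this as an outline rather than a definitive reference.

```sql
-- Sketch: zero-training forecasting with the pretrained TimesFM model (preview).
SELECT *
FROM AI.FORECAST(
  TABLE `mydataset.daily_sales`,
  data_col => 'revenue',
  timestamp_col => 'day',
  horizon => 30);

-- Sketch: contribution analysis (GA) to explain a metric change between a
-- control period and a test period flagged in an is_test column.
CREATE OR REPLACE MODEL `mydataset.sales_drivers`
OPTIONS (
  model_type = 'CONTRIBUTION_ANALYSIS',
  contribution_metric = 'SUM(sales)',
  dimension_id_cols = ['region', 'channel'],
  is_test_col = 'is_current_month'
) AS
SELECT * FROM `mydataset.sales_snapshots`;

SELECT * FROM ML.GET_INSIGHTS(MODEL `mydataset.sales_drivers`);
```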
Simplified and unified governance in BigQuery
BigQuery offers built-in governance capabilities that simplify how you discover, manage, monitor, govern, and use your data and AI assets. BigQuery universal catalog brings together a data catalog (formerly known as the Dataplex Catalog) and a fully managed, serverless metastore. Yesterday, we announced the following new capabilities for BigQuery governance:
Enable engine interoperability across BigQuery, Apache Spark, and Apache Flink with BigQuery metastore (GA). With support for the Iceberg Catalog, it simplifies data discovery and querying across engines, mirroring the open-source experience.
Empower your organization with a business glossary (GA), which provides a shared understanding of data. Customers can define and administer company terms in a business glossary, identify data stewards for these terms, and attach them to data asset fields to improve context, collaboration, and search.
Perform bulk extract of catalog entries into Cloud Storage with Catalog metadata export (GA). This enables a wide range of use cases including metadata analytics by making the export output queryable from BigQuery, programmatic workloads requiring access to a large scope of metadata, and metadata integration.
Automatic at-scale cataloging of BigLake and object tables (GA): BigQuery harvests up-to-date metadata for structured and unstructured data from Cloud Storage and automatically creates query-ready BigLake tables at scale.
Enhanced enterprise capabilities
BigQuery offers easy managed disaster recovery (GA) for compute and storage. It features automatic failover coordination, continuous near-real-time data replication to a secondary region, and fast, transparent recovery during outages. This provides business continuity with industry-leading recovery point objectives (RPO) and recovery time objectives (RTO).
We are also introducing new workload management capabilities (preview) for isolation, resource control, and observability. Users gain granular control with flexible, securable reservations that let them assign different jobs in the same project to different reservations. Features include reservation-level fair sharing of slots, predictable reservation performance, and enhanced observability through reservation attribution in billing for better cost tracking.
Improved query performance
To further simplify analytics, we introduced several new innovations to help you get the most out of SQL and make your queries work better for you automatically. Query performance optimizations (GA) improve query performance and automatically identify and accelerate relevant workloads with no changes required to the schema or queries. These include:
A low-latency API for short queries enables short-query-optimized mode, improving the overall latency of short queries that are common in workloads such as data exploration or dashboard building by executing the query and returning the results inline for SELECT statements.
History-based optimizations use information from previously completed executions of similar queries to apply additional optimizations that further improve query performance, such as query latency and slot-time consumed.
Column metadata index (CMETA) provides near-infinitely scalable and highly performant metadata management for BigQuery: you can go from 10 GB tables to 100 PB and still get great price/performance, without having to worry about redesign or replatforming.
New analytics capabilities
SQL-based continuous queries (GA): Simplify real-time data processing by expressing complex transformations in SQL. You can run continuously processing SQL statements to help analyze, transform, and reverse-ETL data the moment new events arrive in BigQuery. This feature now supports slot autoscaling, greater monitoring through Cloud Monitoring, and exports to other clouds (see the continuous-query sketch after this list).
Simplify SQL with BigQuery pipe syntax (GA): This unique feature extends standard SQL to make it simpler, more concise, and flexible. Pipe syntax lets you apply operators in any order and as often as you need, streamlining SQL queries for tasks like data exploration, dashboard creation, and log analysis (see the side-by-side sketch after this list). Pipe syntax enhances clarity, efficiency, and maintainability, and its compatibility with most standard SQL operators ensures broad usability.
Geospatial analytics (preview): We’re integrating rich, analysis-ready geospatial datasets from Earth Engine and Google Maps Platform directly into BigQuery data clean rooms. And with the ST_RegionStats function, BigQuery users can now use Earth Engine to efficiently extract statistics from raster data. For the first time, data analysts and decision-makers can access geospatial insights from Google Maps Platform and Earth Engine that lead to more informed and faster business and sustainability outcomes. Key decisions such as optimal site selection for a new business location, how to optimize operations and maintenance of your infrastructure assets, how to enable sustainable sourcing, and more are now enabled directly in BigQuery.
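Returning to the first two items in this list, the sketch below shows a continuous query that reverse-ETLs new rows to Pub/Sub, followed by the same rollup written in standard SQL and in pipe syntax. The project, topic, table, and column names are placeholders, and Pub/Sub export requirements (such as the single message output column) should be verified against the current documentation.

```sql
-- Sketch: a continuous query exporting new rows to Pub/Sub as they arrive.
-- Run under BigQuery's continuous job mode; names are illustrative.
EXPORT DATA
OPTIONS (format = 'CLOUD_PUBSUB',
         uri = 'https://pubsub.googleapis.com/projects/my-project/topics/new-orders')
AS (
  SELECT TO_JSON_STRING(STRUCT(order_id, region, amount)) AS message
  FROM `mydataset.orders`
);

-- The same rollup in standard SQL ...
SELECT region, SUM(amount) AS total_amount
FROM `mydataset.orders`
WHERE order_date >= DATE '2025-01-01'
GROUP BY region
ORDER BY total_amount DESC
LIMIT 5;

-- ... and in pipe syntax (GA): operators apply top to bottom, in any order,
-- as often as needed.
FROM `mydataset.orders`
|> WHERE order_date >= DATE '2025-01-01'
|> AGGREGATE SUM(amount) AS total_amount GROUP BY region
|> ORDER BY total_amount DESC
|> LIMIT 5;
```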
Continued innovation with the ISV ecosystem
Finally, BigQuery’s capabilities are being significantly extended by its vibrant partner ecosystem, through new and enhanced AI integrations and solutions. Anthropic’s Claude models are now accessible via BigQuery ML, facilitating functions like text generation and summarization. GrowthLoop introduced its Compound Marketing Engine built on BigQuery with Growth Agents powered by Gemini, so marketing can build personalized audiences and journeys that drive rapidly compounding growth. Furthermore, Informatica is expanding their services on Google Cloud to enable sophisticated analytical and AI governance use cases.
Significant advancements have also occurred in data management and observability. Fivetran introduced its Managed Data Lake Service for Cloud Storage with native integration with BigQuery metastore and automatic data conversion to open table formats like Apache Iceberg and Delta Lake, improving data lake management and discoverability. DBT is now integrated with BigQuery DataFrames and DBT Cloud is now on Google Cloud. Finally, Datadog has introduced expanded monitoring capabilities for BigQuery, providing granular visibility into query performance, usage attribution, and data quality metrics.
These partner innovations provide customers with expanded functionality, improved operational control, and streamlined access to sophisticated capabilities within the BigQuery ecosystem.
A data-to-AI platform for the autonomous era
BigQuery is evolving beyond a data warehouse and becoming the autonomous data-to-AI platform for all your data teams. The Gemini-powered agents, unified architecture, and commitment to open standards are lowering the barriers to entry for AI-powered analytics and enabling you to focus on what you do best: building innovative models and driving data-driven decisions.
As we bring together more capabilities within a unified platform, we are making it easy for you to consume and use the platform with unified commercials through our new BigQuery spend commit. This provides commitments across our BigQuery unified platform, giving you the flexibility to move spend across data processing engines, streaming, governance, and more.
Learn more about BigQuery and start exploring how these new features can transform your organization.
Special thanks to Geeta Banda, Head of Outbound Product Management, for her contributions to this blog post.
Data is the critical foundation for AI, yet a vast amount of data’s potential remains untapped. Why? Data quality remains a top barrier. To use enterprise data to drive analytics-driven decisions and build differentiated AI, businesses need to be able to find, understand, and trust their data assets. This requires effective data governance encompassing discovery, cataloging, metadata management, quality assurance, sharing, and access control.
The stakes are high. According to Gartner, “through 2026, those organizations that don’t enable and support their AI use cases through an AI-ready data practice will see over 60% of AI projects fail to deliver on business SLAs and be abandoned.”
At Google Cloud Next 25, we’re announcing BigQuery unified governance, powerful data governance capabilities that help enterprises keep pace with governance complexities. Data silos, fragmented metadata, and ambiguous ownership create significant risks and impede innovation. BigQuery unified governance provides services and tools organizations need to simplify data management and unlock actionable insights.
BigQuery’s built-in, intelligent governance simplifies data and AI management, helping organizations discover, understand, and leverage their assets, transforming governance from a burden into a powerful tool for data activation. Central to BigQuery governance is BigQuery universal catalog, a unified, AI-powered data catalog that natively integrates Dataplex, BigQuery sharing, security and metastore capabilities, bringing together business, technical, and runtime metadata.
BigQuery’s unified governance capabilities are:
1. Unified: BigQuery brings governance directly into the heart of your data-to-AI lifecycle, enabling discovery, understanding, governance, and utilization of your data assets and AI models. This gives data administrators, stewards, and custodians robust tools for metadata management and policy enforcement, providing end-to-end data-to-AI lineage, data profiling, insights, and secure sharing. And with the new universal semantic search, finding the right data is as simple as asking a question in natural language.
2. Intelligent: New governance capabilities powered by gen AI stand to revolutionize data management. By harnessing the power of large language models (LLMs), BigQuery universal catalog can help you uncover hidden relationships between BigQuery data assets, enable automated metadata curation and intelligent query recommendations at scale, automate governance, and democratize data-driven insights across the organization.
3. Open: BigQuery universal catalog insulates you from change with support for open storage standards such as Apache Iceberg, and a unified runtime metastore across SQL, open-source engines, and AI/ML. The BigQuery metastore, which is included in the BigQuery universal catalog, is Iceberg-compliant, enabling a multi-engine, multi-vendor architecture for governance and use of fully managed Iceberg data.
ANZ Bank, a multinational banking and financial services provider, uses the BigQuery universal catalog for comprehensive data governance, discovery, and observability.
“With BigQuery universal catalog, ANZ has significantly improved the reliability and trustworthiness of our data. The centralized data quality monitoring and automated validation features are increasing confidence and efficiency in critical business outputs and decisions based on accurate and consistent information. BigQuery governance has become a cornerstone of our data governance strategy, ensuring our data is not just available, but dependable.” – Artur Kaluza, Head of Data Strategy and Transformation, Risk, ANZ
Noteworthy features
The new unified governance experience in BigQuery provides a centralized interface within the BigQuery UI for managing, securing, and sharing data and AI assets. In addition, we are introducing a wide range of key new features and capabilities across governance, sharing, and security.
Governance
1. Full-catalog search with semantic understanding (preview): Users can now discover data and AI resources across projects and data silos within BigQuery using full-catalog semantic search. This feature introduces natural-language search capabilities, making it easier for both technical and non-technical users to search the catalog.
2. Automated metadata curation (preview): BigQuery universal catalog can now automatically generate metadata for BigQuery tables, including table and column descriptions, improving data discovery and supporting gen AI applications.
3. AI-powered knowledge engine (preview): Users can efficiently discover hidden relationships within a dataset with automated entity-relationship visualization. By leveraging inferred relationships, BigQuery universal catalog generates suggestions for cross-table queries and natural language questions, getting new data teams up to speed fast on unfamiliar data assets.
4. Data products (preview): BigQuery data products allow data owners to create, share, and govern collections of data assets by use case, packaging and sharing them within and across organizations in a way that’s consistent, governed, and that follows security best practices.
5. Business glossary (GA): The BigQuery business glossary provides organizations with a shared understanding of their data. Customers can define and administer company terms, identify data stewards for these terms, and attach them to data asset fields, improving context, collaboration, and search.
6. Automatic at-scale cataloging of BigLake and object tables (GA): BigQuery universal catalog harvests up-to-date metadata for structured and unstructured data from Cloud Storage, and uses it to automatically create query-ready BigLake tables at scale.
7. Automated anomaly detection (preview): BigQuery universal catalog automates data anomaly detection to help you identify data errors, inconsistencies, and outliers in your data, reducing the time you spend identifying and resolving data issues.
Full catalog search with semantic understanding
Automated metadata curation
Sharing
8. BigQuery sharing integration with Google Cloud Marketplace (preview): Data owners can monetize datasets in BigQuery sharing (formerly Analytics Hub) through Google Cloud Marketplace.
9. Stream sharing in BigQuery (GA): Curate and share valuable real-time streams with Pub/Sub topics in BigQuery sharing.
10. Stored procedure sharing in BigQuery (preview): Share SQL stored procedures and enable execution in the subscriber’s project without revealing the actual code.
11. Query template sharing in BigQuery (preview): Customize, reuse, and restrict SQL queries in a clean room through publisher-defined query templates.
Security
12. Data policies on columns (preview): Create raw-access and data-masking policies that attach directly to a column and can be reused across columns and tables.
13. Subquery support with row-level security (GA): BigQuery universal catalog now supports SQL subqueries in security access policy definitions, enabling row filtering without changing existing data models (see the sketch below).
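As a minimal sketch of item 13, the row access policy below filters rows with a subquery against an entitlements table; all names are illustrative.

```sql
-- Sketch: row-level security with a subquery (GA). Each analyst sees only
-- the regions mapped to their email in an entitlements table.
CREATE ROW ACCESS POLICY region_policy
ON `mydataset.sales`
GRANT TO ('group:analysts@example.com')
FILTER USING (
  region IN (
    SELECT region
    FROM `mydataset.analyst_regions`
    WHERE analyst_email = SESSION_USER()
  )
);
```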
These built-in governance advancements within the BigQuery platform help organizations unlock the full potential of their data and AI initiatives.
In addition to the innovation in BigQuery, we continue to partner with third-party catalog providers to complement their governance capabilities. For example, Collibra’s enterprise-wide governance for data and AI extends BigQuery universal catalog capabilities to provide end-to-end visibility, quality and stewardship across hybrid and multicloud environments. This partnership helps ensure more teams can discover and trust the data they need to do AI, no matter where it lives, accelerating and strengthening every use case.
By embedding governance into BigQuery and automating metadata management, BigQuery universal catalog is helping businesses move beyond the challenges of data silos and operational inefficiency, ultimately driving innovation and accelerating business impact. Ready to learn more? You can join several sessions covering the latest in BigQuery governance, sharing, and security, featuring customer speakers.
From unraveling the mysteries of our planet and the universe, to accelerating medical research and industrial innovation, scientific discovery impacts nearly every facet of human life. Today, scientific progress depends on the interplay of theory, experimentation, and computation, and increasingly, the most important and challenging problems require high-performance computing (HPC) and other advanced computing technologies and techniques.
In recent years, artificial intelligence (AI) has emerged as a powerful tool for information assessment and generation, while also becoming a powerful tool for scientific discovery, business innovation, and productivity. More recently, advances in quantum computing are increasing our confidence in shortening the timelines to solving problems beyond the reach of classical computers. Quantum computers under development now will lead to larger production systems that will catalyze the creation of new drugs and materials, reduce costs and risks in complex financial and logistics scenarios, and enable the development of more capable AI models.
At Google, our vision is to be the most comprehensive, capable, and accessible platform for science. Since 2008, Google Cloud has powered scientific discoveries, providing computational and data storage capabilities — including HPC clusters — to scientists, engineers, and developers worldwide. And this week, to enable continued revolutionary new science, we are bringing the best of Google DeepMind and Google Research together with new infrastructure and AI capabilities in Google Cloud, providing researchers with highly capable, cloud-scale tools for scientific computing. These new capabilities include:
Supercomputing-class infrastructure for scientific computing: Researchers can now deploy and use supercomputing clusters built on the latest H4D VMs, powered by AMD CPUs, and A4/A4X VMs, powered by the latest NVIDIA GPUs. These VMs feature new low-latency networking that provides supercomputer-like scaling and performance. We’re also announcing Google Cloud Managed Lustre for high-performance storage I/O. These resources will enable scientists to tackle large-scale, complex science problems.
Advanced scientific applications powered by AI models for weather forecasting and biology: We’re now offering our first AI-powered science applications for the broader science community: AlphaFold 3 for predicting the structure and interactions of biomolecules, and WeatherNext models for weather forecasting.
AI agents for quicker ideas and faster discovery: Two new AI agents in Google Agentspace – Deep Research and Idea Generation – can help prepare comprehensive research reports and rapidly generate new scientific hypotheses.
Let’s take a look at these new capabilities in more detail.
Supercomputing-class infrastructure and tools for science
Supercomputers are designed to achieve maximum performance on very large problems, as well as to train large AI models. With ongoing advances in science and AI, quick and easy access to supercomputing resources is critical.
Researchers can now deploy and use supercomputing-class HPC clusters in Google Cloud based on new H4D VMs (virtual machines), our most powerful CPU-based VMs, which use 5th Generation AMD EPYC™ Processors. H4D clusters are connected with Remote Direct Memory Access (RDMA) networking utilizing Google’s Falcon and Titanium offload technologies, providing low-latency communications for HPC applications. By using standard message-passing libraries over RDMA, H4D VMs can efficiently scale applications up to tens of thousands of cores, resulting in faster time-to-solution. You can register for the H4D VM preview here.
Harvard University is using Google Cloud to advance heart disease research by simulating large-scale systems of red blood cells and other structures, including magnetically controlled artificial bacterial flagella (ABF), with the goal of developing therapies to attack and dissolve blood clots and circulating tumor cells in human vasculatures.
“With the power of Google’s new H4D-based clusters, we are poised to simulate systems approaching a trillion particles, unlocking unprecedented insights into circulatory functions and diseases. This leap in computational capability will dramatically accelerate our pursuit of breakthrough therapeutics, bringing us closer to effective precision therapies for blood vessel damage in heart disease.” – Petros Koumoutsakos, Harvard University
Professor Koumoutsakos’ research involves the simulation of blood flowing in a microfluidics device which is designed to capture circulating tumor cells.
HPC clusters based on our recently announced A4 and A4X VMs are also a critical component of our scientific discovery portfolio. A4 VMs, built on NVIDIA’s latest HGX B200 GPUs, are a versatile and powerful tool for multiple scientific computing applications, offering excellent performance for direct numerical simulation and for AI training. A4X VMs, accelerated by NVIDIA GB200 NVL72 GPUs, are purpose-built for training and serving the most demanding, extra-large-scale AI workloads.
Clusters using these GPU-powered VMs can also unlock supercomputing-class performance for the next frontier of innovation: quantum computing. In the future, quantum computing systems will allow scientists to solve problems that are intractable even with the most powerful traditional supercomputers. In the meantime, HPC clusters based on A-series VMs can be used to design tomorrow’s quantum computers and optimize quantum algorithms, by simulating large quantum circuits using the quantum simulation solution blueprint.
For example, Google Research’s Quantum AI team leverages Google Cloud to simulate the intricate device physics of quantum hardware, develop sophisticated hybrid quantum-classical algorithms, and explore and test novel quantum algorithms. This robust simulation environment facilitates scientific breakthroughs by delivering the performance and scalability essential for demanding quantum research workflows.
“We observed excellent scalability simulating a 43-qubit circuit with a depth of 30 on Google Cloud’s new GPU-based supercomputers. These results underscore the potential for researchers to develop and test larger and deeper quantum circuits, which is important for understanding the performance of quantum algorithms and accelerating progress toward applications for today’s quantum computers.” – Sergio Boixo, Director, Computer Science, Google Quantum AI
HPC clusters demand high I/O performance to keep computational performance from stalling. Our new Google Cloud Managed Lustre storage service, developed in collaboration with DataDirect Networks and based on EXAScaler technology, provides the I/O performance needed for supercomputing-scale applications. Google Cloud Managed Lustre delivers a high-performance, fully-managed parallel file system optimized for HPC and AI applications. With petabyte-scale capacity and up to 1 TB/s throughput, Managed Lustre ensures researchers have the I/O performance they need to power their scientific discoveries. Request access to the Managed Lustre preview by contacting your account representative.
Advanced scientific applications powered by AI models
We recently announced our first AI-powered science applications for researchers and enterprises on Google Cloud: the groundbreaking AlphaFold 3 molecular structure and interaction prediction model, and the WeatherNext weather forecasting models.
AlphaFold 3, developed by Google DeepMind and Isomorphic Labs, is revolutionizing biology through its ability to predict the structure and interactions of all of life’s molecules with unprecedented accuracy. Understanding molecular structures and their interactions helps researchers better grasp complex processes in human health and disease. AlphaFold 3 is now available for non-commercial use on Google Cloud.
“Having access to the scientific capabilities of AlphaFold on Google Cloud can help our research rapidly predict and explore the structure and interactions of all biomolecule classes. This change in capability will accelerate our understanding of diseases and enable the generation of therapeutic hypotheses.” – Sumaiya Iqbal, Senior group lead of the Ladders to Cures Accelerator, Broad Institute
To further support users, we’re simplifying access to AlphaFold 3 through a new high-throughput solution deployable via Cluster Toolkit. This turnkey solution enables efficient batch processing of hundreds to tens of thousands of sequences while minimizing costs by autoscaling infrastructure.
In the domain of weather, Google DeepMind and Google Research WeatherNext models use AI for fast and accurate weather forecasting, and we recently released live WeatherNext AI forecasts on BigQuery and Earth Engine. Today, we’re introducing access to WeatherNext AI models via Google Cloud’s Vertex AI Model Garden, enabling practitioners to customize and deploy these advanced models for energy prediction, logistics, agriculture, risk management, and more.
With easier and more affordable access to faster and more accurate weather forecasting models, researchers can study far more scenarios, and organizations can better prepare for weather events — such as heat waves, floods, and hurricanes — to reduce their impact on infrastructure, personnel, supply chains, and communities.
WeatherNext Graph forecasts visualized in Google Earth Engine, showing forecasted wind speed, wind direction, and precipitation as of September 8, 2023. The visualization demonstrates the projected path of Hurricane Lee over the Atlantic Ocean.
For instance, Carrier plans to leverage Google Cloud’s WeatherNext AI models as part of its Home Energy Management System (HEMS) to help enhance grid flexibility and enable smarter energy management. Once deployed, WeatherNext AI models are expected to help HEMS intelligently manage energy flows in real time — charging, discharging, and redirecting energy based on grid conditions, energy demands, and weather forecasts — contributing to a more balanced and sustainable energy grid.
Using AI as the ultimate research partner
Google’s robust ecosystem of information, productivity, and advanced AI tools has long helped drive scientific research, providing researchers with information and insight. Google Scholar is an indispensable resource for navigating the vast landscape of scientific literature and for discovering and tracking relevant publications. Then there’s Gemini, which can synthesize, summarize and explain information from highly scientific and technical content. And NotebookLM, an AI-powered research assistant, intelligently processes and summarizes selected research papers and datasets, dramatically accelerating literature reviews and extracting crucial information.
We’re excited to announce two new AI agents in Agentspace that have the potential to further accelerate scientific research and revolutionize hypothesis generation. Deep Research condenses hours of research by synthesizing information across internal and external sources to generate in-depth research reports. Idea Generation helps rapidly develop novel ideas through AI agents that create ideas, then test them against each other to find the best hypotheses.
Scientists can also leverage AI Studio and Vertex AI on Google Cloud to develop customized AI applications and advanced machine learning workflows. We also recently announced Gemma 3, a collection of lightweight, state-of-the-art open models built from the same research and technology that powers our Gemini 2.0 models. These are our most advanced, portable, and responsibly developed open models yet, and can be used to create scientific applications on local devices. Finally, Google Research’s Geospatial Reasoning framework, leveraging Vertex AI Agent Engine, will allow scientists and analysts to unlock powerful insights about the world through new geospatial foundation models and generative AI.
Enabling transformational science today and tomorrow
Together, this new advanced infrastructure, along with AI applications and AI productivity technologies, provides new cloud-scale scientific capabilities for all kinds of computational science research. Combined with our discovery, collaboration, and productivity tools, we are giving scientists and researchers a comprehensive array of cloud-powered scientific capabilities.
Argonne National Laboratory, a leading laboratory for open science computational research, is working with Google Cloud to explore how advanced computing technologies and AI tools can empower scientists and engineers to make groundbreaking discoveries faster than ever. Through the collaboration, ANL will use and evaluate Google Cloud solutions for computational research, providing feedback and guidance to further advance the design, performance, and usefulness of Google Cloud for supercomputing-scale science.
“Having access to powerful computational capabilities is critical for making new scientific discoveries and accelerating innovations that power business and society. We are eager to work with Google Cloud to leverage their comprehensive, global-scale AI and HPC infrastructure, software technologies and AI-powered applications such as AlphaFold 3. Argonne National Laboratory’s collaboration with Google Cloud will effectively drive innovation and enable discoveries that change the world — and bring these capabilities to researchers everywhere.” – Rick Stevens, Associate Laboratory Director for Computing, Environment and Life Sciences, Argonne National Laboratory
Scientific discoveries are more important than ever for solving the world’s greatest challenges. At Google, we’re building powerful advanced computing technologies to enable scientific discoveries and innovations, and we are excited to bring all these capabilities together in Google Cloud.
The transformative power of AI and intelligent agents is driving profound changes, where software can understand natural language questions and commands — and even autonomously act on our behalf. At the heart of this revolution is the “AI-ready” enterprise database, an active, intelligent engine that understands the semantics of structured and unstructured data, and uses the power of foundation models to create a platform where you can unlock unprecedented opportunities from enterprise data.
This week at Google Cloud Next, we’re announcing several new capabilities in AlloyDB AI to accelerate intelligent agent and application development. These include advanced semantic search with high-performance filtered vector search, automatic vector index maintenance, and a major increase in search quality through the newly launched AlloyDB AI query engine and the Vertex AI Ranking API. The AI query engine also brings AI-powered operators for filtering to SQL queries.
We’re also launching natural language capabilities to provide users and agents with deep insights from natural language questions. Taken together, these innovations position AlloyDB as the foundation for agentic AI, evolving the database beyond data storage and conventional SQL querying to a future where intelligent agents can converse with the data and autonomously explore it on our behalf.
High-performance, high-quality, and easy semantic search
Modern apps require smart data retrieval that combines structured data with unstructured, multimodal data such as text and images. Previously, AlloyDB AI enabled semantic queries over unstructured data, deeply integrating vector search with PostgreSQL so search results are always up to date. Our next set of AlloyDB AI capabilities addresses customer requests for higher performance, better search result quality, and low-cost automated maintenance.
Adaptive filtering: This innovative technique, now in preview, helps ensure that filters, joins, and vector indexes deliver optimal performance when used together. Adaptive filtering optimizes the query plan once it learns the actual filter selectivity as it accesses data, and can then switch between filtered vector search methods as appropriate.
Vector index auto-maintenance, also in preview, reduces how often you need to rebuild your vector indexes, while ensuring that vector indexes remain accurate and performant even as data changes. You can enable vector index auto-maintenance during index creation or when altering the index.
Reranking: The newly released AlloyDB AI query engine can enhance semantic search by combining vector search with high-accuracy AI reranking through the new Vertex AI cross-attention Ranking API. Our reranking capability uses vector search to efficiently generate initial candidates (such as the top N) and then applies the high-quality cross-attention Ranking API to accurately determine the final best results (such as the top 10) from those candidates. To give you as much flexibility as possible, AlloyDB AI can connect with any third-party ranking API, including custom ones.
Recall evaluator: Now generally available, this capability provides the transparency you need for managing and tuning the quality of vector search results. With a simple stored procedure, you can evaluate end-to-end recall for any query, including complex ones with filters, joins, and reranking.
Parallel index build: Now generally available, index build parallelization lets developers build indexes over as many as 1 billion rows in just a few hours, several times faster than before. To support this capability, AlloyDB AI spins up parallel processes to distribute the workload and create indexes faster.
These improvements are made possible by the deep integration of AlloyDB AI’s Scalable Nearest Neighbors (ScaNN) vector index with the PostgreSQL query planner, and they lead to notably faster performance (see the index-creation sketch after these numbers):
10x faster index creation when compared to the HNSW index in standard PostgreSQL.
4x faster vector search when compared to the HNSW index in standard PostgreSQL.
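For context, here is a minimal sketch of creating a ScaNN index in AlloyDB. The extension name and index method reflect AlloyDB's ScaNN support; the table, the three-dimensional embedding (for brevity), and the num_leaves value are illustrative.

```sql
-- Sketch: ScaNN vector index in AlloyDB (PostgreSQL-compatible).
CREATE EXTENSION IF NOT EXISTS alloydb_scann;

-- Assumes a populated table such as: products(id, name, embedding vector(3)).
CREATE INDEX product_embedding_idx
  ON products USING scann (embedding cosine)
  WITH (num_leaves = 100);

-- A nearest-neighbor query that can use the index (cosine distance operator).
SELECT id, name
FROM products
ORDER BY embedding <=> '[0.1, 0.2, 0.3]'::vector
LIMIT 10;
```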
AlloyDB AI natural language
Natural language interfaces on databases showed great progress in 2024, backed by AI technology that turns questions, posed either by end users or by agents, into SQL queries that provide answers.
To further improve accuracy, a quantum leap was needed. Building on the natural language support announced last year, we’re introducing new capabilities to help you build interactive natural language user interfaces that decipher user intent accurately, and can build highly-accurate mappings of user questions to SQL queries that answer them.
Disambiguation: Natural language is inherently ambiguous. The AlloyDB AI natural language interface will ask follow-up questions when it needs more information about user intent. Since ambiguity is often rooted deep in the data, the database is best positioned to resolve it.
For example, a question may refer to “John Smith,” but there may be two John Smiths in the database, or perhaps there’s a “Jon Smith,” whose first name was spelled differently, or even misspelled. AlloyDB concept types and the AlloyDB values index enable finding the relevant entities and their concepts when they’re not obvious from the question.
High accuracy and intent explanation: AlloyDB AI natural language uses plain templates, which correspond to parameterized SQL queries, and faceted templates for providing highly-accurate, virtually-certified answers to predictable and important classes of questions.
For example, a retailer’s product search page could theoretically include dozens of product properties — far too daunting for a screen-based faceted search interface. In contrast, a faceted search template, even with one simple search field, can answer any question that directly or indirectly poses any combination of property requirements. AlloyDB can automatically produce templates from query logs, and you can provide additional templates to boost query coverage. To ensure confidence in results, AlloyDB offers a transparent explanation of its interpretation of user inquiries.
High accuracy and flexibility: For cases where questions are not predictable but answering them must remain flexible, AlloyDB lets you raise accuracy by automatically enriching the context used to map a question to SQL with the rich information found in the schema, the data itself (such as sample values, which can greatly enhance accuracy), and the query logs.
Parameterized secure views: AlloyDB offers parameterized secure views, a new kind of database view that locks down access to end-user data at the database level, to help protect against prompt injection attacks.
Beyond AlloyDB with Agentspace: AlloyDB AI natural language is available in Google Agentspace for building your own agents that, for example, may answer questions by combining AlloyDB data with data from other sources, such as the web or another database.
AlloyDB AI query engine
To empower you to build intuitive and powerful AI applications, AlloyDB AI query engine can unlock deep semantic insights from enterprise data through AI-powered SQL operators. AI query engine leverages Model Endpoint Management, a mechanism for calling any AI model on any platform.
Let’s review AlloyDB AI query engine and other capabilities newly available in AlloyDB AI via new AI models:
AI query engine: AlloyDB SQL now features simple but powerful AI operators: AI.IF() for filters and joins, and AI.RANK() for ordering (see the sketch after this list). These operators use natural language in SQL queries to express filtering conditions and ranking criteria. They can use foundation models to bring reasoning and real-world knowledge to SQL queries, and they can use cross-attention models, which also draw their power from foundation models and their real-world knowledge. In particular, AI.RANK() can use the Vertex AI Ranking API to find the most relevant results.
Multimodal embedding generation: Previously, AlloyDB AI enabled a SQL developer to easily generate embeddings from text in SQL statements. We’ve expanded this capability to generate embeddings for any modality (text, images, and videos) so you can search using any modality.
Updated text embedding generation: AlloyDB AI query engine provides out-of-the-box integration with the text-embedding generation model from Google DeepMind.
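To illustrate, here is a rough sketch of how the new operators might appear in a query. The operator names come from this announcement, but the argument shape shown (a single natural-language prompt concatenated with column text) is an assumption; consult the AlloyDB AI documentation for the exact signatures and required model configuration.

```sql
-- Sketch only: argument shape is an assumption, not confirmed syntax.
SELECT id, name, description
FROM products
WHERE AI.IF('The following product description mentions waterproofing: '
            || description)                       -- natural-language filter
ORDER BY AI.RANK('Rank by relevance to "lightweight winter hiking boots": '
                 || description) DESC             -- model-scored ordering
LIMIT 10;
```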
Getting started
We believe today’s AlloyDB AI announcements — enhanced filtered vector search, next-generation natural language support, and the AI query engine — are the foundation for the future of databases. They provide proactive insights for agents that anticipate and act decisively, powered by AI-ready data. AlloyDB AI is building a database revolution, empowering you to step boldly into this intelligent future and unlock your data’s boundless potential.
Migrating data workloads to BigQuery, our unified data-to-AI platform, just got significantly easier. You no longer have to choose between unlocking value from your data assets by migrating to a modern data platform and mitigating risk by staying put. You can achieve both with BigQuery Migration Services, a collection of free-to-use, cloud-native services that enable large-scale transformations for data warehouses and data lakes by breaking migrations down into templated, iterative, and manageable steps. They move data, code, and business logic from on-premises and cloud platforms to BigQuery, using a “next-best action” approach that minimizes time-to-migrate and maximizes ROI for your business transformation.
At Google Cloud Next 25, we announced several new innovations in BigQuery Migration Services, including coverage for data science and expanding support for data engineering and data analytics workloads. New capabilities span across four stages of a data platform migration: 1) automated assessment and planning, 2) automatic code translation, 3) data migration, and 4) validation.
BigQuery Migration Services
Let’s look at the new innovations in BigQuery Migration Services.
1. Automated discovery and assessment with estimated total cost of ownership
Your data platform migration journey begins with automated discovery and assessment of the source environment. BigQuery Migration Services’ automated assessments provide details of the existing environment, create an insights-filled view of the workloads’ projected landed state on BigQuery (including performance and estimated total cost of ownership), and guide you on how to get to BigQuery (migration planning). You can run an assessment with the push of a button in the Google Cloud console, which delivers a detailed Looker Studio report and BigQuery datasets as output. Assessments are available for Teradata, Snowflake, and Redshift, and today, we also announced that assessments for Oracle/Exadata and Cloudera/Hive are available immediately, and that a Databricks assessment is coming soon.
To help with a structured and successful migration, we also announced a source lineage service in preview. This service automatically identifies and groups dependencies between workloads, creating an explicit ordering in which to move them, helping to minimize risk and disruption, and improving time-to-value.
2. Automated code translations
Some of our heaviest investments in BigQuery Migration Services over the past years have been in our code translation services, which migrate code from 15+ sources. Today, we announced advancements in Gemini-enhanced code translation, which was previously only available in interactive mode, letting you translate code much like you would with, say, Google Translate.
Now, Gemini-enhanced code translations are also available in batch and API modes, helping you migrate at scale. Coupled with a new unified Translation API that backs all three modes, you can first translate bulk code using batch or API modes, and then fine-tune and debug it using interactive mode.
Gemini-enhanced translations
Now, you can also preprocess your code with Gemini, so you can migrate code that’s not just SQL but also other kinds of code, e.g., an ETL job with SQL embedded inside XML. This means you don’t need to submit perfectly clean SQL, and you can translate code from sources that don’t have full compiler coverage yet.
Finally, there’s an enhanced user-experience in the console to guide you at each step of the translation process, suggesting the next-best action to get you to the finish line.
Enhanced User Experience
These advancements dramatically reduce code conversion times while continuing to deliver over 95% accuracy, helping you tackle large migration jobs with greater efficiency.
3. Data, metadata and permissions migration
Historically, BigQuery Migration Services have supported large-scale data migrations from Teradata and Amazon Redshift. Today, BigQuery Migration Services support incremental updates from Teradata, batch and incremental file and permission migrations from Cloudera, and batch and incremental data migration from Snowflake, all in preview. All migrated data is automatically validated as part of the migration process.
4. Intelligent end-to-end validation
Each step of the migration process will soon include an intelligent validation mechanism that can incorporate schema and data-type updates, rather than the static data checksum comparisons that exist today. You can combine validation with source lineage, making it easy to quickly identify discrepancies between source and target environments. This comprehensive code, data, and dependency validation helps ensure your business applications stay intact as you incrementally move them.
Together, these investments in each of the four stages of a data platform migration help automate your journey while containing risk, providing deterministic outcomes, and faster ROI.
Customer successes
Customers trust BigQuery Migration Services for migrating their mission-critical workloads. BigQuery Migration Services usage has grown 3x year over year, with thousands of customers using the services to migrate workloads to BigQuery.
“By migrating from Databricks to BigQuery and combining our own models with the models provided by Google Cloud, we’ve improved the performance and efficiency of our machine learning processes and better positioned ourselves for ongoing growth.” – Hamdi Amroun, Head of AI, Yassir
“BigQuery has unlocked unprecedented scalability and flexibility for VMO2, improving data platform availability and uptime, which ultimately enhances customer experience. By moving all key functions to Google Cloud, VMO2 has reduced its TCO for equivalent on-premises platforms by approximately 30%.” – Vinay Pai, Head of Data Architecture, Virgin Media O2
Take the next steps
Ready to start migrating your data platform to BigQuery? We’re ready to help!
Sign up today for the BigQuery migration incentives program for additional benefits such as Google Cloud credits, implementation services and cloud egress credits.
Government agencies rely on IT providers to deliver secure, compliant, and efficient technology that helps complete their vital missions. At the same time, cost savings and productivity are taking center stage. These priorities – lower cost with better security and productivity – may seem at odds, but with the right cloud provider, they don’t have to be.
Starting today, we are offering Google Workspace at a significant discount for U.S. federal government agencies. Workspace is a FedRAMP High authorized communication and collaboration platform that includes familiar apps, such as Gmail, Drive, Docs, Meet, and more. Workspace comes with the best of Google AI, including Gemini and NotebookLM, at no additional cost, and is infused with efficient, time-saving features, such as real-time collaboration. Hundreds of thousands of personnel across the Department of Energy, the Air Force Research Laboratory, and others have access to Workspace to enhance their productivity and collaboration. Now, with Gemini being the first AI assistant to receive FedRAMP High authorization, Workspace is also paving the way for federal agencies to leverage state-of-the-art AI capabilities in a compliant manner.
Read on to learn more about how Google Workspace could help Federal agencies potentially save up to $2 billion over the next three years with government-wide adoption, while offering improved security and enabling greater productivity.
Cutting costs
Consistent with the U.S. General Services Administration’s strategy of treating the government as a single buyer, we have launched a temporary discount of 71% off the current Multiple Award Schedule (MAS IT) pricing for a bundled offering of Google Workspace Enterprise Plus and Assured Controls Plus, ensuring federal agencies of all sizes can access volume-based pricing. This pricing is effective until September 30, 2025.
Improving security
Workspace takes a unique approach to ensuring that cost and productivity are addressed with security top of mind. First, it delivers a secure, reliable, and compliant cloud infrastructure for all customers, ensuring that government agencies receive the same benefits, capacity, and features at the same pace as commercial customers. Second, Workspace can nullify entire classes of attack vectors, since it doesn’t require client desktop apps or on-premises software. And third, Workspace comes with built-in AI defenses that leverage threat signals from billions of endpoints and Google’s vast threat intelligence. The result? Workspace blocks more than 99.9% of spam, phishing attempts, and malware, and comes with a 99.9% uptime SLA.
Supercharging productivity
Workspace is highly interoperable with other software tools, leading to faster deployment and increased productivity, including an estimated 30% improvement in collaboration. Designed for the cloud, Google Workspace is intuitive, reducing the time to configure workstations by up to 90% and simplifying user onboarding and training.
Additionally, with Gemini in Workspace apps and the Gemini app having achieved FedRAMP High authorization, federal agencies can get more done with AI without additional costs for AI add-ons or subscriptions. Workspace with Gemini dramatically accelerates the creation and sharing of emails, documents, and even transcribed meeting notes. Users can save an average of 105 minutes per week generating text, summarizing content and automating tasks. Critically, 75% of daily Gemini users say it also improves the quality of their work.
Increasing efficiency
Workspace has long been a trusted partner to federal agencies, enabling efficiency through ease of collaboration and communication. Since 2021, Workspace has helped the Air Force Research Laboratory create “a flexible, synergistic enterprise that capitalizes on the seamless integration of data and information through the use of modern methods, digital processes and tools and IT infrastructure.”
We are committed to ensuring the public sector can benefit from Google’s latest AI, innovations, and technologies, freed from redundancy or vendor lock-in.
Learn more about how Google Workspace can help your agency accelerate mission impact. Register now for digital access to Google Cloud Next ’25 to watch keynotes and explore sessions on-demand.
Hello from Google Cloud Next 25 in Las Vegas! This year, it’s all about how AI can reimagine work and improve our lives — even bringing Hollywood classics like The Wizard of Oz to life on one of the biggest screens in the world.
To start, Google Cloud CEO Thomas Kurian celebrated the incredible momentum we’ve seen over the past year, with over 4 million developers already building with Gemini and a 20x surge in Vertex AI usage. He also took a moment to acknowledge where this power comes from: our global infrastructure, which has grown to 42 regions in 200 countries, connected by more than 2 million miles of fiber. This network moves data at “Google speed” — near-zero latency — for billions of users worldwide.
Then, Google and Alphabet CEO Sundar Pichai took the stage and announced our seventh-generation TPU — Ironwood — is coming later this year. Compared to our first publicly available TPU, Ironwood achieves 3600 times better performance. In the same period, our TPUs have become 29 times more energy efficient.
As Sundar said, we’re investing in the full stack of AI innovation — from infrastructure, to research, to models, to platforms. “One year ago, we stood here and talked about the future of AI for organizations. Today, that future is being built by all of us,” he said. Put simply: the opportunity with AI is as big as it gets. In fact, we’ve counted more than 600 use cases from massive brands, forward-thinking entrepreneurs, and major institutions all over the world, including Intuit, Verizon, and many more.
Here’s a taste of all the things that we announced today, across infrastructure, research and models, Vertex AI, and agents.
AI infrastructure
Demand for AI compute is growing — fast. To quote Amin Vahdat, VP & GM of AI, Systems, and Cloud AI: “For over 8 years, it has increased by over 10 times year over year – a factor of 100,000,000 in just 8 years!” Now, we’re bringing this AI infrastructure to you with innovations across our AI Hypercomputer stack.
What we announced:
Ironwood TPUs: Our seventh-generation TPU, Ironwood represents our largest and most powerful TPU to date, a more than 10x improvement from our most recent high-performance TPU. Featuring more than 9,000 chips per pod, Ironwood delivers a staggering 42.5 exaflops of compute per pod, meeting the exponentially growing demands of the most sophisticated thinking models, like Gemini 2.5. Read more here.
Gemini on Google Distributed Cloud: No matter where you are, Gemini can now run locally on GDC in both air-gapped and connected environments, with support for NVIDIA Confidential Computing on DGX B200 and HGX B200 Blackwell systems, and with Dell as a key partner. Get the details.
Research and models
It feels like a long time ago, but we first released Gemini just last year. As our first natively multimodal model, Gemini delivered the first 2-million-token context window and led in price performance with our Flash models. Then, we launched Gemini 2.5 Pro, which hit #1 on Chatbot Arena!
What we announced:
Gemini 2.5 Flash — our workhorse model optimized specifically for low latency and cost efficiency — is coming soon to Vertex AI, AI Studio, and the Gemini app.
Imagen 3, our highest-quality text-to-image model, now has improved image generation and inpainting capabilities for reconstructing missing or damaged portions of an image. Imagen delivers unmatched prompt adherence, bringing customers’ creative visions to life with incredible precision, and is ranked #1 on LMArena.
Chirp 3, our groundbreaking audio generation model, now includes a new way to create custom voices with just 10 seconds of audio input, enabling enterprises to personalize call centers, develop accessible content, and establish unique brand voices—all while maintaining a consistent brand identity.
Lyria, the industry’s first enterprise-ready, text-to-music model, can transform simple text prompts into 30-second music clips, opening up new avenues for creative expression.
Veo 2, our industry-leading video generation model, is expanding with new features that help organizations create videos, edit them, and add visual effects, transforming Veo on Vertex AI from a generation tool to a comprehensive video creation and editing platform.
Google Workspace
Help me analyze in Google Sheets, which guides you through your data to complete expert-level analysis
Audio overviews in Google Docs, where you can interact with Docs in an entirely new way by creating high-quality audio versions of your content
Google Workspace Flows, to help you automate time-consuming, repetitive tasks and make decisions with more context
Vertex AI
Soon, every enterprise will rely on multi-agent systems — multiple AI agents working together — even when they’re built on different frameworks or providers. Vertex AI — our comprehensive platform for orchestrating the three pillars of production AI: models, data, and agents — brings these elements together seamlessly. As Thomas said on stage, tens of thousands of enterprises are already building with Vertex AI and Gemini. In the last year alone, we’ve seen over 40x growth in Gemini use on Vertex AI, which now handles billions of API calls per month. So what’s new and better with Vertex AI?
What we announced:
Meta’s Llama 4 is generally available on Vertex AI.
Vertex AI Dashboards: These help you monitor usage, throughput, latency, and troubleshoot errors, providing you with greater visibility and control.
Live API: To enable truly conversational interactions, Live API offers streaming audio and video directly into Gemini.
Agents
Now, let’s get to what everyone is talking about — agents. How to build them, where to build them, and how to scale them. Today we put AI agents in the hands of every employee with Google Agentspace. Employees can now find and synthesize information from within their organization, converse with AI agents, and take action with their enterprise applications. We also expanded Vertex AI to enable multi-agent ecosystems — we’re far beyond single-agent capabilities now.
Agent Development Kit (ADK): This new open-source framework makes it easy to build multi-agent systems — with ADK, you can build an AI agent in under 100 lines of intuitive code, as shown in the sketch after this list.
Agent Garden: This collection of ready-to-use samples and tools is directly accessible in ADK. Agent Garden allows you to connect your agents to 100+ pre-built connectors, your custom APIs, integration workflows, or data stored within cloud systems like BigQuery and AlloyDB.
Interoperability: With Vertex AI, you can manage agents built on multiple agent frameworks, including LangGraph and CrewAI.
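Curious what “under 100 lines” looks like in practice? Here is a minimal Python sketch of a two-agent ADK setup, assuming the open-source google-adk package; the agent names, the toy tool, and the model ID are illustrative, not from the announcement.

from google.adk.agents import Agent

def get_ticket_status(ticket_id: str) -> dict:
    """Toy tool: look up a support ticket (hypothetical helper)."""
    return {"ticket_id": ticket_id, "status": "open"}

# Specialist agent that answers ticket questions using the tool above.
ticket_agent = Agent(
    name="ticket_agent",
    model="gemini-2.0-flash",
    instruction="Answer questions about support tickets using your tool.",
    tools=[get_ticket_status],
)

# Coordinating root agent that greets users and delegates ticket
# questions to the specialist: the multi-agent composition ADK targets.
root_agent = Agent(
    name="helpdesk",
    model="gemini-2.0-flash",
    instruction="Greet the user and route ticket questions to ticket_agent.",
    sub_agents=[ticket_agent],
)

The root agent delegates to the specialist, and the same composition pattern scales up to the larger multi-agent ecosystems described above.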
Agents for all
We also announced specialized agents for several of our key constituencies: developers, operators, data scientists and data analysts, as well as several verticals.
What we announced:
Customer Engagement Suite: The next generation of this CRM solution provides out-of-the-box functionality to build agents across web, mobile, call center, in-store and with third-party telephony and CRM systems.
Gemini Code Assist: To boost developer productivity, we now offer agents that can translate natural language requests into multi-step, multi-file solutions; new tools that make it easy to connect Code Assist to external services, third-party partners, or even other agents; and support for Gemini 2.5 and its enhanced coding capabilities.
Gemini Code Assist tools: These are prebuilt connections accessible within Gemini Code Assist’s chat that help you access information from Google apps and industry-leading tools from partners including Atlassian, Sentry, Snyk, and more.
Gemini Cloud Assist: Operators can lean on Gemini for help across a variety of IT tasks, from application design and operations, to troubleshooting, to cost optimization.
Data agents: From new data engineering agent capabilities, to a data science agent embedded within Google’s Colab notebook, to conversational analytics in Looker that lets business users chat with their data, AI is making it easy to ask questions of your data — and get easy answers!
And that’s just the tip of the iceberg! There were also spotlights, hundreds of breakout sessions, hands-on labs, and thousands of show floor conversations. We can’t wait to see you again tomorrow, when we’ll share even more news, go deep on today’s announcements, and host the perennial favorite — the Developer Keynote. Have fun in Vegas tonight. But don’t stay out too late, because there’s lots more ahead tomorrow!
Anthropic’s Claude 3.7 Sonnet hybrid reasoning model, their most intelligent model to date, is now available through cross-region inference on Bedrock in Europe (Ireland), Europe (Paris), Europe (Frankfurt), and Europe (Stockholm). Claude 3.7 Sonnet represents a significant advancement in AI capabilities, offering both quick responses and extended, step-by-step thinking made visible to the user. This new model includes strong improvements in coding and brings enhanced performance across various tasks, like instruction following, math, and physics.
Claude 3.7 Sonnet introduces a unique approach to AI reasoning by integrating it seamlessly with other capabilities. Unlike traditional models that separate quick responses from those requiring deeper thought, Claude 3.7 Sonnet allows users to toggle between standard and extended thinking modes. In standard mode, it functions as an upgraded version of Claude 3.5 Sonnet. In extended thinking mode, it employs self-reflection to achieve improved results across a wide range of tasks. Amazon Bedrock customers can adjust how long the model thinks, offering a flexible trade-off between speed and answer quality. Additionally, users can control the reasoning budget by specifying a token limit, enabling more precise cost management.
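As a concrete illustration, here is a minimal Python sketch of enabling extended thinking and capping the reasoning budget through the Bedrock Runtime API. The EU cross-region inference profile ID and the budget value are assumptions based on the announcement; check the Bedrock documentation for the exact identifiers in your Region.

import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="eu-west-1")

body = {
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 2048,
    # Extended thinking mode with a cap on reasoning tokens; omit the
    # "thinking" field entirely to get standard (quick-response) mode.
    "thinking": {"type": "enabled", "budget_tokens": 1024},
    "messages": [
        {"role": "user", "content": "How many prime numbers are below 100?"}
    ],
}

response = bedrock.invoke_model(
    # Assumed EU cross-region inference profile ID for Claude 3.7 Sonnet.
    modelId="eu.anthropic.claude-3-7-sonnet-20250219-v1:0",
    body=json.dumps(body),
)
print(json.loads(response["body"].read()))

Note that max_tokens must exceed budget_tokens, since reasoning tokens count against the overall response limit.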
Claude 3.7 Sonnet is also available on Amazon Bedrock in the US East (N. Virginia), US East (Ohio), and US West (Oregon) regions. To get started, visit the Amazon Bedrock console. Integrate it into your applications using the Amazon Bedrock API or SDK. For more information, see the AWS News Blog and Claude in Amazon Bedrock.
We are excited to announce that Amazon SageMaker Studio now supports recovery mode, enabling users to regain access to their JupyterLab and Code Editor applications when configuration issues prevent normal startup.
Starting today, when users encounter application startup failures due to issues such as a corrupted Conda configuration or insufficient storage space, they can launch their application in recovery mode from the Studio UI or using the AWS CLI. When configuration issues occur, users see a warning banner with the recommended solution and can choose to run their space in recovery mode. This simplified environment provides access to essential features like the terminal and file explorer, allowing users to diagnose and fix configuration issues without administrator intervention. This gives users an important self-service mechanism that helps minimize workspace downtime.
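For teams that script their Studio administration, the flow might look like the following boto3 sketch. The RecoveryMode parameter is our reading of the CLI path described above, and the identifiers are placeholders; verify the exact parameter name in the SageMaker API reference.

import boto3

sm = boto3.client("sagemaker")

# Relaunch a space's JupyterLab app with the minimal recovery
# environment (terminal and file explorer) so the user can repair a
# broken Conda config or free up storage, then restart normally.
sm.create_app(
    DomainId="d-exampledomain",
    SpaceName="my-space",
    AppType="JupyterLab",
    AppName="default",
    RecoveryMode=True,  # assumed parameter, per the launch description
)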
This feature is available in all AWS Regions where Amazon SageMaker Studio is currently available, excluding China Regions and GovCloud (US) Regions. To learn more, visit our documentation.
Amazon Relational Database Service (Amazon RDS) for Oracle now supports R6id and M6id instances. These instances offer up to 7.6 TB of NVMe-based local storage, making them well-suited for database workloads that require access to large amounts of intermediate data beyond the instance’s memory capacity. Customers can configure their Oracle database to use the local storage for temporary tablespace and Database Smart Flash Cache.
Operations such as sorts, hash joins, and aggregations can generate large amounts of intermediate data that doesn’t fit in memory and is stored in temporary tablespace. With R6id and M6id, customers can place temporary tablespaces on the local storage instead of the Amazon EBS volume attached to their instance to reduce latency, improve throughput, and lower provisioned IOPS.
Customers with an Oracle Enterprise Edition license can configure Database Smart Flash Cache to use the local storage. When configured, Smart Flash Cache uses the local storage to keep frequently accessed data that doesn’t fit in memory, improving the read performance of the database.
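As a sketch of what the temporary-tablespace setup could look like, the snippet below calls the rdsadmin package from Python. The procedure name follows our reading of the RDS for Oracle instance-store documentation and should be treated as an assumption; the endpoint and credentials are placeholders.

import oracledb

# Connect as the master user (placeholder endpoint and credentials).
conn = oracledb.connect(
    user="admin",
    password="example-password",
    dsn="mydb.abc123.us-east-1.rds.amazonaws.com:1521/ORCL",
)
with conn.cursor() as cur:
    # Add a tempfile for the TEMP tablespace on the instance's local
    # NVMe storage instead of the attached EBS volume.
    cur.execute("""
        BEGIN
          rdsadmin.rdsadmin_util.add_inst_store_tempfile(
            p_tablespace_name => 'TEMP');
        END;""")
conn.close()

Database Smart Flash Cache, by contrast, is typically enabled through the db_flash_cache_size parameter in the instance’s DB parameter group rather than through SQL.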
You can launch the new instances in the Amazon RDS Management Console or by using the AWS CLI. Refer to Amazon RDS for Oracle Pricing for available instance configurations, pricing details, and region availability.
Amazon Aurora PostgreSQL-Compatible Edition now supports PostgreSQL versions 16.8, 15.12, 14.17, and 13.20. Please note, this release supports the versions released by the PostgreSQL community on February 20, 2025, which replace the previous February 13, 2025 release. These releases contain product improvements and bug fixes made by the PostgreSQL community, along with Aurora-specific security and feature improvements such as dynamic resizing of the allocated space for Optimized Reads-enabled temporary objects on Aurora I/O-Optimized clusters, and new features for Babelfish. For more details, please refer to the release notes.
These releases are now available in all commercial AWS regions and AWS GovCloud (US) Regions, except China regions. You can initiate a minor version upgrade by modifying your DB cluster. Please review the Aurora documentation to learn more. For a full feature parity list across regions, head to our feature parity page.
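If you manage clusters programmatically, the upgrade described above is a single ModifyDBCluster call; here is a minimal boto3 sketch with a placeholder cluster identifier.

import boto3

rds = boto3.client("rds")

# Move the cluster to one of the new minor versions. ApplyImmediately
# starts the upgrade now instead of waiting for the maintenance window.
rds.modify_db_cluster(
    DBClusterIdentifier="my-aurora-postgres-cluster",
    EngineVersion="16.8",
    ApplyImmediately=True,
)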
Amazon Aurora is designed for unparalleled high performance and availability at global scale with full MySQL and PostgreSQL compatibility. It provides built-in security, continuous backups, serverless compute, up to 15 read replicas, automated multi-Region replication, and integrations with other AWS services. To get started with Amazon Aurora, take a look at our getting started page.