Cloud

2025 12 10

GCP – Announcing Model Context Protocol (MCP) support for Google services

With the recent launch of Gemini 3, we have the state-of-the-art reasoning to help you learn, build, and plan anything. But for AI to truly be an “agent”, to pursue goals and solve real-world problems on behalf of users, it needs more than just intelligence; it needs to reliably work with tools and data.

Anthropic’s Model Context Protocol (MCP), often likened to a “USB-C for AI”, has quickly become a common standard to connect AI models with data and tools. MCP enables AI applications to execute the complex multi-step tasks it takes to solve real world problems. However, implementing Google’s existing community-built servers often requires developers to identify, install, and manage individual, local MCP servers or deploy open-source solutions–placing the burden on developers, and often leading to fragile implementations.

Today we’re announcing the release of fully-managed, remote MCP servers. Google’s existing API infrastructure is now enhanced to support MCP, providing a unified layer across all Google and Google Cloud services. Developers can now simply point their AI agents or standard MCP clients like Gemini CLI and AI Studio to a globally-consistent and enterprise-ready endpoint for Google and Google Cloud services.

Crucially, we are extending this capability to your broader enterprise stack through Apigee, allowing you to leverage the purpose-built APIs your organization uses for specific data flows and business logic. Customers can now expose and govern their own developer-built APIs, as well as external third-party APIs, as discoverable tools for agents. Read more about Apigee’s announcement here.

We are incrementally releasing MCP support for all our services, starting with:

1. Google Maps: Grounding AI in the real world

Maps Grounding Lite, available through Google Maps Platform, connects AI agents to trusted geospatial data, offering access to fresh information on places, weather forecasts, and routing details such as distance and travel time. This allows developers to build agents that can accurately answer real-world location and travel queries without hallucinating. For example, an AI assistant can use Grounding Lite to respond to queries such as, “How far is the nearest park from this rental?”, “What should I pack for the weather in Los Angeles this weekend?”, or “Could you recommend kid-friendly restaurants near our hotel?”

2. BigQuery: Reasoning over enterprise data

The BigQuery MCP server enables agents to natively interpret schemas and execute queries against enterprise data without the security risks or latency of moving data into context windows. It provides direct access to BigQuery features like forecasting while ensuring data remains in-place and governed.

3. Google Compute Engine (GCE): Autonomous infrastructure management

By exposing capabilities like provisioning and resizing as discoverable tools, this server empowers agents to autonomously manage infrastructure workflows. Agents can handle everything from initial builds to day-2 operations, such as dynamically adapting to workload demands.

4. Google Kubernetes Engine (GKE): Autonomous container operations

The GKE MCP server exposes a structured, discoverable interface that allows agents to interact reliably with both GKE and Kubernetes APIs, eliminating the need to parse brittle text output or string-together complex CLI commands. This unified surface allows agents, operating autonomously or with human-in-the-loop guardrails, to diagnose issues, remediate failures, and optimize costs.

Built-in security and observability

We are bringing order to this ecosystem with a unified approach to discovery and governance. With the new Cloud API Registry and Apigee API Hub, developers can find trusted MCP tools from Google and their own organizations, respectively. We pair this ease of discovery with rigorous control: administrators can manage access via Google Cloud IAM, rely on audit logging for observability, and utilize Google Cloud Model Armor to defend against advanced agentic threats such as indirect prompt injection.

“Google’s support for MCP across such a diverse range of products, combined with their close collaboration on the specification, will help more developers build agentic AI applications. As adoption grows among leading platforms, it brings us closer to agentic AI that works seamlessly across the tools and services people already use.”– David Soria Parra, Co-creator of MCP & Member of Technical Staff, Anthropic

Let’s see an example of these new MCP servers in action:

Imagine an agent that will help identify an ideal location for retail. Using Agent Development Kit (ADK), you can build a natural-language agent backed by Gemini 3 Pro, that connects to BigQuery to forecast revenue based sales data, while simultaneously, the agent cross-references Google Maps to scout for complementary businesses and validate delivery routes, all via standard, managed MCP servers.

To truly unlock the potential of agentic AI, your agents need access to your entire application stack, from containers to your relational databases. In the next few months, we will be rolling out MCP support for additional services including:

Projects, Compute, and Storage: Cloud Run, Cloud Storage, Cloud Resource Manager
Databases and Analytics: AlloyDB, Cloud SQL, Spanner, Looker, Pub/Sub, Dataplex Universal Catalog
Security: Google Security Operations (SecOps)
Cloud operations: Cloud Logging, Cloud Monitoring
Google services: Developer Knowledge API, Android Management API
And many more

The key to the agentic future

With these new and extended MCP capabilities, we are ensuring developers and agents can easily interact with data and take actions too. Google is committed to leading the AI revolution not just by building the best models, but also by building the best ecosystem for those models and agents to thrive. As a founding member of Agentic AI Foundation, we will continue to contribute to the evolution of MCP through the open source community. By giving agents the best method to connect to the world, we are freeing developers to focus on what’s next.

Learn how to get started with the MCP servers: see our MCP docs.
Our demo and its complete code is shared here.

Read More for the details.

2025 12 10

GCP – Announcing MCP support in Apigee: Turn existing APIs into secure and governed agentic tools

Tibor Kiss Cloud, Google Cloud gcp

Today, we expanded Google’s support for Model Context Protocol (MCP) with the release of fully-managed, remote MCP servers, giving developers worldwide consistent and enterprise-ready access to Google and Google Cloud services. This includes support for MCP in Apigee, which makes it possible for agents to use your secure, governed APIs and custom workflows cataloged in Apigee API hub, as tools to complete tasks for end users.

With Apigee’s support for MCP, you don’t need to make any changes to your existing APIs, write any code, or deploy and manage any local or remote MCP servers. Apigee uses your existing API specifications and manages the underlying infrastructure and transcoding, so that you can focus on the business logic for your agents.

Overview of Apigee’s MCP support

Apigee provides 30+ built-in policies for authorization, authentication, security, and governance controls to ensure that API interactions are consistently protected. Apigee’s debugging UI and analytics capabilities provide end-to-end visibility over those interactions and monitoring and alerting for traffic and performance issues.

With Apigee’s support for MCP, you can turn your existing APIs into MCP tools, governed by the same set of policies and with full visibility over agentic interactions. You can do this by creating an “MCP proxy” in an environment group, specifying /mcp as the basepath and mcp.apigeex.com as the target URL, and including an OpenAPI specification. Once the MCP proxy is deployed, it will be registered in Apigee API hub as an “MCP” API.

Deployed MCP proxies are automatically registered in Apigee API hub

When a tools/list or tools/call request is made to the MCP endpoint, Apigee uses the operations documented in the OpenAPI spec as the MCP tools list. You can then bundle the MCP proxy in an API product, and apply granular quota and identity and access policies to ensure that only authorized MCP clients, agents, and developers can list and call those tools.

With this process you can, for example, designate specific API operations as MCP tools. You can then specify that an MCP tool for your “Payments” service can only be accessed by designated agents with known client identities and a legitimate need to use the tool.

You can then use Apigee Analytics to monitor MCP tool usage. And, with the recent launch of Apigee API insights, you can also use the new “Insights” tab in Apigee API hub’s catalog to view traffic and performance metrics for your MCP endpoints.

Benefits of Apigee’s approach to MCP support

Our main goal with MCP support in Apigee is to make sure that you can secure, govern, and monitor usage of MCP tools with the same policies and workflows in Apigee that you’re already familiar with.

What this means for you:

No added operational burden: You don’t need to set up and manage an MCP server for each of your APIs; just deploy an MCP proxy, and Apigee will take care of the rest. Apigee fully manages the MCP servers, transcoding, and protocol handling.
Tool observability and governance: Apigee’s built-in identity, authorization, and security policies can also be used to secure and govern your MCP endpoints and tools, and you can use Apigee analytics to monitor tool usage by MCP clients.
Comprehensive tool security: Apigee helps make sure that all agentic interactions are secure. For example, you can use Cloud Data Loss Prevention to classify and protect sensitive data passed from your tools, and use Model Armor to guard against prompt injection and jailbreaking attempts. You can make sure agents and users have the proper IAM permissions to invoke MCP tools, and view and fully debug the entire end-to-end flow of agentic interactions. You can also use Apigee Advanced API Security to keep your tools secure.
Centralized tool catalog: After you deploy an MCP proxy, Apigee automatically registers your MCP endpoint in Apigee API hub, along with your spec. This allows you to maintain a searchable, centralized tool catalog and promote tool reuse.

Using Apigee MCP tools with agent frameworks

Apigee’s MCP support is designed for maximum compatibility. Your secured Apigee MCP endpoints are usable with agents built using a variety of frameworks, including ADK, LangGraph, and other popular solutions across the AI ecosystem.

However, developers choosing Agent Development Kit (ADK) have an exclusive, streamlined advantage when developing agents within the Google ecosystem.

ADK is a flexible and modular framework for developing and deploying AI agents. While optimized for Gemini and the Google ecosystem, ADK is model-agnostic, deployment-agnostic, and is built for compatibility with other frameworks. ADK was designed to make agent development feel more like software development, to make it easier for developers to create, deploy, and orchestrate agentic architectures that range from simple tasks to complex workflows.

ADK includes a toolset for both Apigee and Application Integration, so that developers building custom agents with ADK can easily connect those agents to your MCP endpoints and tools that are governed and secured with Apigee. You can also use the ApigeeLLM wrapper for ADK to expose your LLM endpoint through an Apigee proxy, integrating governance into your agentic workflows. (Note: The ApigeeLLM wrapper is currently designed for use with Vertex AI and the Gemini API in Google AI Studio, but we’re planning to support other models and interfaces.)

Google Cloud also provides services for deploying custom agents. You can use Vertex AI Agent Engine to deploy your agents, and then put them in action across your organization using Gemini Enterprise.

Next steps

MCP support in Apigee is currently in preview use with customers. Please contact your Apigee or Google Cloud account team to access this feature.

Read More for the details.

2025 12 10

AWS – Amazon ElastiCache Serverless now supports same-slot WATCH command

Tibor Kiss AWS, Cloud AWS

Today, we are announcing that Amazon ElastiCache Serverless now supports the WATCH command for same-slot transactions, helping developers build more reliable applications with improved data consistency in high-concurrency scenarios. With this launch, the WATCH command makes transactions conditional, ensuring they execute only when monitored keys remain unchanged.

For ElastiCache Serverless, the WATCH command works with transactions that operate on keys within the same hash slot as the watched keys. When applications attempt to watch keys that are not in the same hash slot, they’ll receive a CROSSSLOT error. Developers can control key placement by using hash tags in their key names to ensure keys hash to the same slot. The transaction will also be aborted when ElastiCache Serverless cannot guarantee the state of watched keys.

WATCH command support is available in all AWS regions where ElastiCache Serverless is supported at no additional cost. To get started, create transactions using the WATCH command through your preferred client library. To learn more about conditional transactions and the WATCH command, see the ElastiCache Serverless documentation, and the Valkey transactions documentation.

Read More for the details.

2025 12 10

AWS – Amazon CloudWatch SDK supports optimized JSON, CBOR protocols

Tibor Kiss AWS, Cloud AWS

Amazon CloudWatch announces support for both the JSON and Concise Binary Object Representation (CBOR) protocols in the CloudWatch SDK, enabling lower latency and improved performance for CloudWatch customers. The SDK will automatically use JSON or CBOR as its new default communication protocol, offering customers a lower end-to-end processing latency as well as reduced payload sizes, application client side CPU, and memory usage.

Customers use the CloudWatch SDK either directly or through Infrastructure as Code solutions to manage their monitoring resources. Reducing control plane operations latency and payload size helps customer optimize their operational maintenance and resources usage and costs. JSON and the CBOR data formats are standards designed to enable better performance over the traditional AWS Query protocol.

The CloudWatch SDK for JSON and CBOR protocols support is available in all AWS Regions where Amazon CloudWatch is available and for all generally available AWS SDK language variants.

To leverage the performance improvements, customers can install the latest SDK version here. To learn more about the AWS SDK, see Amazon Developer tools.

Read More for the details.

2025 12 10

GCP – Agent Factory Recap: Building with Gemini 3, AI Studio, Antigravity, and Nano Banana

Tibor Kiss Cloud, Google Cloud gcp

Welcome back to The Agent Factory! This week, we went beyond the hype to dissect the technical details of Google’s massive wave of AI releases. We were joined by Paige Bailey, the UTL for Developer Relations at DeepMind, to break down everything from the new Gemini 3 model to the Antigravity IDE.

Google has been shipping at a breakneck pace—literally a new model or feature nearly every day—and this episode is all about how developers can harness these tools right now.

This post guides you through the key ideas from our conversation. Use it to quickly recap topics or dive deeper into specific segments with links and timestamps.

The Tech Stack – What is it?

We tossed around a few new names in this episode. Here is a quick primer on the tech discussed:

Gemini 3: The latest iteration of Google’s model family. While Gemini 1 was about understanding and Gemini 2 was about reasoning, Gemini 3 is designed for acting and coding. It features improved tool use and function calling.
Antigravity: Google’s new AI-native IDE (Integrated Development Environment) designed to integrate Gemini 3 directly into the coding workflow, allowing for multimodal inputs like screenshots to drive code changes.
Nano Banana Pro: The newest iteration in the media generation series, capable of creating high-fidelity images, voxel art, and game assets.

The Factory Floor

The Factory Floor is our segment for getting hands-on. Here, we moved from high-level concepts to practical code with live demos.

Building “Nordic Shield” with Gemini 3

Timestamp: 11:20

Paige demonstrated the “Build” feature in AI Studio to create a complex React application from scratch. The goal was to test the model’s ability to self-correct and handle specific design constraints.

The Prompt: Create an insurance cataloging app using the webcam and microphone. It needed a “Nordic/IKEA” design theme, an inventory list, and the ability to estimate item value using Google Search grounding.
The Process: Gemini 3 generated a React Native app, set up the directory structure, and wrote its own prompts for the agents.
The Result: The app, named “Nordic Shield,” successfully cataloged items (like a Pixel 7 and a soda can) via video. When it encountered audio issues, it generated a reasoning trace to debug the problem live. It successfully utilized Gemini Live for the conversation and executed a secondary “agentic” step to search Google for the estimated value of the items.

Redesigning a Website with Antigravity

Timestamp: 30:27

We shifted gears to look at Google’s new IDE, Antigravity. The goal was to update an existing, text-heavy website to match a new, vibrant “neo-brutalist” design aesthetic using only screenshots as a guide.

The Input: The existing codebase plus two screenshots of the desired visual style (doodly, pastel, notebook-esque).
The Implementation: Antigravity analyzed the images to understand the design philosophy. It created a task list and an implementation plan to ensure it stayed grounded.
The Outcome: The IDE successfully refactored the site to match the brand guidelines, introducing “jiggling pill” UI elements and updating the color palette to match the provided screenshots perfectly.

Paige Bailey on The Evolution of Gemini

We sat down with Paige to understand how DeepMind is approaching the rapid evolution of their models and what it means for developers building agents today.

The Three Stages of Gemini

Timestamp: 2:49

Paige outlined the clear evolutionary path of the Gemini family. She explained that the original Gemini was focused on multimodal understanding (video, text, audio). Gemini 2 introduced thinking—the ability to reason and plan step-by-step. Gemini 3, the current iteration, is all about acting. This model is optimized for acting on its reasoning, specifically through coding and tool use, allowing for composite architectures where models work together rather than in isolation.

Pre-Training vs. Post-Training

Timestamp: 4:55

We discussed the “schooling” of these models. Paige used a great analogy:

Pre-training is like sending the model to school. It involves giving Gemini access to massive amounts of tokens (internet data, synthetic data, video game footage) to learn the basics.
Post-training is “on-the-job experience.” This is where DeepMind provides specific, hand-curated examples of complex workflows, such as multi-turn conversations that involve editing websites or using multiple tools to accomplish a single task.

The “Vending Bench”

Timestamp: 6:48

Benchmarks are changing. Paige introduced us to a fascinating new evaluation metric called Vending Bench. This test gauges a model’s ability to run a passive business—specifically, a vending machine. The model must figure out stock, reorder items, deploy restockers, and do long-range planning to maximize uptime. The score is determined by how much profit the model generates in a year. Currently, Gemini 3 Pro is generating around $5,462 per machine, showing significant improvements in long-term strategic decision-making.

Creative Multimodality with Nano Banana

Timestamp: 28:34

We also touched on the creative side of the stack. Paige highlighted that when you combine reasoning with multimodal outputs, the possibilities explode. She shared examples of Nano Banana Pro being used to generate game assets, orthographic blueprints for 3D modeling (like castles), and detailed physics explainers. The key takeaway is the power of combining these media models with search grounding to create accurate, high-fidelity visual assets.

Conclusion

It is incredible to see not just the models, but the entire ecosystem Google is building—from the hardware to the IDEs like Antigravity. The ability to deploy these agents directly to Google Cloud with a single click bridges the gap between a cool demo and a production-ready application.

As Paige mentioned, the trajectory is exponential. Whether you are building passive businesses or complex coding agents, the tools are ready.

Your turn to build

If you haven’t yet, head over to AI Studio or try out the Gemini API.

Try the “Vending Bench” challenge yourself—can you build an agent that runs a better business than Gemini 3?

Let us know what you build!

Connect with us

Amit Maraj → LinkedIn, X, TikTok
Paige Bailey → LinkedIn, X

Read More for the details.

2025 12 10

AWS – AWS Support Center Console now supports screen sharing for troubleshooting support cases

Tibor Kiss AWS, Cloud AWS

Today, AWS announces that AWS Support Center Console now support screen sharing for troubleshooting support cases. With this new feature, you can request a virtual meeting while in an active chat or call, join support calls with one click through a meeting bridge link. With the new virtual meetings, you will be able to share your screen during the meeting and maintain seamless access to case details for efficient troubleshooting. This enhancement simplifies your support experience by keeping all support interactions within the AWS Support Center console.

To learn more visit the AWS Support page.

Read More for the details.

2025 12 10

AWS – Amazon Braket now supports Qiskit 2.0

Tibor Kiss AWS, Cloud AWS

Amazon Braket now supports Qiskit 2.0, enabling quantum developers to use the latest version of the most popular quantum software framework with native primitives and client-side compilation capabilities.

With this release, Braket provides native implementations of Qiskit’s Sampler and Estimator primitives that leverage Braket’s program sets for optimized batching, reducing execution time and costs compared to generic wrapper approaches. The native primitives handle parameter sweeps and observable measurements service-side, eliminating the need for customers to implement this logic manually. Additionally, the bidirectional circuit conversion capability enables customers to use Qiskit’s extensive compilation framework for client-side transpilation before submitting to Braket devices, providing the control and reproducibility that enterprise users and researchers require for device characterization experiments and custom compilation passes.

Qiskit 2.0 support is available in all AWS Regions where Amazon Braket is available. To get started, see the Qiskit-Braket provider documentation and the Amazon Braket Developer Guide.

Read More for the details.

2025 12 09

GCP – How Virgin Media O2 uses data contracts to enable trusted data and scalable AI products

Tibor Kiss Cloud, Google Cloud gcp

As organizations scale their data and AI capabilities, many are adopting federated data architectures to empower domain teams, accelerate innovation, and foster ownership. This decentralization is essential for building AI products that are adaptable and data-driven — but it also introduces new challenges: maintaining trust, ensuring data quality, and enforcing governance across distributed teams and systems.

At Virgin Media O2 (VMO2), in collaboration with Google Cloud, we’ve developed a robust and scalable approach to address these challenges: data contracts. These contracts serve as the data quality and assurance layer for our data products, ensuring that every dataset we publish is reliable, documented, and ready for consumption. Defined at the asset level, such as individual BigQuery tables or Google Cloud Storage buckets, data contracts are redefining how we manage and share data, enabling the creation of trusted and scalable AI products across our data mesh.

A data contract acts as a formal, machine-readable agreement between a data producer and its users. It serves as an explicit interface, defining the data’s expected characteristics, including its schema, semantics, data quality metrics, and Service Level Objectives (SLOs) like freshness and completeness. See below an example of a data contract that we construct.

The power of this approach lies in moving beyond static documentation. Because they are machine-readable, data contracts become living guarantees with continuous enforcement and real-time validation directly within data pipelines. This proactive monitoring allows teams to detect schema changes or SLA breaches early, transforming data quality from a reactive fix into a scalable, automated mechanism. By embedding product thinking, this methodology elevates data from a simple byproduct to a first-class data product, ensuring that its context and intent travel with it through the data lifecycle. This creates the trusted foundation essential for building reliable AI products at scale.

Practical implementation

To put these principles into practice, we designed a scalable platform on Google Cloud using a hub-and-spoke data contracts solution. This architecture balances centralized governance with federated ownership. A central “Hub” team provides the self-service data contract capabilities including cloud infrastructure, while departmental “Spoke” teams are empowered to own the contract and data quality for their data products.

This is all brought to life through a fully automated, GitOps-driven workflow. A data producer simply can define their data contract in a YAML file for different types of assets like BigQuery table or Google Cloud Storage bucket, and commit it to a GitLab repository. The data contract is then verified dynamically against customizable validation schemas. However, even after validation, the contract exists only as a static blueprint. This is where Dataplex Universal Catalog becomes the key, acting as the engine that transforms this static declaration into enforceable agreement on the actual data.

Dataplex Universal Catalog is an intelligent data fabric that unifies data management and governance, providing the scalable engine needed to operationalize our contracts. We leverage two core capabilities of Dataplex Universal Catalog to make this possible:

Dataplex auto data quality: This is the enforcement engine. Our CI/CD automation reads the SLOs from the YAML contract and provision Data Quality Scan jobs. These jobs use a combination of Dataplex’s powerful pre-defined rules for common checks like null value monitoring and schema change detection, as well as custom rules to enforce unique business logic. This “Data Governance as Code” approach ensures our quality standards are version-controlled, repeatable, and scalable.
Dataplex data profiling: To help teams write effective contracts, we use Dataplex to continuously scan and analyze data assets. This provides vital statistical metadata and insights into the data, such as null frequencies, value ranges, and data type distributions. This proactive data discovery helps producers set realistic quality thresholds and gives users a deeper understanding of the data they are using.

Once these rules and scans are defined in Dataplex, Cloud Composer then orchestrates their execution. It uses the static YAML contracts as a blueprint to dynamically generate the necessary DAGs, which can be further customized for each individual asset. The results are written to BigQuery, making the quality status of every data product transparent and actionable. To provide a unified view for central monitoring, BigQuery authorized views are used to aggregate data quality results and contract statuses from all departments without creating data copies. We also leverage Pub/Sub as an event bus to enable the central team to share department-specific data with respective Spokes.

The diagram below illustrates this workflow in practice, focusing on the core lifecycle of a contract while simplifying the broader hub-and-spoke architecture.

To maintain trust at scale, real-time observability and alerting are vital. Our platform provides dashboards that track contract compliance, while automated alerts flag schema drift, SLA violations, or quality anomalies. For data users, this transparency is critical. They reference the contract definitions to clearly understand the agreed-upon SLOs (such as freshness or completeness) and rely on the dashboards to verify that the data product is meeting those promises before integrating it into their workflows. These signals create powerful feedback loops between data producers and users, fostering faster resolution and closer collaboration.

This real-time visibility transforms data quality from a reactive activity into a proactive practice. As shown in our dashboards, teams get an immediate overview of platform health, data quality scores, and contract compliance.

Furthermore, they can drill down into specific alerts, giving them the context needed to treat data issues with the same urgency as application outages — a critical step in achieving operational excellence for AI and analytics.

Beyond quality: Compliance, governance, and data-product thinking

Beyond data quality, contracts play a crucial role in compliance and governance. By codifying privacy and regulatory requirements — such as GDPR, HIPAA, or PCI — directly within the contract, organizations can automate classification, access control, and auditability. This reduces the risk of non-compliance, especially in federated environments where manual oversight cannot scale.

Finally, product thinking ties everything together. Data contracts embody the mindset that data is a product, not a byproduct. They embed ownership, accountability, and discoverability into every stage of the lifecycle — empowering teams to deliver trusted, scalable, and resilient AI products.

A foundation for the future

By operationalizing trust through data contracts, we are fostering a culture of shared responsibility and data-first thinking. This federated model does more than simply fix pipelines; it builds the trusted foundation needed to scale next-generation AI. It ensures that the resilient AI tools empowering our teams are built on data that is reliable, consistent, and well-defined. As we innovate, our decisions are guided by trusted information. And while full realization takes time, the strategic impact is clear.

Learn more about Dataplex Universal Catalog to explore this use case.

_{The authors would like to thank and recognise the team for their contributions on this project: Eric Tyree, Director, Machine Learning Operations & Data Science at VMO2, Vinay Pai, Head of Data Architecture at VMO2, Shivang Bhargava, Senior Cloud Data Engineer at VMO2, Christopher Slattery, Data Engineer at VMO2, Rakesh Agrwal, Product Manager at VMO2, Sameer Zubair, Principal Platforms Tech lead at VMO2, Philip Adler Senior Software Engineer at VMO2, Carys Williams Data Scientist at VMO2, Li Wang Data Engineer at VMO2, Sobhan Afroosheh, Customer Engineer at Google, Janos Bana, Technical Solutions Consultant, Google.}

Read More for the details.

2025 12 09

GCP – From adoption to impact: Putting the DORA AI Capabilities Model to work

Tibor Kiss Cloud, Google Cloud gcp

The 2025 State of AI-assisted Software Development report revealed a critical truth: AI is an amplifier. It magnifies the strengths of high-performing organizations and the dysfunctions of struggling ones.

While AI adoption is now near-universal, with 90% of developers using it in their daily workflows, success is not guaranteed. Our cluster analysis of nearly 5,000 technology professionals reveals significant variation in team performance: Not everyone experiences the same outcomes from adopting AI.

From this disparity, we can conclude that how they are using AI is a critical factor. We wanted to understand the particular capabilities and conditions that enable teams to achieve positive outcomes, leading us to develop the DORA AI Capabilities Model report.

This companion guide to the 2025 DORA Report is designed to help you navigate our new reality. It provides actionable strategies, implementation tactics, and measurement frameworks to help technology leaders build an environment where AI thrives.

Seven capabilities that amplify success

Successfully using AI requires cultivating your technical and cultural environment. From the same set of respondents who participated in the 2025 DORA survey, we identified seven foundational capabilities that are proven to amplify the positive impact of AI on organizational performance:

Clear and communicated AI stance: Ambiguity creates risk. A clear policy provides the psychological safety developers need to experiment effectively.
Healthy data ecosystems: AI is only as good as the data it learns from. Investing in high-quality, accessible, and unified internal data significantly amplifies AI’s benefits.
AI-accessible internal data: This involves “context engineering,” moving beyond simple prompts to securely connect AI tools to your internal documentation and codebases.
Strong version control practices: As AI increases the volume and velocity of code generation, version control becomes your critical safety net. Frequent commits and robust rollback capabilities are essential for maintaining stability in an AI-assisted world.
Working in small batches: AI can easily generate massive blocks of code, which are hard to review and test. Enforcing the discipline of small batches counteracts this risk, ensuring that speed translates to product performance rather than instability.
User-centric focus: Speed is irrelevant if you are moving in the wrong direction. Adopting AI tools can actually harm teams that lack a user-centric focus. Keeping user needs as your North Star is essential for guiding AI-assisted development.
Quality internal platforms: A platform provides the automated, secure “paved roads” that allow AI benefits to scale across the organization. It prevents individual productivity gains from being lost to downstream bottlenecks.

Where to start: Assessing your team

Every organization starts their AI journey differently. To help you prioritize, this report introduces seven distinct team archetypes derived from our cluster analysis. These profiles range from “harmonious high-achievers,” who excel in both performance and well-being, to teams facing “foundational challenges” or those stuck in a “legacy bottleneck,” where unstable systems undermine morale.

Identifying the profile that best matches your team can help pinpoint the most impactful interventions. For example, a “high impact, low cadence” team might prioritize automation to improve stability, while a team “constrained by process” might focus on reducing friction through a better AI stance.

Digging deeper with Value Stream Mapping

Once you understand your team’s profile, how do you direct your efforts? The report includes a step-by-step facilitation guide for running a Value Stream Mapping (VSM) exercise.

VSM acts as an AI force multiplier. By visualizing your flow from idea to customer, you can identify where work waits and where friction exists. This ensures that the efficiency gains from AI aren’t just creating local optimizations that pile up work downstream, but are instead channeled into solving system-level constraints.

Get better at getting better

AI adoption is an organizational transformation. The greatest returns come not from the tools themselves, but from investing in the foundational systems that enable them.

Download the full report
Join the DORA community

Read More for the details.

2025 12 09

GCP – AlphaEvolve on Google Cloud: AI for agentic discovery and optimization

Tibor Kiss Cloud, Google Cloud gcp

Innovators in science and engineering face a common barrier: the search space for solving complex problems — like designing a new chip or discovering a drug molecule — is often too vast for standard brute-force methods to explore effectively.

To help you overcome this challenge, we are releasing AlphaEvolve, a Gemini-powered coding agent for designing advanced algorithms, to Google Cloud, in private preview..

The challenge for innovators in science

Many of the most challenging and potentially valuable problems in the world are related to optimization. You might be trying to minimize latency in a data center, maximize the stability of a protein, or find the most efficient route for a logistics fleet.

AlphaEvolve pairs the creative problem-solving capabilities of our Gemini models with automated evaluators that verify answers, along with an evolutionary framework to improve upon the most promising ideas.

It then tests these changes against a “ground truth” evaluator that you define. If the new code performs better, it becomes the parent for the next generation. This creates a feedback loop that allows the system to learn and improve over time, eventually discovering algorithms that are significantly more efficient than the ones you started with.

How it works:

Here is how Alphaevolve discovers and improves upon existing algorithms in more detail:

Input: You define a problem specification, evaluation logic (to measure how well a proposed solution works), and a seed initialization program. The seed is a compile-ready piece of code that is the algorithm that you want to optimize. To start, it solves the problem, even if sub-optimally.
Mutation: Gemini models (Flash for speed, Pro for depth) process the context and generate mutated, optimized versions of the code that are added to the “population space.”
Evolution: Evolution algorithms select which of the various code mutations from the population space to combine and further mutate to prioritize as the starting point for the next evolution of mutations.
Loop: The results from the Evaluation scores are then used by the ensemble of LLMs to generate the next set of improved solutions. The cycle repeats recursively, evolving the codebase from the initial seeds to state-of-the-art algorithms.

Proven impact at Google

At Google, we have already used this technology to tackle some of our own hardest engineering problems.

Data center efficiency: AlphaEvolve found a better way to schedule tasks in our data centers, continuously recovering on average 0.7% of our global compute resources.
Gemini training: AlphaEvolve sped up a vital kernel in Gemini’s architecture by 23%, leading to a 1% reduction in Gemini’s training time.
Hardware design: It accelerated the design of our next-generation TPUs by discovering more efficient arithmetic circuits.

To learn more about impact, read our paper.

How AlphaEvolve can help businesses across industries

You can apply this same engine to your own proprietary data and unique algorithmic challenges. Here are a few ways improved algorithms can potentially help different industries:

Biotech and pharma: Optimize the algorithms used for molecular simulation, which helps shorten the timelines for drug discovery and increases the success rate of new therapeutics.
Logistics and supply chain: Discover superior heuristics for routing and inventory management, helping you reduce fuel costs and build more resilient delivery networks.
Financial services: Evolve algorithmic risk models to manage complex portfolios more effectively.
Energy: Optimize load balancing on smart grids to improve stability and better integrate renewable energy sources.

Get started on Google Cloud

AlphaEvolve is made to help with complex optimization problems that you can define in code and objectively measure. The AlphaEvolve Service API is now available through an Early access program with Google Cloud. If you have one of these problems and are interested in participating in the Early Access Program, please reach out to your Google Cloud Representative.

Read More for the details.

2025 12 09

AWS – Amazon EC2 X8g instances now available in Europe (Stockholm) region

Tibor Kiss AWS, Cloud AWS

Starting today, Amazon Elastic Compute Cloud (Amazon EC2) X8g instances are available in Europe (Stockholm) region. These instances are powered by AWS Graviton4 processors, and they offer up to 3 TiB of total memory and increased memory per vCPU compared to other Graviton4-based instances. X8g instances are ideal for memory-intensive workloads, such as electronic design automation (EDA) workloads, in-memory databases (Redis, Memcached), relational databases (MySQL, PostgreSQL), real-time big data analytics, real-time caching servers, and memory-intensive containerized applications.

X8g instances offer larger instance sizes with up to 3x more vCPU (up to 48xlarge) and memory (up to 3TiB) than Graviton2-based X2gd instances. They offer up to 50 Gbps enhanced networking bandwidth and up to 40 Gbps of bandwidth to the Amazon Elastic Block Store (Amazon EBS). Elastic Fabric Adapter (EFA) networking support is offered on 24xlarge, 48xlarge, and bare metal sizes, and Elastic Network Adapter (ENA) Express support is available on instance sizes larger than 12xlarge.

X8g instances are currently available in the following AWS Regions: US East (N. Virginia, Ohio), US West (Oregon), and Europe (Frankfurt, Stockholm).

To learn more, see Amazon EC2 X8g Instances. To quickly migrate your workloads to Graviton-based instances, see AWS Graviton Fast Start program. To get started, see the AWS Management Console, AWS Command Line Interface (AWS CLI), and AWS SDKs.

Read More for the details.

2025 12 09

GCP – Nutanix NC2 is now officially supported on Google Cloud

Tibor Kiss Cloud, Google Cloud gcp

Today, we are thrilled to announce Nutanix Cloud Clusters (NC2) is generally available on Google Cloud.

NC2 on Google Cloud is designed to migrate and modernize specialized, regulated, and mission-critical applications without refactoring your workloads or compromising on performance. This partnership brings the power of Google Cloud’s infrastructure and advanced AI models to your hybrid cloud, without compromising on data residency, connectivity, or operational consistency. You can now run your Nutanix Hybrid Cloud directly on Google Compute Engine.

“The General Availability of Nutanix Cloud Clusters (NC2) on Google Cloud is a significant milestone empowering our joint customers to become AI-ready. We are excited to extend the simplicity and resilience of Nutanix NC2 onto Google Cloud’s high-performance workload-optimized compute. Nutanix on Google Cloud enables our customers to migrate and modernize their critical workloads while unlocking the full power of Google’s industry-leading data and AI capabilities.” – Saveen Pakala, VP, Product Management, Hybrid Cloud, Nutanix

Nutanix and Google Cloud allow you to maximize agility and minimize disruption for your critical applications. By combining NC2’s enterprise flexibility with Google Cloud’s power, you gain access to three core advantages. First, your workloads run on Compute Engine’s dynamically scalable workload-optimized infrastructure powering all machine families. Nutanix NC2 supports Compute Engine bare metal instances in the Z3 and C4 families. These are powered by the Titanium offload system and leverage Titanium SSDs for low-latency, high-throughput storage performance, hosted in Google Cloud with global reach, enterprise-grade security, and commitment to sustainability. Second, you accelerate AI innovation by co-locating data and machine learning services like Gemini Enterprise and Vertex AI. Finally, you can save costs by dynamically scaling capacity and utilizing committed use discounts (CUDs) and Flex CUDs.

Key use cases to accelerate your cloud journey

The integration of NC2 on Google Cloud offers flexible, strategic options for hybrid cloud operations. Beyond consolidation and cost control, these capabilities set the stage for true modernization:

Seamless workload migration: Move entire applications between your on-premises Nutanix environment and Google Cloud without re-factoring or re-architecting. This capability saves significant time during data center consolidation.
Consistent operations: Maintain the same management plane, security policies, and automation across your private data center and Google Cloud, which dramatically reduces operational complexity and training costs.
Disaster recovery (DR): Leverage Google Cloud as a robust and cost-efficient recovery target. Usage of a minimal “pilot light” cluster reduces compute costs, so you scale up only when a disaster event occurs.
Capacity bursting: Instantly add capacity in the cloud to handle seasonal demands, VDI workloads, development/test cycles, or requirements from mergers and acquisitions (M&A).
License portability: Protect your software investments by easily moving your existing Nutanix software licenses to Google Cloud as your business needs evolve.

“Like many others, we are always on a journey to modernize and shift to achieve the best outcomes for our customers. Nutanix Cloud Clusters (NC2) on Google Cloud brings us a solid platform to continue our hybrid cloud expansion. Our ability to seamlessly run workloads on-premises and on NC2 on Google Cloud without having to re-factor is increasingly valuable as we continue our modernization journey. We look forward to continuing our strong partnership with Google Cloud and Nutanix.” – VP of IT at a global oil & gas company based in Oklahoma

The architecture

NC2 on Compute Engine simplifies building a hybrid cloud by deploying the Nutanix Cloud Infrastructure (NCI) software stack, including the Acropolis Hypervisor (AHV), directly onto high-performance Compute Engine infrastructure.

The key components of the solution include:

Compute Engine instances: NC2 runs on Google Compute Engine bare metal instances in the recently introduced C4 and Z3 machine families. These powerful instances provide the foundation with high-density compute, memory, local NVMe storage, and high network bandwidth.

Machine Family	GCE Machine Type	vCPUs	Memory	Storage	Processor
Z3, Storage Optimized	z3-highmem-192-highlssd-metal	192	1536GB	72TB of NVMe Local SSD	Intel, Sapphire Rapid
C4, General Purpose	c4-highmem-288-lssd-metal	288	1080GB	18TB of NVMe Local SSD	Intel, Granite Rapid
C4, General Purpose	c4-standard-288-lssd-metal	288	2232GB	18TB of NVMe Local SSD	Intel, Granite Rapid

Simplified networking : NC2 runs entirely within your existing Google Cloud Virtual Private Cloud (VPC). Built-in Nutanix Flow Virtual Networking for overlay is integrated to reduce hybrid cloud complexity.
Unified management: The entire environment, both on-premises and in Google Cloud, is managed through the familiar Prism Central console, simplifying day-to-day operations and skill requirements for your IT teams.
Easy procurement: Later this month, you’ll be able to purchase Nutanix NC2 licensing directly from Google Cloud Marketplace . This offers a single, unified billing experience for both your Google Cloud infrastructure and Nutanix NC2, in one simple process. A key benefit is the ability to use your existing Google Cloud spend commitments for Nutanix NC2 software. This helps you maximize your investment and streamline your financial operations, providing more value from your cloud budget.

Connect your data to Google Cloud AI and analytics

A significant modernization opportunity comes from connecting your stable, trusted Nutanix workloads with Google Cloud’s powerful data and AI tools. Your applications running on NC2 can tap directly into services like BigQuery and Vertex AI with low latency, enabling you to:

Derive deeper business value: Easily send application log data, transactional records, and other operational data from your Nutanix VMs to BigQuery for real-time, scalable data warehousing and complex analysis.
Build custom machine learning models: Use Vertex AI to create, deploy, and manage custom ML models that analyze data generated by your core applications (e.g., predictive maintenance or fraud detection).
Use conversational AI: Quickly build and deploy conversational agents using technologies like Dialogflow that interact directly with the application data residing on your NC2 cluster.

Ready to simplify your cloud operations?

NC2 on Google Cloud is currently available across 17 Google Cloud regions, with a planned expansion continuing through 2026. For precise details on regional and zonal availability, please check the official Google Compute Engine bare metal regional availability documentation, and reference the Compute Engine pricing page for infrastructure costs. To learn more about the solution, try taking a test drive or visit Nutanix partner page. Available later this month, you will be able to explore NC2 on Google Cloud licensing through the Google Cloud Marketplace.

Read More for the details.

2025 12 09

AWS – AWS Partner Central now includes opportunity deal sizing

Tibor Kiss AWS, Cloud AWS

Today, AWS announces deal sizing capability in AWS Partner Central. This new feature, available within the APN Customer Engagements (ACE) Opportunities, uses AI to provide deal size estimates and AWS service recommendations. Deal Sizing capability allows Partners to save time on deal management by simplifying the process of estimating AWS monthly recurring revenue (MMR) when creating or updating opportunities.

Partners can optionally import AWS Pricing Calculator URLs to automatically populate AWS service selections and corresponding spend estimates into their opportunities, reducing the need for manual re-entry. When a Pricing Calculator URL is provided, deal sizing delivers enhanced insights including pricing strategy optimization recommendations, potential cost savings analysis, Migration Acceleration Program (MAP) eligibility indicators, and modernization pathway analysis. These enhanced insights help Partners refine their technical approach and strengthen funding applications, accelerating the funding approval process.

Deal sizing is now available in AWS Partner Central worldwide. The feature is accessible through both AWS Partner Central and the AWS Partner Central API for Selling, which is available in the US East (N. Virginia) Region.

To get started, log in to AWS Partner Centr al in the console to create or update opportunities and view deal sizing insights. For API integration with your CRM system, see the AWS Partner Central API Documentation. To learn more about deal sizing, visit the Partner Central Sales Guide.

Read More for the details.

2025 12 09

AWS – Amazon RDS and Aurora now support resource tagging for Automated Backups

Tibor Kiss AWS, Cloud AWS

Amazon RDS and Aurora now support resource tagging for automated backups and cluster automated backups. You can now tag your automated backups separately from the parent DB instance or DB cluster, enabling Attribute-Based Access Control (ABAC) and simplifying resource management and cost tracking.

With this launch, you can tag automated backups in the same way as other RDS resources using the AWS Management Console, API, or SDK. Use these tags with IAM policies to control access and permissions to automated backups. Additionally, these tags can help you categorize your resources by application, project, department, environment, and more, as well as manage, organize, and track costs of your automated backups. For example, create application specific tags to control permissions for describing, deleting, or restoring automated backups and to organize and track backup costs of the application.

This capability is available in all AWS Regions, including the AWS GovCloud (US) Regions where Aurora and RDS are available.

To learn more about tagging Aurora and RDS automated backups, see the Amazon documentation on Tagging Amazon Aurora resources, Tagging Amazon RDS resources, and Using tags for attribute-based access control.

Read More for the details.

2025 12 09

AWS – Amazon GameLift Servers enhances AWS Console for game developers with AI powered assistance

Tibor Kiss AWS, Cloud AWS

Today, Amazon GameLift Servers is launching AI-powered assistance in the AWS Console, leveraging Amazon Q Developer to provide tailored guidance for game developers. This new feature integrates specialized GameLift Servers knowledge to help customers navigate complex workflows, troubleshoot issues, and optimize their game server deployments more efficiently.

Developers can now access AI-assisted recommendations for game server integration, fleet configuration, and performance optimization directly within the AWS Console via Amazon GameLift Servers. This enhancement aims to streamline decision making processes, reduce troubleshooting time, and improve overall resource utilization, leading to cost savings and better player experiences.

AI-powered assistance is now available in all Amazon GameLift Servers supported regions, except AWS China. To learn more about this new feature, visit the Amazon GameLift Servers documentation.

Read More for the details.

2025 12 08

GCP – Is your DR plan just wishful thinking? Prove your resilience with chaos engineering

Tibor Kiss Cloud, Google Cloud gcp

When was the last time you knew — not just hoped — that your disaster recovery plan would work perfectly?

For most of us, the answer is unclear. Sure, you may have a DR plan, a meticulously crafted document stored in a wiki or a shared drive, that gets dusted off for compliance audits or the occasional tabletop drill. You assume its procedures are correct, its contact lists are current, and its dependencies are fully mapped, and you certainly hope it works.

But hope is not a strategy.

Why wouldn’t it work? One problem is that systems are rarely static anymore. In a world where you deploy new microservices dozens of times per day, make constant configuration changes, and maintain an ever-growing web of third-party API dependencies, the DR plan you wrote last quarter is probably just as useful as one from 10 years ago.

And if the failover does work, will it work well enough to meet the promises you’ve made to your customers (or board of directors or regulators)? When a key component fails, could you still even meet your target availability and latency targets, a.k.a., your Service Level Objectives (SLOs)?

So, how do you close this gap between your current aspirational DR plan and a DR plan that you actually have confidence in? The answer isn’t to write more documents or run more theatrical drills. The answer is to stop assuming and start proving.

This is where chaos engineering comes in. Unlike what the name might imply, chaos engineering isn’t a tool for recklessly breaking things. Instead, it’s a framework that provides data-driven confidence in your SLOs under stress. By running controlled experiments that simulate real-world disasters like a database failover or a regional outage, you can quantitatively measure the impact of those failures on your systems’ performance. Chaos engineering is how you transform your DR hypotheses into a proven method to ensure resilience. By validating your plan through experimentation, you create tangible evidence, verifying that your plan will safeguard your infrastructure and keep your promises to customers.

Demystifying chaos engineering

In a nutshell, chaos engineering is the practice of running controlled, scientific experiments to find weaknesses in your system before they cause a real outage.

At its core, it’s about building confidence in your system’s resilience. The process starts with understanding your system’s steady state, which is its normal, measurable, and healthy output. You can’t know the true impact of a failure without first defining what “good” looks like. This understanding allows you to form a clear, testable hypothesis: a statement of belief that your system’s steady state will persist even when a specific, turbulent condition is introduced.

To test this hypothesis, you then execute a controlled action, which is a precise and targeted failure injected into the system. This isn’t random mischief; it’s a specific simulation of real-world failures, such as consuming all CPU on a host (resource exhaustion), adding network latency (network failure), or terminating a virtual machine (state failure). While this action is running, automated probes act as your scientific instruments, continuously monitoring the system’s state to measure the effect.

Together, these components form a complete scientific loop: you use a hypothesis to predict resilience, run an experiment by applying an action to simulate adversity, and use probes to measure the impact, turning uncertainty into hard data.

Using chaos to validate disaster recovery plans

Now that you understand the building blocks of a chaos experiment, you can build the bridge to your ultimate goal: transforming your DR plan from a document of hope into an evidence-based procedure. The key is to stop seeing your DR plan as a set of instructions and start seeing it for what it truly is: a collection of unproven hypotheses.

When you think about it, every significant statement in your DR document is a claim waiting to be tested. When your plan states, “The database will failover to the replica in under 5 minutes,” that isn’t a fact, it’s a hypothesis. When it says, “In the event of a regional outage, traffic will be successfully rerouted to the secondary region,” that’s another hypothesis. Your DR plan is filled with these critical assumptions about how your system should behave under duress. Until you test them, they remain nothing more than educated guesses.

Chaos experiments are the ultimate validation tools, live-fire drills that put your DR hypotheses to a real, empirical test. Instead of just talking through a scenario, you use controlled actions to safely and precisely simulate the disaster. You’re no longer asking “what if?”; you’re actively measuring “what happens when.”

For example, imagine you have a DR plan for a regional outage. When you adopt chaos engineering, you break down that plan into a hypothesis and an experiment. For example:

The hypothesis: “In case our primary region us-central1 becomes unreachable, the load balancers will failover all traffic to us-east1 within 3 minutes, with an error rate below 1%.”
The chaos experiment: Run an action that simulates a regional outage by injecting a “blackhole” that drops all network traffic to and from us-central1 for a limited time. Your probes then measure the actual failover time and error rates to validate the hypothesis.

In other words, by applying the chaos engineering methodology, you systematically move through your DR plan, turning each assumption into a proven fact. You’re not just testing your plan; you’re forging it in a controlled fire.

Connecting chaos readiness to your SLOs

Beyond simply proving system availability, chaos engineering builds trust in your reliability metrics, ensuring that you meet your SLOs even when services become unavailable. An SLO is a specific, acceptable target level of your service’s performance measured over a specified period that reflects the user’s experience. SLOs aren’t just internal goals; they are the bedrock of customer trust and the foundation of your contractual service level agreements (SLAs).

A traditional DR drill might get a “pass” because the backup system came online. But what if it took 20 minutes to fail over, during which every user saw errors? What if the backup region was under-provisioned, and performance became so slow that the service was unusable? From a technical perspective, you “recovered.” But from a customer’s perspective, you were down.

A chaos experiment, however, can help you answer a critical question: “During a failover, did we still meet our SLOs?” Because your probes are constantly measuring performance against your SLOs, you get the full picture. You don’t just see that the database failed over; you see that it took 7 minutes, during which your latency SLO was breached and your error budget was completely burned. This is the crucial, game-changing insight. It shifts the entire goal from simple disaster recovery to SLO preservation, which is what actually determines if a failure was a minor hiccup or a major business-impacting incident. It also provides the data necessary to set goals for system improvement. So the next time you run this experiment, you can measure if and how much your system resilience has improved, and ultimately if you can maintain your SLO during the disaster event.

Build a culture of confidence

The journey to resilience doesn’t start by simulating a full regional failover. It starts with a single, small experiment. The goal is not to boil the ocean; it’s to build momentum. Test one timeout, one retry mechanism, or one graceful error message.

The biggest win from your first successful experiment won’t be the technical data you gather. It will be the confidence you build. When your team sees that they can safely inject failure, learn from it, and improve the system, their entire relationship with failure changes. Fear is replaced by curiosity. That confidence is the catalyst for building a true, enduring culture of resilience. To learn more and get started with chaos engineering, check out this blog and this podcast. And if you’re ready to get started, but unsure how, reach out to Google Cloud professional services to discuss how we can help.

Read More for the details.

2025 12 08

GCP – Streamline the design and deployment of application infrastructure with Application Design Center, now GA

Tibor Kiss Cloud, Google Cloud gcp

Earlier this year, we unveiled a big investment in platform and developer team productivity, with the launch of Application Design Center, helping them streamline the design and deployment of cloud application infrastructure, while ensuring applications are secure, reliable, and aligned with best practices. And today, Application Design Center is generally available.

We built Application Design Center to put applications at the center of your cloud experience, with a visual, canvas-style and AI-powered approach to design and modify Terraform-backed application templates. It also offers full lifecycle management that’s aligned with DevOps best practices across application design and deployment.

Application Design Center is a core component of our application-centric cloud experience. When you use Application Design Center to design and deploy your application infrastructure, your applications are easily discoverable, observable, and manageable. Application Design Center works in concert with App Hub to automatically register application deployments, enabling a unified view and control plane for your application portfolio, and Cloud Hub, to provide operational insights for your applications.

“Google Application Design Center is a valuable enabler for Platform Engineering, providing a structured approach to harmonizing resource creation in Google Cloud Platform. By aligning tools, processes, and technologies, it streamlines workflows, reducing friction between development, operations, and other teams. This harmonization enhances collaboration, accelerates delivery, and ensures consistency across Google Cloud environments.” – Ervis Duraj, Principal Engineer, MediaMarktSaturn Technology

The gateway to an app-centric cloud

Our goal with Application Design Center is for you to innovate more, and administer less. It consists of four key elements to help you minimize administrative overhead and maximize efficiency, so you can design and deploy applications with integrated best practices and essential guardrails. Let’s take a closer look.

1. Terraform components and application templates
Develop applications faster with our growing library of opinionated application templates. These provide well-architected patterns and pre-built components, including innovative “AI inference templates” to help you leverage AI to create dynamic and intelligent application foundations. As an example, at launch, Application Design Center provides opinionated templates for Google Kubernetes Engine (GKE) clusters (Standard, Autopilot and NodePool) to run AI inference workloads using a variety of LLM models, as well as for enterprise-grade production clusters or single-region web app clusters.

You can also ingest and manage your existing Terraform configurations (“Bring your own Terraform”) directly from Git repositories. Once imported, you can use Application Design Center to design with your own Terraform, or in combination with Google-provided Terraform, to create standardized, opinionated infrastructure patterns for sharing and reuse across your application teams.

2. AI-powered design for rapid application designing and prototyping
Application Design Center integrates with Google’s Gemini Cloud Assist Design Agent, empowering you to design actual, deployable application infrastructure application templates on Google Cloud that you can export as Terraform infrastructure-as-code.

With Gemini Cloud Assist, you can describe your application design intents using natural language. In return, Gemini interactively generates multi-product application template suggestions, complete with visual architecture diagrams and summarized benefits. You can then refine these proposals through multi-turn reasoning or by directly manipulating the architecture within the Application Design Center canvas.

Additionally, all designs that you create with Gemini are automatically observable, optimizable, and enabled for troubleshooting assistance during runtime, thanks to their tight integration with Gemini Cloud Assist.

3. A secure, sharable catalog of application templates with full lifecycle management
Platform admins can curate a collection of application templates built from Google’s best-practice components. This provides developers a trusted, self-service experience from which they can quickly discover and deploy compliant applications. Tight integration with Cloud Hub transforms these governed templates into a live operational command center, complete with unified visibility into the health and deployment status of the resulting applications. This closes the critical loop between design and runtime, so that your production environments reflect your organization’s approved architectural standards.

Also, Application Design Center’s robust application template revisions serve as an immutable audit trail. It automatically detects and flags configuration drift between your intended designs and deployed applications, so that developers can remediate unauthorized changes or safely push approved configuration updates. This helps ensure continuous state consistency and compliance from Day 1 and through the subsequent evolution of your application.

4. GitOps integration automating developers’ day-to-day software design lifecycle tasks
By integrating Application Design Center into existing CI/CD workflows, platform teams empower developers to own the complete software delivery lifecycle right from their IDE. Developers can leverage compliant application and infrastructure (IaC) code using Application Design Center application templates.

Further, every infrastructure decision made through Application Design Center is committed to code, versioned, and auditable. Specifically, developers can download the application IaC template from Application Design Center and import it into their app repos (the single source of truth), clone their repo, and edit the Terraform directly in their local IDEs. Any modifications go through a Git pull request for review. Once approved, this automatically triggers the existing CI/CD setup to build, test, and deploy both app and infra changes in lockstep. This unified approach minimizes friction, enforcing “golden paths” and providing an end-to-end automated pathway from a line of code in the IDE to a fully deployed change in production.

What’s new since preview

This GA launch is packed with features that users have been asking for. We’re excited to share powerful new capabilities: enterprise-grade governance and security with public APIs and gcloud CLI support; full compatibility with VPC service controls; bring your own Terraform and GitOps support for integration with your existing application patterns and automation pipelines; agentic application patterns using GKE templates (Standard, Autopilot and NodePool); and finally, a simplified onboarding experience with app-managed project support, making Application Design Center an AI-powered engine for your applications on Google Cloud.

Get started today

To help you get started, Google provides a growing library of curated Google application templates built by experts. These templates combine multiple Google Cloud products and best practices to serve common use cases, which you can configure for deployment, and view as infrastructure as code in-line. Platform teams can then create and securely share the catalogs and collaborate with teammates on designs and self-service deployment for developers. For enterprises with existing Terraform patterns and assets, Application Design Center interoperates by enabling their import and reuse within its native design and configuration experience.

Ready to experience the power of Application Design Center? You can learn more about ADC and get started building in minutes using the quickstart. You can start building your first AI-powered application template in minutes, free of cost, and quickly deploy applications with working code. For deeper insights, explore the comprehensive public documentation here. We can’t wait to see how you innovate with the Application Design Center!

Read More for the details.

2025 12 08

AWS – Amazon Quick Suite integrates Quick Research with Quick Flows for report automation

Tibor Kiss AWS, Cloud AWS

Amazon Quick Suite now includes Quick Research as a step within Quick Flows. This integration enables teams to generate comprehensive research reports as part of automated, multi-step workflows, transforming research projects into reusable workflows that can be shared across their organization.

Quick Suite is Amazon’s new AI-powered workspace that helps organizations get answers from their business data and move quickly from insights to action. With this integration, teams can trigger research automatically within their flows rather than conducting separate analysis. This addresses a critical productivity challenge by enabling teams to capture and scale proven research methods across hundreds of automated use cases. The integration also allows users to automate research workflows through scheduled triggers so users can set up flows that automatically generate research at specific times. Common use cases include automated account plan creation, standardizing product compliance analysis, and scheduled industry reports.

Users benefit from pre-configured flows that generate research based on flow creator instructions and optional user inputs. The generated research report can be used further to automatically trigger downstream actions like updating a Salesforce opportunity for an account team to follow up on, posting on a Jira ticket for a compliance team to review, or creating an Asana task for a patent lawyer to approve. This unlocks “set and forget” workflows that deliver consistent analysis without manual heavy lifting. Now operating within these automated workflows, Quick Research maintains its core strength of streamlining analysis across diverse enterprise data sources while delivering verified, source-traced insights. For existing Flow users, this provides access to more comprehensive analysis.

Quick Research with Flows integration is available in the following AWS Regions: US East (N. Virginia), US West (Oregon), Asia Pacific (Sydney), and Europe (Ireland). To learn more about automating your research needs, read the Quick Suite user guide.

Read More for the details.

2025 12 08

GCP – Integrating MedGemma into clinical workflows just got easier!

Tibor Kiss Cloud, Google Cloud gcp

Our team, Google Heath AI Developer Foundations, introduced MedGemma earlier this year in May and later followed up in July with a 27-billion-parameter multimodal variant plus MedGemma’s vision encoder: MedSigLIP. We’ve been humbled by the wide variety of model adaptations, research papers, and applications MedGemma has created across academia and industry!

Our aim is to meet you where you are in your research, development, and clinical integration journey. In the earlier releases, we prioritized simplicity, where image prompts were constructed from pixels decoded from non-medical image formats such as JPEG and PNG, and medical record snippets were fed into the model in JSON format or as plain text.

However, we acknowledge the complexities of integrating MedGemma into clinical workflows in an interoperable way. That’s why standard protocols like Digital Imaging and Communications in Medicine (DICOM) and Fast Healthcare Interoperability Resources (FHIR) are crucial for integration into clinical workflows. Today, we’re pleased to announce that we have made it simpler for developers who are working with these data formats.

DICOMweb integration

We are releasing a new Docker container for MedGemma which accepts medical images as DICOMweb links:

You can use this new Docker container or source code directly to deploy DICOM-aware MedGemma services on any compute platform. However, if you’re a user of Google Cloud Platform (GCP) with data stored in Cloud DICOM Store, visit get started section in this post to get up and running in minutes using pre-built resources on Vertex Model Garden.

Note that since inception, MedSigLIP container has had native understanding of DICOM; here’s the public container, the container source code, and API spec.

When your interactive user-facing applications have to deal with complex modalities such as digital pathology Whole Slide Imaging (WSI) or multi-dimensional radiology imaging such as Computed Tomography (CT) or Magnetic Resonance Imaging (MRI), reading images server-side optimizes network performance and bypasses API payload restrictions. Furthermore, this architecture hardens security in transit and ensures consistent, deterministic data preprocessing.

Note that as a GCP user, you are not limited to deployment via Model Garden which currently deploys the model and its custom container using a fixed configuration to a Vertex online inference endpoint for you. Please refer to the serving architecture document to understand your deployment options.

FHIR navigation agent demonstration

In our FHIR integration approach, we configured MedGemma and the GCP FHIR Store as executable tools for an agent. We show how the agent formulates prompts requiring a patient’s full medical records without feeding their entire FHIR history into MedGemma’s context window, leveraging MedGemma’s awareness of the FHIR standard to intelligently navigate patient data. We demonstrate an implementation using LangGraph, a popular agentic framework, though the same can be achieved using other agentic frameworks including the GCP’s Agent Development Kit (ADK). Visit the get started section in this post and see how an agent can intelligently navigate patient data.

Get started

For DICOM-aware MedGemma, start with the model on Model Garden and use the new drop down options to deploy either 4B or 27B variants. Once deployed, use this tutorial notebook to see how to prompt the model with links to the medical images instead of the image pixels.

For FHIR navigation demonstration, start with the illustrative app and then look into the technical details in the demo notebook.

What’s next

If you are looking to build advanced agentic solutions, systems that use LLMs to perform complex, multi-step tasks, Model Context Protocol (MCP) is a reliable way to manage and deliver all the necessary context and data. To leverage MCP on GCP, you should take advantage of the open MCP Toolbox for Databases and its integration with GCP Healthcare API. If you are working with medical imaging, you can use the new DICOM-aware MedGemma to achieve more efficient server-side DICOM processing in your MCP configuration, accelerating the preparation of clinical context for your agentic applications.

Resources and support

Our mission is to enable your success. Here are the best ways to engage with our team and the community:

Seek technical support on the HAI-DEF developer forum.
File technical issues directly in the MedGemma or MedSigLIP GitHub repos.
Help shape our roadmap by sharing your use cases via our feedback form. This helps us align our engineering efforts with the industry’s most common needs.
Stay updated on new tools and models by signing up for our newsletter.
Access all resources by bookmarking goo.gle/hai-def, your one-stop shop for everything we offer.

This post includes additional contributions from Liron Yatziv – Software Engineer, Kenneth Philbrick – Software Engineer, Bram Sterling – Software Engineer, and Tiffany Chen – Software Engineer.

Read More for the details.

2025 12 08

AWS – Announcing Spatial Data Management on AWS to accelerate spatial-data insights

Tibor Kiss AWS, Cloud AWS

Today, AWS is announcing Spatial Data Management on AWS (SDMA), a solution that enables customers to store, enrich, and connect spatial data at scale. SDMA enables customers to store their multimodal spatial data representing their physical assets (3D, geospatial, behavioral, temporal data) in a secure, centralized cloud environment. SDMA serves as a collaborative hub enabling connectivity between customer’s spatial data, their ISV SaaS applications, and AWS Services. In addition, customers can use SDMA’s collection rules to define how their spatial data is organized and enriched, helping maintain consistency and governance. Customers can use SDMA’s APIs, desktop application, and web interface to efficiently manage spatial data to accelerate insights and informed decision making around physical operations.

SDMA centralizes customer’s spatial data in a secure and highly available cloud repository to enhance data transparency and accessibility across workflows. Leveraging SDMA’s automated metadata extraction for spatial data file formats, starting with: .LAZ, .E57, .GLB, and .GLTF, customers can improve data discoverability and relationships. SDMA’s REST APIs and customizable connectors simplify integrations with external applications — eliminating manual file handling and enhancing cloud and on-premises interoperability. SDMA’s intuitive web and desktop interfaces enable users across technical skill levels to manage spatial data efficiently. Auto-generated file previews are designed to improve workflow speed and data accuracy, they allow users to view and validate data without downloading large files.

SDMA is available in the following AWS regions: Asia Pacific (Tokyo, Singapore, Sydney), Europe (Frankfurt, Ireland, London), US East (N. Virginia, Ohio), US West (Oregon).

To learn more, visit the SDMA Product page.

Read More for the details.