Amazon Relational Database Service (Amazon RDS) for SQL Server now supports encrypting native backups in Amazon S3 using server-side encryption with AWS KMS keys (SSE-KMS). When customers create database backup files (.bak files) in their Amazon S3 buckets, the backup files are automatically encrypted using server-side encryption with Amazon S3-managed keys (SSE-S3). Now, customers also have the option to encrypt their native backup files in Amazon S3 using their own AWS KMS key for added protection.
To use SSE-KMS encryption for native backups, customers must update their KMS key policies to provide access to the RDS backup service, and specify the parameter @enable_bucket_default_encryption in their native backup stored procedure. For detailed instructions on how to use SSE-KMS with native backups, please refer to the Amazon RDS for SQL Server User Guide. This feature is available in all AWS Regions where Amazon RDS for SQL Server is available.
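To make the moving parts concrete, here is a minimal sketch in Python, assuming a pyodbc connection to the instance; @source_db_name and @s3_arn_to_backup_to are the long-standing rds_backup_database arguments, while the accepted values for @enable_bucket_default_encryption and the required KMS key policy changes should be confirmed in the User Guide.

```python
# Minimal sketch: invoke the native backup procedure with the new SSE-KMS option.
import pyodbc

# Placeholder connection string for the RDS for SQL Server instance.
CONNECTION_STRING = (
    "DRIVER={ODBC Driver 18 for SQL Server};"
    "SERVER=myinstance.example.us-east-1.rds.amazonaws.com;UID=admin;PWD=secret"
)

conn = pyodbc.connect(CONNECTION_STRING, autocommit=True)
conn.execute(
    """
    exec msdb.dbo.rds_backup_database
        @source_db_name = ?,
        @s3_arn_to_backup_to = ?,
        @enable_bucket_default_encryption = 1;  -- opt in to SSE-KMS (see User Guide)
    """,
    "mydatabase",
    "arn:aws:s3:::amzn-s3-demo-bucket/mydatabase.bak",
)
```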
Amazon Relational Database Service (Amazon RDS) for SQL Server now allows maintaining Change Data Capture (CDC) settings and metadata when restoring native database backups. CDC is a Microsoft SQL Server feature that customers can use to record insert, update, and delete operations occurring in a database table, and make these changes accessible to applications. When a database is restored from a backup, CDC configurations and data are not preserved by default, which can result in gaps in data capture. With this new feature, customers can preserve their database CDC settings when restoring a database backup to a new instance, or a different database name.
To retain CDC configurations, customers can specify the KEEP_CDC option when restoring a database backup. This option ensures that the CDC metadata and any captured change data are kept intact. Refer to the Amazon RDS for SQL Server User Guide to learn more about KEEP_CDC. This feature is available in all AWS Regions where Amazon RDS for SQL Server is available.
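As a companion sketch under the same assumptions, a restore call might look like the following; note that @keep_cdc is a hypothetical rendering of the KEEP_CDC option described above, so confirm the exact parameter name in the User Guide.

```python
# Illustrative sketch: restore a native backup while retaining CDC metadata.
import pyodbc

CONNECTION_STRING = "..."  # placeholder, as in the backup example above

conn = pyodbc.connect(CONNECTION_STRING, autocommit=True)
conn.execute(
    """
    exec msdb.dbo.rds_restore_database
        @restore_db_name = ?,
        @s3_arn_to_restore_from = ?,
        @keep_cdc = 1;  -- hypothetical name for the KEEP_CDC option
    """,
    "mydatabase_restored",
    "arn:aws:s3:::amzn-s3-demo-bucket/mydatabase.bak",
)
```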
AWS Parallel Computing Service (PCS) now supports rotation of cluster secret keys using AWS Secrets Manager, enabling you to update the secure credentials used for authentication between the Slurm controller and compute nodes without creating a new cluster. Regularly rotating your Slurm cluster secret keys strengthens your security posture by reducing the risk of credential compromise and ensuring compliance with best practices. This helps keep your HPC workloads and accounting data safe from unauthorized access.
PCS is a managed service that makes it easier to run and scale high performance computing (HPC) workloads on AWS using Slurm. With support for cluster secret rotation in PCS, you can strengthen your security controls and maintain operational efficiency. You can now implement secret rotation as part of your security best practices while maintaining cluster continuity.
This feature is available in all AWS Regions where PCS is available. You can rotate cluster secrets using either the AWS Secrets Manager console or API after preparing your cluster for the rotation process. Read more about PCS support for cluster secret rotation in the PCS User Guide.
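For example, once a cluster has been prepared for rotation as described in the PCS User Guide, an on-demand rotation is a single Secrets Manager call; in this boto3 sketch, the secret ARN is a placeholder, and it assumes rotation has already been configured for the secret.

```python
# Sketch: trigger an on-demand rotation of a PCS cluster secret.
import boto3

secrets = boto3.client("secretsmanager")
secrets.rotate_secret(
    SecretId="arn:aws:secretsmanager:us-east-1:111122223333:secret:pcs-cluster-secret-AbCdEf"
)
```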
AWS Graviton4-based R8g database instances are now generally available for Amazon DocumentDB (with MongoDB compatibility). R8g instances are powered by AWS Graviton4 processors and feature the latest DDR5 memory, making them ideal for memory-intensive workloads. These instances are built on the AWS Nitro System, which offloads CPU virtualization, storage, and networking functions to dedicated hardware and software to enhance the performance and security of your workloads.
Customers can get started with R8g instances through the AWS Management Console, CLI, and SDK by modifying their existing Amazon DocumentDB database cluster or creating a new one. R8g instances are available for Amazon DocumentDB 5.0 on both Standard and IO-Optimized cluster storage configurations. For more information, including Region availability, visit our pricing page and documentation.
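For instance, moving an existing instance onto R8g is one modify call; in this boto3 sketch, the identifier and instance size are placeholders.

```python
# Sketch: switch a DocumentDB 5.0 instance to a Graviton4-based R8g class.
import boto3

docdb = boto3.client("docdb")
docdb.modify_db_instance(
    DBInstanceIdentifier="my-docdb-instance",
    DBInstanceClass="db.r8g.2xlarge",
    ApplyImmediately=True,  # set False to apply during the next maintenance window
)
```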
Today, AWS’ Customer Carbon Footprint Tool (CCFT) has been updated to include Scope 3 emissions data, as well as Scope 1 emissions from natural gas and refrigerants, providing AWS customers more complete visibility into their cloud carbon footprint. This update expands the CCFT to cover all three industry-standard emission scopes as defined by the Greenhouse Gas Protocol.
The CCFT Scope 3 update gives AWS customers full visibility into the lifecycle carbon impact of their AWS usage, including emissions from manufacturing the servers that run their workloads, powering AWS facilities, and transporting equipment to data centers. Historical data is available back to January 2022, allowing organizations to track their progress over time and make informed decisions about their cloud strategy to meet their sustainability goals. This data is available through the CCFT dashboard and AWS Billing and Cost Management Data Exports, enabling customers to easily incorporate carbon insights into their operational workflows, sustainability planning, and reporting processes.
AWS Nitro Enclaves is an Amazon EC2 capability that enables customers to create isolated compute environments (enclaves) to further protect and securely process highly sensitive data within their EC2 instances. Nitro Enclaves helps customers reduce the attack surface area for their most sensitive data processing applications.
There is no additional cost other than the cost of the Amazon EC2 instances and any other AWS services that are used with Nitro Enclaves.
Nitro Enclaves is now available across all AWS Regions, expanding to include new regions in Asia Pacific (New Zealand, Thailand, Jakarta, Hyderabad, Malaysia, Melbourne, and Taipei), Europe (Spain and Zurich), Middle East (UAE and Tel Aviv), and North America (Central Mexico and Calgary).
To learn more about AWS Nitro Enclaves and how to get started, visit the AWS Nitro Enclaves page.
Editor’s note: Today we hear about SmarterX, which helps retailers, manufacturers, and logistics companies minimize regulatory risk, maximize sales, and protect consumers and the environment by giving them AI-driven tools to safely and compliantly sell, ship, store, and dispose of their products. SmarterX uses BigQuery, Gemini, and Vertex AI to collect, process, and analyze vast amounts of unstructured regulatory and product data from across the web, using it to train custom, highly accurate large language models (LLMs) that help large consumer packaged goods brands and retailers handle regulated products compliantly. Read on to learn how Google Cloud’s integrated, easy-to-use toolset is helping them accelerate product development.
Russell Foltz-Smith, EVP for product and technology at SmarterX, views the world of retail through search-colored glasses.
“If universal product codes were really universal, looking for a product and all the information directly related to it would be a one-step process,” he proposes. “But in the real world, the ideal of universality just doesn’t exist.”
It’s a reality we all deal with dozens of times a day when we type something into a browser’s search bar: There are very few queries guaranteed to return a single answer. Thus the need for what data scientists call “probabilistic search backed by algorithmic indexing and ranking strategies” (what most of us call “googling”) was born.
“In many ways, all data science and LLM-building boils down to accurate information retrieval,” adds Foltz-Smith. And he’s well-positioned to understand why.
SmarterX customers — consumer packaged goods brands, third-party retailers, distributors, and logistics companies — rely on SmarterX to make sense of the overwhelming volume of regulatory product data online. The platform helps ensure the way products are sold, shipped, stored, and disposed of complies with all applicable laws and regulations.
“SmarterX collects and indexes data, triangulates for missing data points, and provides a queryable interface that helps our customers minimize regulatory risk while maximizing sales,” Foltz-Smith explains. To do so, SmarterX hunts down regulatory information, using crawlers enabled by machine learning and natural language processing to locate, scrape, and parse data from websites, research papers, safety data sheets, and other nooks and crannies of the web where it may be tucked away.
“Google Cloud technologies are a perfect fit for our needs,” Foltz-Smith states. “At their core is the ability to surface the right search results from an inconceivably vast expanse of data where the inputs and outputs are not predetermined and the data itself is unstructured.”
Real-time data processing and fast, accurate model-building
To collect and store all that data, SmarterX employs BigQuery and Cloud Storage. “Our data sources are disparate and the formats unpredictable,” he continues. “BigQuery accommodates unstructured and semi-structured data, then functions as a job engine, recursively cleansing, normalizing, schematizing, and classifying that data at runtime.”
Google Cloud’s scalable compute resources and storage also enable real-time data processing. “We never have to worry about whether we have enough servers in a data center or adequate bandwidth,” Foltz-Smith adds. “Google Cloud hides all that complexity, so it’s handled automatically and cost-effectively.”
Further accelerating data processing is BigQuery’s integration with Gemini, which manages data-processing job queues and also forms the basis of many of the large language models, or LLMs, SmarterX builds for its clients. “Gemini is in part a collection of everything Google has already crawled, so we don’t need to re-crawl it ourselves,” Foltz-Smith notes. That makes model-building faster.
Built-in grounding — the ability to connect model output to verifiable information sources — makes Gemini a safer, more conscientious way to assemble data for SmarterX customers. And retrieval-augmented generation, or RAG, allows SmarterX to connect Gemini with customers’ proprietary databases, enhancing the LLMs’ accuracy and relevance while helping ensure the security of its customers’ data.
We never have to worry about whether we have enough servers in a data center or adequate bandwidth. Google Cloud hides all that complexity, so it’s handled automatically and cost-effectively.
Russell Foltz-Smith
Executive Vice President for Product & Technology, SmarterX
Keeping up with ecommerce and regulatory compliance
For each of its clients, SmarterX builds several discrete LLMs on Vertex AI, many of which are updated as a customer’s business requirements change.
“Vertex AI not only enables us to access Gemini directly but also provides links to smaller, publicly available AI models specific to narrowly defined topics like chemical formulas,” he says. SmarterX’s Gemini-based models can even perform complex computations such as chemistry calculations to determine flashpoints, boiling points, and pH levels. This data is then used to automatically triangulate missing data, augment existing data, or update out-of-date information.
Vertex AI also operates at scale, a necessity for a company whose clients include eight major retailers, each of which has thousands of suppliers of regulated consumer packaged goods. SmarterX’s customers include those same suppliers, each of which sells their products on third-party marketplaces like Amazon and TikTok.
“Gone are the days when a brand sold its merchandise exclusively in brick-and-mortar stores they owned,” Foltz-Smith explains. “The proliferation of retail websites and marketplace-specific product variations adds tremendous complexity to our work.” On any given day, SmarterX is processing millions of SKUs and must update each customer-specific LLM with any new compliance data, which can affect its customers’ entire supply chain — from product formulation to sales and marketing to product disposal.
It’s the integration of SQL into BigQuery and the interoperability of the entire Google Cloud technology constellation that Foltz-Smith credits with allowing SmarterX to keep pace with that volume.
“We no longer have to maintain separate workflows, learn multiple tools, and constantly jump between them,” he notes. “We can crawl the web, land the data in BigQuery, process it, write code programmatically or in SQL statements, massage training data, build new LLMs, and evaluate, deploy, and update them all within one coherent, well-orchestrated system with the same familiar interfaces throughout. Google Cloud workflows were built for high-volume data science.”
Empowering subject matter experts
Google Cloud workflows were built for democratized data science as well, with features that enable subject matter experts who are not trained data scientists to work with data directly, and even to deploy models on their own.
Among those features, according to Foltz-Smith, are the ability to easily swap in and out new sets of training data, an assistive decision-making feature for parameterization, easy-to-understand out-of-the-box visualizations for model evaluation, and templates for formatting evaluation frameworks.
“In the past, you’d need to know how to use a modeling tool, a database tool, and an API deployment tool, as well as understand the math underlying a particular model and how to write code in order to build and deploy a model,” he says. “Having it all in a single environment with familiar user interfaces enables people without a data science background to be much more productive. It’s incredibly freeing and empowering for them.”
That freedom translates into accelerated product development.
SmarterX team members with industry-specific knowledge of regulatory requirements can now evaluate, correct, and deploy the models that provide that knowledge to SmarterX customers; previously, they had to wait for a data scientist to help translate that know-how into a model for them.
“Google’s mission to organize all the information in the world and make it universally available is apparent in the tools it offers today, and that mission dovetails precisely with the way SmarterX employs data science in service to our customers,” Foltz-Smith concludes. “I’ve been a data scientist for over two decades, and the tools in Google Cloud continually exceed my expectations.”
In today’s complex threat landscape, effectively managing network security is crucial — especially across diverse environments. Organizations are looking to advanced capabilities to strengthen security, enhance threat protection, and simplify network security operations for hybrid and multicloud deployments.
We’re excited to announce new capabilities in Cloud Armor, featuring more comprehensive security policies and more granular network configuration controls, so you can more easily manage network security operations across hybrid and multicloud environments.
Improving your security posture with hierarchical security policies and organization-scoped address groups
Hierarchical security policies, now generally available, can extend Google Cloud Armor’s web application firewall (WAF) and DDoS protection by allowing security policies to be configured at the organization, folder, and project levels. This update can help large organizations manage security policies across projects with centralized control, supporting a consistent security posture and streamlined deployment of updates and mitigations.
Organization-scoped address groups, now generally available, can help manage IP range lists across multiple Cloud Armor security policies. Organization-scoped address groups can enhance scalability and manageability by enabling the definition and reuse of IP range lists for both hierarchical and project-level configurations.
You can reduce the complexity of cloud network security configurations by using organization-scoped address groups to eliminate duplicate rules and policies across multiple backends, as well as share them across products such as Cloud Next Generation Firewall for a unified and consolidated security posture.
Security Policies overview.
Enhancing threat protection with granular network policy controls
Threat actors frequently conceal malicious content in larger request bodies to circumvent detection. Our enhanced WAF inspection capability, now in preview, expands request body inspection from 8 KB to 64 KB for all preconfigured WAF rules. This deeper inspection significantly improves the ability to detect and mitigate sophisticated malicious content.
JA4 network fingerprinting support, now generally available, elevates SSL/TLS client fingerprinting with more detailed and precise client identification and profiling, while building on the foundational principles of JA3.
JA4 incorporates additional fields and metadata, and can yield deeper insights into client behavior. This advanced telemetry can provide security analysts with richer contextual information, facilitating more sophisticated security analysis, more thorough threat hunting, and the ability to differentiate legitimate traffic from malicious actors.
This new ASN-based filtering capability can strengthen security against known malicious IP addresses and traffic patterns by permitting and blocking traffic from specific autonomous system numbers (ASNs) directly at the network edge. Effectively, this can preempt the impact of known malicious entities on your services, and it is a potent instrument for safeguarding media assets and ensuring a secure user experience.
The global front end: Your unified defense strategy
Google Cloud’s global front end (GFE) provides comprehensive protection for your workloads no matter where they’ve been deployed — on Google Cloud, in other public cloud environments, in co-location facilities, or in on-premises data centers. The GFE integrates Cloud Load Balancing, Cloud Armor, and Cloud CDN into a singular, end-to-end solution at the perimeter of the Google Cross-Cloud Network.
Our GFE offering can help ensure the secure, reliable, and high-performance delivery of your services to the internet. Functioning as the dedicated security component of the GFE, Cloud Armor is your primary line of defense, protecting applications and APIs from a broad spectrum of web and DDoS attacks. It can also manage your network security posture, safeguarding against the OWASP Top 10 vulnerabilities and mitigating bot and fraud risks with reCAPTCHA Enterprise integration.
Google Cloud global front end.
Industry recognition and sustained customer confidence
Google Cloud Armor’s commitment to innovation and client success has garnered significant recognition. We are honored that Cloud Armor was acknowledged as a “Strong Performer” in The Forrester Wave™: Web Application Firewall Solutions, Q1 2025.
Forrester’s rigorous evaluation cited Google Cloud Armor’s vision and roadmap that emphasize protection and automation, with a strong focus on AI. The report also recognized Google’s streamlined operations facilitated by Gemini, and differentiated custom reporting.
The report cited Cloud Armor’s threat intelligence feeds and DevOps integrations, enabling robust security in your development pipelines. The report also noted Cloud Armor’s flexible pricing and the Cloud Armor Enterprise tier that includes threat intelligence and DDoS protection as a bundled solution.
The Forrester Wave™: Web Application Firewall Solutions, Q1 2025
Get started with Cloud Armor
With these advanced capabilities, Google Cloud Armor can empower organizations to significantly enhance their security posture and threat protection while embracing a proactive, intelligent, and unified approach to safeguarding their assets.
Forrester does not endorse any company, product, brand, or service included in its research publications and does not advise any person to select the products or services of any company or brand based on the ratings included in such publications. Information is based on the best available resources. Opinions reflect judgment at the time and are subject to change. For more information, read about Forrester’s objectivity here.
Effective AI systems operate on a foundation of context and continuous trust. When you use Dataplex Universal Catalog, Google Cloud’s unified data governance platform, the metadata that describes your data is no longer static — it’s where your AI applications can go to know where to find data and what to trust.
But when you have complex data pipelines, it’s easy for your data’s journey to become obscured, making it difficult to trace information from its origin to its eventual impact. To solve this, we are extending Dataplex lineage capabilities from object-level to column-level, starting with support for BigQuery.
“To power our AI strategy, we need absolute trust in our data. Column-level lineage provides that. It’s the foundation for governing our data responsibly and confidently.” – Latheef Syed – AVP, Data & AI Governance Engineering at Verizon
While object-level lineage tracks the top-level connections between entire tables, column-level lineage charts the specific, granular path of a single data column as it moves and transforms. With that, we are now providing a dynamic and granular map to govern your data-to-AI ecosystem, so you can ground your agentic AI applications in context. Lineage is upgraded to column-level at no extra cost.
Answering critical questions about your data
Data professionals often need precise answers about the complex relationships in their BigQuery datasets. Column-level lineage provides a graph of data flows that you can trace to find these answers quickly (a minimal API sketch follows the list below). Now you can:
Confirm that a column used in your AI models originates from an authoritative source
Understand how changes to one column affect other columns downstream before you make a modification
Trace the root cause of an issue with a column by examining its upstream transformations
Verify that sensitive data at the column level is used correctly throughout your organization
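For programmatic exploration, the sketch below uses the Data Lineage API's Python client (from the google-cloud-datacatalog-lineage package) to walk the links for a BigQuery table; the project, location, and table names are placeholders, and note that this call returns table-level links, with column-level detail layered on top per the Dataplex documentation.

```python
# Minimal sketch: list lineage links touching a BigQuery table.
from google.cloud import datacatalog_lineage_v1 as lineage

client = lineage.LineageClient()
links = client.search_links(
    request=lineage.SearchLinksRequest(
        parent="projects/my-project/locations/us",
        source=lineage.EntityReference(
            fully_qualified_name="bigquery:my-project.sales.orders"
        ),
    )
)
for link in links:  # each link is one source -> target data flow
    print(link.source.fully_qualified_name, "->", link.target.fully_qualified_name)
```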
“Column-level lineage takes the trusted map of our data ecosystem to the next level. It’s the precision tool we need to fully understand the impact of a change, trace a problem to its source, and ensure compliance down to the most granular detail.” – Arvind Rajagopalan – AVP, Data / AI & Product Engineering at Verizon
Explore lineage visually
Dataplex now provides an interactive, visual representation of column-level lineage relationships. You can select a single column in a table to see a graph of all its upstream and downstream connections. As you navigate the graph at the asset level, you can drill down to the column level to verify which specific columns are affected by a process. You can also visualize the direct lineage paths between the columns of two different assets, giving you a focused view of their relationship.
Column-level tracing for AI models
Tables used for AI and ML model training often have data coming from different sources and taking different paths, and it’s important to have granular visibility into the data’s journey. For example, in complex AI/ML feature tables, a single table for model training may contain many columns. Column-level lineage can verify that one column originates from a trusted, audited financial system, while another comes from ephemeral web logs. Table-level lineage would obscure this critical distinction, treating all features with the same level of trust.
Powering context-aware AI agents
More companies are developing AI agents to automate tasks and answer complex questions about their data, and these agents require a deep understanding of business and organizational context to be effective. The granular metadata provided by column-level lineage supplies this necessary context. For example, it can allow an agent to distinguish between similarly named metrics. By tracing each column’s path, including its usage frequency and freshness, lineage gives the agent context on how important a column is when it is affected by a change, or how severe the impact is when troubleshooting. By grounding AI agents in a rich, factual map of your data assets and their relationships, you can build more accurate and reliable agentic workflows.
Google Axion processors, our first custom Arm®-based CPUs, mark a major step in delivering both performance and energy efficiency for Google Cloud customers and our first-party services, providing up to 65% better price-performance and up to 60% better energy efficiency than comparable instances on Google Cloud.
We put Axion processors to the test: running Google production services. Now that our clusters contain both x86 and Axion Arm-based machines, Google’s production services are able to run tasks simultaneously on multiple instruction-set architectures (ISAs). Today, this means most binaries that compile for x86 now need to compile to both x86 and Arm at the same time — no small thing when you consider that the Google environment includes over 100,000 applications!
We recently published a preprint of a paper called “Instruction Set Migration at Warehouse Scale” about our migration process, in which we analyze 38,156 commits we made to Google’s giant monorepo, Google3. To make a long story short, the paper describes the combination of hard work, automation, and AI we used to get to where we are today. We currently serve Google services in production on Arm and x86 simultaneously including YouTube, Gmail, and BigQuery, and we have migrated more than 30,000 applications to Arm, with Arm hardware fully-subscribed and more servers deployed each month.
Let’s take a brief look at two steps on our journey to make Google multi-architecture, or ‘multiarch’: an analysis of migration patterns, and exploring the use of AI in porting the code. For more, be sure to read the entire paper.
Migrating all of Google’s services to multiarch
Going into a migration from x86-only to Arm and x86, both the multiarch team and the application owners assumed that we would be spending time on architectural differences such as floating point drift, concurrency, intrinsics such as platform-specific operators, and performance.
At first, we migrated some of our top jobs like F1, Spanner, and Bigtable using typical software practices, complete with weekly meetings and dedicated engineers. In this early period, we found evidence of the above issues, but not nearly as many as we expected. It turns out modern compilers and tools like sanitizers have shaken out most of the surprises. Instead, we spent the majority of our time working on issues like:
fixing tests that broke because they overfit to our existing x86 servers
updating intricate build and release systems, usually for our oldest and highest-traffic services
resolving rollout issues in production configurations
taking care to avoid destabilizing critical systems
Moving a dozen applications to Arm this way absolutely worked, and we were proud to get things running on Borg, our cluster management system. As one engineer remarked, “Everyone fixated on the totally different toolchain, and [assumed] surely everything would break. The majority of the difficulty was configs and boring stuff.”
And yet, it’s not sufficient to migrate a few big jobs and be done. Although ~60% of our running compute is in our top 50 applications, the curve of usage across the remaining applications in Google’s monorepo is relatively flat. The more jobs that can run on multiple architectures, the easier it is for Borg to fit them efficiently into cells. For good utilization of our Arm servers, then, we needed to address this long list of the remaining 100,000+ applications.
The multiarch team could not effectively reach out to so many application owners; just setting up the meetings would have been cost-prohibitive! Instead, we have relied on automation, helping to minimize involvement from the application teams themselves.
Automation tools

We had many sources of automation to help us, some of which we already used widely at Google before we started the multiarch migration. These include:
Rosie, which lets us programmatically generate large numbers of commits and shepherd them through the code review process. For example, the commit could be one line to enable Arm in a job’s Blueprint: "arm_variant_mode = ::blueprint::VariantMode::VARIANT_MODE_RELEASE"
Sanitizers and fuzzers, which catch common differences in execution between x86 and Arm (e.g., data races that are hidden by x86’s TSO memory model). Catching these kinds of issues ahead of time avoids non-deterministic, hard-to-debug behavior when recompiling to a new ISA.
Continuous Health Monitoring Platform (CHAMP), which is a new automated framework for rolling out and monitoring multiarch jobs. It automatically evicts jobs that cause issues on Arm, such as crash-looping or exhibiting very slow throughput, for later offline tuning and debugging.
We also began using an AI-based migration tool called CogniPort — more on that below.
Analysis

The 38,156 commits to our code monorepo constituted most of the commits across the entire ISA migration project, from huge jobs like Bigtable to myriad tiny ones. To analyze these commits, we passed the commit messages and code diffs into the Gemini Flash LLM’s 1M-token context window in groups of 100, generating 16 categories of commits in four overarching groups.
Figure 1: Commits fall into four overarching groups.
Once we had a final list, we ran commits again through the model and had it assign one of these 16 categories to each of them (as well as an additional “Uncategorized” category, which improved stability of the categorization by catching outliers).
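The categorization step can be sketched as follows, assuming the google-genai SDK; the real pipeline, prompt, and 16-category list are internal, so the names below are illustrative.

```python
# Illustrative sketch: batched commit categorization with a long-context model.
from google import genai

client = genai.Client()  # assumes API credentials in the environment

CATEGORIES = ["Tooling adaptation", "Test adaptation", "Code adaptation", "..."]  # stand-in list

def categorize(commits: list[str]) -> str:
    """Send a batch of up to 100 commit messages and diffs in one request."""
    prompt = (
        f"Assign each commit below to one of {CATEGORIES} or 'Uncategorized':\n\n"
        + "\n---\n".join(commits)
    )
    response = client.models.generate_content(
        model="gemini-1.5-flash",  # a Gemini Flash model with a 1M-token context window
        contents=prompt,
    )
    return response.text
```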
Figure 2: Code examples in the first two categories. More examples are available in the paper.
Altogether, this analysis covered about 700K changed lines of code. We plotted the timeline of our ISA migration, normalized, as lines of code per day or month changed over time.
Figure 3: CLs by category by time, normalized.
As you can see, as we started on our multiarch toolchain, the largest set of commits was in tooling and test adaptation. Over time, a larger fraction of commits were around code adaptation, aligned with the first few large applications that we migrated. During this phase, the focus was on updating code in shared dependencies and addressing common issues in code and tests as we prepared for scale. In the final phase of the process, almost all commits were configuration files and supporting processes. We also saw that, in this later phase, the number of merged commits rapidly increased, capturing the scale-up of the migration to the whole repository.
Figure 4: CLs by category by time, in raw counts.
It’s worth noting that, overall, most commits related to migration are small. The largest commits are often to very large lists or configurations, as opposed to signaling more inherent complexity or intricate changes to single files.
Automating ISA migrations with AI
Modern generative AI techniques represent an opportunity to automate the remainder of the ISA migration process. We built an agent called CogniPort, which aims to close this gap. CogniPort operates on build and test errors: if at any point in the process an Arm library, binary, or test does not build, or a test fails with an error, the agent steps in and aims to fix the problem automatically. As a first step, we have already used CogniPort’s Blueprint editing mode to generate migration commits that do not lend themselves to simple changes.
The agent consists of three nested agentic loops, shown below. Each loop executes an LLM to produce one step of reasoning and a tool invocation. The tool is executed and the outputs are attached to the agent’s context.
Figure 5: CogniPort
The outermost agent loop is an orchestrator that repeatedly calls the two other agents, the build-fixer agent and the test-fixer agent. The build-fixer agent tries to build a particular target and makes modifications to files until the target builds successfully or the agent gives up. The test-fixer agent tries to run a particular test and makes modifications until the test succeeds or the agent gives up (and in the process, it may use the build-fixer agent to address build failures in the test).
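The control flow can be sketched as nested loops; this is a purely illustrative pseudostructure, and the four helpers (llm_step, run_tool, builds, passes) are hypothetical stand-ins for the agent's real tooling.

```python
# Purely illustrative pseudostructure of CogniPort's three nested agent loops.
MAX_STEPS = 20  # arbitrary give-up threshold for this sketch

def llm_step(context):    # hypothetical: one step of LLM reasoning plus a tool choice
    raise NotImplementedError

def run_tool(tool_call):  # hypothetical: execute the chosen tool, return its output
    raise NotImplementedError

def builds(target) -> bool: ...   # hypothetical build check
def passes(test) -> bool: ...     # hypothetical test check

def agent_loop(goal: str, done) -> bool:
    """One agent: each iteration is one LLM reasoning step plus one tool call."""
    context = [goal]
    for _ in range(MAX_STEPS):
        if done():
            return True
        thought, tool_call = llm_step(context)
        context += [thought, run_tool(tool_call)]  # attach tool output to context
    return False  # the agent gives up

def build_fixer(target: str) -> bool:
    """Modify files until the target builds, or give up."""
    return agent_loop(f"make {target} build", lambda: builds(target))

def test_fixer(test: str) -> bool:
    """Modify files until the test passes; uses build_fixer to clear build breaks."""
    return build_fixer(test) and agent_loop(f"make {test} pass", lambda: passes(test))

def orchestrator(targets, tests) -> bool:
    """Outermost loop: repeatedly dispatches the build-fixer and test-fixer agents."""
    return all(map(build_fixer, targets)) and all(map(test_fixer, tests))
```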
Testing CogniPort
While we only recently scaled up CogniPort usage to high levels, we had the opportunity to more formally test its behavior by taking historic commits from the dataset above that were created without AI assistance. Focusing on Code & Test Adaptation (categories 1-8) commits that we could cleanly roll back (not all of the other categories were suitable for this approach), we generated a benchmark set of 245 commits. We then rolled the commits back and evaluated whether the agent was able to fix them.
Figure 6: CogniPort results
Despite no special prompts or other optimizations, early tests were very encouraging: CogniPort successfully fixed failed tests 30% of the time. It was particularly effective for test fixes, platform-specific conditionals, and data representation fixes. We’re confident that as we invest in further optimizations of this approach, we will be even more successful.
A multiarch future
From here, we still have tens of thousands more applications to address with automation. To cover future code growth, all new applications are designed to be multiarch by default. We will continue to use CogniPort to fix tests and configurations, and we will also work with application owners on trickier changes. (One lesson of this project is how well owners tend to know their code!)
Yet, we’re increasingly confident in our goal of driving Google’s monorepo towards architecture neutrality for production services, for a variety of reasons:
All of the code used for production services is still visible in a single, vast monorepo.
Most of the structural changes we need to build, run, and debug multiarch applications are done.
Existing automation like Rosie and the recently developed CHAMP allows us to keep expanding release and rollout targets without much intervention on our part.
Last but not least, LLM-based automation will allow us to address much of the remaining long tail of applications for a multi-ISA Google fleet.
To read even more about what we learned, don’t miss the paper itself. And to learn about our chip designs and how we’re operating a more sustainable cloud, you can read about Axion at g.co/cloud/axion.
This blog post and the associated paper represent the work of a very large team. The paper authors are Eric Christopher, Kevin Crossan, Wolff Dobson, Chris Kennelly, Drew Lewis, Kun Lin, Martin Maas, Parthasarathy Ranganathan, Emma Rapati, and Brian Yang, in collaboration with dozens of other Googlers working on our Arm porting efforts.
Google Threat Intelligence Group (GTIG) observed multiple instances of pro-Russia information operations (IO) actors promoting narratives related to the reported incursion of Russian drones into Polish airspace that occurred on Sept. 9-10, 2025. The identified IO activity, which mobilized in response to this event and the ensuing political and security developments, appeared consistent with previously observed instances of pro-Russia IO targeting Poland—and more broadly the NATO Alliance and the West. Information provided in this report was derived from GTIG’s tracking of IO beyond Google surfaces. Google is committed to information transparency, and we will continue tracking these threats and blocking their inauthentic content on Google’s platforms. We regularly disclose our latest enforcement actions in the TAG Bulletin.
Observed messaging surrounding the Russian drone incursion into Polish airspace advanced multiple, often intersecting, influence objectives aligned with historic pro-Russia IO threat activity:
Promoting a Positive Russian Image: Concerted efforts to amplify messaging denying Russia’s culpability for the incursion.
Blaming NATO and the West: The reframing of the events to serve Russian strategic interests, effectively accusing either Poland or NATO of manufacturing pretext to serve their own political agendas.
Undermining Domestic Confidence in Polish Government: Messaging designed to negatively influence Polish domestic support for its own government, by insinuating that its actions related to both the event itself and the broader conflict in Ukraine are detrimental to Poland’s domestic stability.
Undermining International Support to Ukraine: Messaging designed to undercut Polish domestic support for its government’s foreign policy position towards Ukraine.
Notably, Russia-aligned influence activities have long prioritized Poland, frequently leveraging a combination of Poland-focused operations targeting the country domestically, as well as operations that have promoted Poland-related narratives more broadly to global audiences. However, the mobilization of covert assets within Russia’s propaganda and disinformation ecosystem in response to this most recent event is demonstrative of how established pro-Russia influence infrastructure—including both long-standing influence campaigns and those which more recently emerged in response to Russia’s full-scale invasion of Ukraine in 2022—can be flexibly leveraged by operators to rapidly respond to high-profile, emerging geopolitical stressors.
Examples highlighted in this report are designed to provide a representative snapshot of pro-Russia influence activities surrounding the Russian drone incursion into Polish airspace; they are not intended to be a comprehensive account of all pro-Russia activity that may have leveraged these events.
Multiple IO actors that GTIG tracks rapidly promoted related narratives in the period immediately following the drone incursion. While this by itself is not evidence of coordination across these groups, it does highlight how influence actors throughout the pro-Russia ecosystem have honed their activity to be responsive to major geopolitical developments. This blog post contains examples that we initially observed as part of this activity.
Portal Kombat
The actor publicly referred to as Portal Kombat (aka the “Pravda Network”) has been publicly reported on since at least 2024 as operating a network of domains that act as amplifiers of content seeded within the broader pro-Russia ecosystem, primarily focused on Russia’s invasion of Ukraine. These domains share near identical characteristics while each targeting different geographic regions. As has likewise been documented in public reporting, over time Portal Kombat has developed new infrastructure to expand its targeting of the West and other countries around the world via subdomains stemming from a single actor-controlled domain. Some examples of Portal Kombat’s promoted narratives related to the incursion of Russian drones into Polish airspace include the following:
One article, ostensibly reporting on the crash of one of the drones, called into question whether the drones could have come from Russia, noting that the type of drones purportedly involved are not capable of reaching Poland.
Another article claimed that officials from Poland and the Baltic States politicized the issue, intentionally reframing it as a threat to NATO as a means to derail possible Russia-U.S. negotiations regarding the conflict in Ukraine out of a fear that the U.S. would deprioritize the region to focus on China. The article further claimed that videos of the drones shown in the Polish media are fake, and that the Russian military does not have a real intention of attacking Poland.
A separate article promoted a purported statement made by a Ukrainian military expert, claiming that the result of the drone incursion was that Europe will focus its spending on defense at home, rather than on support for Ukraine—the purported statement speculated as to whether this was the intention of the incursion itself.
Figure 1: Example of an English-language article published by the Portal Kombat domain network, which promoted a narrative alleging that Polish and Baltic State officials were using news of the Russian drone incursion to derail U.S.-Russia negotiations related to the war in Ukraine
Doppelganger
The “Doppelganger” pro-Russia IO actor has created a network of inauthentic custom media brands that it leverages to target Europe, the U.S., and elsewhere. These websites often have a specific topical and regional focus and publish content in the language of the target audience. GTIG identified at least two instances in which Polish-language and German-language inauthentic custom media brands that we track disseminated content that leveraged the drone incident (Figure 2).
A Polish-language article published to the domain of the Doppelganger custom media brand “Polski Kompas” promoted a narrative that leveraged the drone incursions as a means to claim that the Polish people do not support the government’s Ukraine policy. The article claimed that such support not only places a burden on Poland’s budget, but also risks the security and safety of the Polish people.
A German-language article published to the domain of the Doppelganger custom media brand “Deutsche Intelligenz” claimed that the European reaction to the drone incident was hyperinflated by officials as part of an effort to intimidate Europeans into entering conflict with Russia. The article claimed that Russia provided warning about the drones, underscoring that they were not threatening, and that NATO used this as pretext to increase its regional presence—steps that the article claimed pose a risk to Russia’s security and could lead to war.
Figure 2: Examples of articles published to the domains of two Doppelganger inauthentic media brands: Polski Kompas (left) and Deutsche Intelligenz (right)
Niezależny Dziennik Polityczny (NDP)
The online publication “Niezależny Dziennik Polityczny” is a self-proclaimed “independent political journal” focused on Polish domestic politics and foreign policy and is the primary dissemination vector leveraged by the eponymously named long-standing, pro-Russia influence campaign, which GTIG refers to as “NDP”. The publication has historically leveraged a number of suspected inauthentic personas as editors or contributing authors, most of whom have previously maintained accounts across multiple Western social media platforms and Polish-language blogging sites. NDP has been characterized by multiple sources as a prolific purveyor of primarily anti-NATO disinformation and has recently been a significant amplifier within the Polish information space of pro-Russia disinformation surrounding Russia’s ongoing invasion of Ukraine.
Examples of NDP promoted narratives related to the incursion of Russian drones into Polish airspace:
GTIG observed an article published under the name of a previously attributed NDP persona, which referenced the recent Polish response to the Russian drone incursion as a component of ongoing “war hysteria” artificially constructed to distract the Polish people from domestic issues. The article further framed other NATO activity in the region as disproportionate and potentially destabilizing (Figure 3).
Additionally, GTIG observed content promoted by NDP branded social media assets that referenced the drone incursion in the days following these events. This included posts that alleged that Poland had been pre-warned about the drones, that Polish leadership was cynically and disproportionately responding to the incident, and that a majority of Poles blame Ukraine, NATO, or the Polish Government for the incident.
Figure 3: Examples of narratives related to the Russian drone incursion into Polish airspace promoted by the NDP campaign’s “political journal” (left) and branded social media asset (right)
Outlook
Covert information operations and the spread of disinformation are increasingly key components of Russian state-aligned actors’ efforts to advance their interests in the context of conflict. Enabled by an established online ecosystem, these actors seek to manipulate audiences to achieve ends like the exaggeration of kinetic military action’s efficacy and the incitement of fear, uncertainty, and doubt within vulnerable populations. The use of covert influence tactics in these instances is manifold: at minimum, it undermines society’s ability to establish a fact-based understanding of potential threats in real-time by diluting the information environment with noise; in tandem, it is also used to both shape realities on the ground and project messaging strategically aligned with one’s interests—both domestically and to international audiences abroad.
While the aforementioned observations highlight tactics leveraged by specifically Russia-aligned threat actors within the context of recent Russian drone incursions into Polish airspace, these observations are largely consistent with historical expectations of various ideologically-aligned threat actors tracked by GTIG and their respective efforts to saturate target information environments during wartime. Understanding both how and why malicious threat actors exploit high-profile, and often emerging, geopolitical stressors to further their political objectives is critical in identifying both how the threats themselves manifest and how to mitigate their potential impact. Separately, we note that the recent mobilization of covert assets within Russia’s propaganda and disinformation ecosystem in response to Russia’s drone incursion into Polish airspace is yet another data point suggesting Poland—and NATO allied countries, more broadly—will remain a high priority target of Russia-aligned influence activities.
Today, Amazon Simple Email Service (SES) added visibility into the IP addresses used by Dedicated IP Addresses – Managed (DIP-M) pools. Customers can now find out the exact addresses in use when sending emails through DIP-M pools to mailbox providers. Customers can also see Microsoft Smart Network Data Services (SNDS) metrics for these IP addresses, giving them more insight into their sending reputation with Microsoft mailbox providers. This gives customers more transparency into the IP activities in DIP-M pools.
Previously, customers could configure DIP-M pools to perform automatic IP allocation and warm-up in response to changes in email sending volumes. This reduced the operational overhead of managing dedicated sending channels, but customers could not easily see which IP addresses were in use by DIP-M pools. This also made it difficult to find SNDS feedback, which customers use to improve their reputation. Now, customers can see the IPs in DIP-M pools through the console, CLI, or SES API. SES also automatically creates CloudWatch Metrics for SNDS information on each IP address, which customers can access through the CloudWatch console or APIs. This gives customers more tools to monitor their sending reputation.
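A quick sketch of the new visibility, using the SES v2 API via boto3; the pool name is a placeholder, and the CloudWatch namespace and metric names for the SNDS data are listed in the SES documentation.

```python
# Sketch: list the dedicated IP addresses currently in a DIP-M pool.
import boto3

sesv2 = boto3.client("sesv2")
page = sesv2.get_dedicated_ips(PoolName="my-managed-pool")
for ip in page["DedicatedIps"]:
    print(ip["Ip"], ip["WarmupStatus"])
# SNDS metrics for each address land in CloudWatch automatically; query them
# through the CloudWatch console or APIs per the SES documentation.
```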
SES supports DIP-M IP observability in all AWS Regions where SES is available.
For more information about DIP-M pools, see the Amazon SES documentation.
On October 21, 2025, Amazon announced quarterly security and critical updates for Amazon Corretto Long-Term Supported (LTS) versions of OpenJDK. Corretto 25.0.1, 21.0.9, 17.0.17, 11.0.29, and 8u472 are now available for download. Amazon Corretto is a no-cost, multi-platform, production-ready distribution of OpenJDK.
This release of Corretto JDK binaries for Generic Linux, Alpine, and macOS includes Async-Profiler, a low-overhead sampling profiler for Java supported by the Amazon Corretto team. Async-Profiler is designed to provide profiling data for CPU time; allocations in the Java heap; native memory allocations and leaks; contended locks; hardware and software performance counters like cache misses, page faults, and context switches; Java method profiling; and much more.
Visit the Corretto home page to download Corretto 25, Corretto 21, Corretto 17, Corretto 11, or Corretto 8. You can also get the updates on your Linux system by configuring a Corretto Apt, Yum, or Apk repository.
Starting today, Amazon EC2 High Memory U7i instances with 6TB of memory (u7i-6tb.112xlarge) are available in the Europe (London) Region. U7i-6tb instances are part of the AWS 7th generation of EC2 instances and are powered by custom 4th Generation Intel Xeon Scalable processors (Sapphire Rapids). U7i-6tb instances offer 6TB of DDR5 memory, enabling customers to scale transaction processing throughput in a fast-growing data environment.
U7i-6tb instances offer 448 vCPUs, support up to 100 Gbps of Amazon Elastic Block Store (EBS) bandwidth for faster data loading and backups, deliver up to 100 Gbps of network bandwidth, and support ENA Express. U7i instances are ideal for customers running mission-critical in-memory databases like SAP HANA, Oracle, and SQL Server.
Amazon CloudWatch Database Insights expands the availability of its on-demand analysis experience to the RDS for SQL Server database engine. CloudWatch Database Insights is a monitoring and diagnostics solution that helps database administrators and developers optimize database performance by providing comprehensive visibility into database metrics, query analysis, and resource utilization patterns. This feature leverages machine learning models to help identify performance bottlenecks during the selected time period, and gives advice on what to do next.
Previously, database administrators had to manually analyze performance data, correlate metrics, and investigate root cause. This process is time-consuming and requires deep database expertise. With this launch, you can now analyze database performance monitoring data for any time period with automated intelligence. The feature automatically compares your selected time period against normal baseline performance, identifies anomalies, and provides specific remediation advice. Through intuitive visualizations and clear explanations, you can quickly identify performance issues and receive step-by-step guidance for resolution. This automated analysis and recommendation system reduces mean-time-to-diagnosis from hours to minutes.
You can get started with this feature by enabling the Advanced mode of CloudWatch Database Insights on your RDS for SQL Server databases using the RDS service console, AWS APIs, the AWS SDK, or AWS CloudFormation. Please refer to the RDS documentation and Aurora documentation for information regarding the availability of Database Insights across different Regions, engines, and instance classes.
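As a sketch, the boto3 call below enables Advanced mode on an instance; the DatabaseInsightsMode parameter follows the RDS ModifyDBInstance API, and the Performance Insights retention shown is an assumption to confirm against the RDS documentation.

```python
# Sketch: switch an RDS for SQL Server instance to Database Insights Advanced mode.
import boto3

rds = boto3.client("rds")
rds.modify_db_instance(
    DBInstanceIdentifier="my-sqlserver-instance",
    DatabaseInsightsMode="advanced",
    EnablePerformanceInsights=True,          # Advanced mode builds on Performance Insights
    PerformanceInsightsRetentionPeriod=465,  # assumed retention prerequisite, in days
    ApplyImmediately=True,
)
```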
Amazon Connect can now automatically initiate follow-up evaluations to analyze specific situations identified during initial evaluations. For example, when an initial customer service evaluation detects customer interest in a product, Amazon Connect can automatically trigger a follow-up evaluation focused on the agent’s sales performance. This enables managers to maintain consistent evaluation standards across agent cohorts and over time, while capturing deeper insights on specific scenarios such as sales opportunities, escalations, and other critical interaction moments.
This feature is available in all regions where Amazon Connect is offered. To learn more, please visit our documentation and our webpage.
Amazon Bedrock Data Automation (BDA) now supports AVI, MKV, and WEBM file formats along with the AV1 and MPEG-4 Visual (Part 2) codecs, enabling you to generate structured insights across a broader range of video content. Additionally, BDA delivers up to 50% faster image processing.
BDA automates the generation of insights from unstructured multimodal content such as documents, images, audio, and videos for your GenAI-powered applications. With support for AVI, MKV, and WEBM formats, you can now analyze content from archival footage, high-quality video archives with multiple audio tracks and subtitles, and web-based and open-source video content. This expanded video format and codec support enables you to process video content directly in the formats your organization uses, streamlining your workflows and accelerating time-to-insight. With faster image processing on BDA, you can extract insights from visual content faster than ever before. You can now analyze larger volumes of images in less time, helping you scale your AI applications and deliver value to your customers more quickly.
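A sketch of submitting one of the newly supported formats, assuming the bedrock-data-automation-runtime client in boto3; all ARNs and bucket names are placeholders, and the exact required fields are in the BDA API reference.

```python
# Sketch: analyze a WEBM video asynchronously with Bedrock Data Automation.
import boto3

bda = boto3.client("bedrock-data-automation-runtime")
bda.invoke_data_automation_async(
    inputConfiguration={"s3Uri": "s3://amzn-s3-demo-bucket/input/video.webm"},
    outputConfiguration={"s3Uri": "s3://amzn-s3-demo-bucket/output/"},
    dataAutomationConfiguration={
        "dataAutomationProjectArn": "arn:aws:bedrock:us-west-2:111122223333:data-automation-project/my-project"
    },
    dataAutomationProfileArn="arn:aws:bedrock:us-west-2:111122223333:data-automation-profile/us.data-automation-v1",
)
```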
Amazon Bedrock Data Automation is available in 8 AWS Regions: Europe (Frankfurt), Europe (London), Europe (Ireland), Asia Pacific (Mumbai), Asia Pacific (Sydney), US West (Oregon), US East (N. Virginia), and AWS GovCloud (US-West).
Amazon Nova models now support the customization of content moderation settings for approved business use cases that require processing or generating sensitive content.
Organizations with approved business use cases can adjust content moderation settings across four domains: safety, sensitive content, fairness, and security. These controls allow customers to adjust the specific settings relevant to their business requirements. Amazon Nova enforces essential, non-configurable controls to ensure responsible use of AI, such as controls to prevent harm to children and preserve privacy.
Customization of content moderation settings is available for Amazon Nova Lite and Amazon Nova Pro in the US East (N. Virginia) region.
To learn more about Amazon Nova, visit the Amazon Nova product page; to learn about Amazon Nova’s responsible use of AI, visit the AWS AI Service Cards or see the User Guide. To see if your business model is appropriate to customize content moderation settings, contact your AWS Account Manager.
Amazon Elastic Container Service (Amazon ECS) now supports AWS CloudTrail data events, providing detailed visibility into Amazon ECS Agent API activities. This new capability enables customers to monitor, audit, and troubleshoot container instance operations.
With CloudTrail data event support, security and operations teams can now maintain comprehensive audit trails of ECS Agent API activities, detect unusual access patterns, and troubleshoot agent communication issues more effectively. Customers can opt in to receive detailed logging through the new data event resource type AWS::ECS::ContainerInstance for ECS agent activities, including when the ECS agent polls for work (ecs:Poll), starts telemetry sessions (ecs:StartTelemetrySession), and submits ECS Managed Instances logs (ecs:PutSystemLogEvents). This enhanced visibility enables teams to better understand how container instance roles are utilized, meet compliance requirements for API activity monitoring, and quickly diagnose operational issues related to agent communications.
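Opting in is a standard CloudTrail advanced event selector on the new resource type, as in this boto3 sketch; the trail name is a placeholder.

```python
# Sketch: enable ECS Agent API data events on an existing trail.
import boto3

cloudtrail = boto3.client("cloudtrail")
cloudtrail.put_event_selectors(
    TrailName="my-trail",
    AdvancedEventSelectors=[
        {
            "Name": "ECS agent API activity",
            "FieldSelectors": [
                {"Field": "eventCategory", "Equals": ["Data"]},
                {"Field": "resources.type", "Equals": ["AWS::ECS::ContainerInstance"]},
            ],
        }
    ],
)
```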
This new feature is available for Amazon ECS on EC2 in all AWS Regions and ECS Managed Instances in select regions. Standard CloudTrail data event charges apply. To learn more, visit the Developer Guide.
AI agents are now a reality, moving beyond chatbots to understand intent, collaborate, and execute complex workflows. This leads to increased efficiency, lower costs, and improved customer and employee experiences. It is a key opportunity for System Integrator (SI) Partners to deliver Google Cloud’s advanced AI to more customers. This post details how SI Partners can use Google Cloud AI products to build, scale, and manage enterprise-grade agentic systems and offer these transformative solutions to enterprise clients.
Enterprise challenges
The limitations of traditional, rule-based automation are becoming increasingly apparent in the face of today’s complex business challenges. Its inherent rigidity often leads to protracted approval processes, outdated risk models, and a critical lack of agility, thereby impeding the ability to seize new opportunities and respond effectively to operational demands.
These challenges are further compounded in modern enterprises by fragmented IT landscapes, characterized by legacy systems and siloed data, which collectively hinder seamless integration and scalable growth. Furthermore, static systems are ill-equipped to adapt instantaneously to market volatility or unforeseen “black swan” events. They also fall short in delivering the personalization and operational optimization required to manage escalating complexity—such as in cybersecurity and resource allocation—at scale. In this dynamic environment, AI agents offer the necessary paradigm shift to overcome these persistent limitations.
How SI Partners are solving business challenges with AI agents
Let’s discuss how SIs are working with Google Cloud to solve some of these business challenges:
Deloitte: A major retail client sought to enhance inventory accuracy and streamline reconciliation across its diverse store locations. The client needed various users—Merchants, Supply Chain, Marketing, and Inventory Controls—to interact with inventory data through natural language prompts. This interaction would enable them to check inventory levels, detect anomalies, research reconciliation data, and execute automated actions.
Deloitte leveraged Google Cloud AI Agents and Gemini Enterprise to create a solution that generates insights, identifies discrepancies, and offers actionable recommendations based on inventory data. This solution utilizes Agentic AI to integrate disparate data sources and deliver real-time recommendations, ultimately aiming to foster trust and confidence in the underlying inventory data.
Quantiphi: To improve customer experience and optimize sales operations, a furniture manufacturer partnered with Quantiphi to deploy generative AI and create a dynamic, intelligent assistant on Google Cloud. The multi-agent system automates quotation response creation, significantly accelerating the process. At its core is an orchestrator, built with the Agent Development Kit (ADK) and an Agent-to-Agent (A2A) framework, that seamlessly coordinates between agents to compose the right response, whether the user is researching market trends, asking about product details, or analyzing sales data. Leveraging the capabilities of Google Cloud’s Gemini models and BigQuery, the assistant delivers insights that transform how users access data and make decisions.
These examples represent just a fraction of the numerous use cases spanning diverse industry verticals, including healthcare, manufacturing, and financial services, that are being deployed in the field by SIs working in close collaboration with Google Cloud.
Architecture and design patterns used by SIs
The strong partnership between Google Cloud and SIs is instrumental in delivering true business value to customers. Let’s examine the scalable architecture patterns employed by Google Cloud SIs in the field to tackle Agentic AI challenges.
To comprehend Agentic AI architectures, it’s crucial to first understand what an AI agent is. An AI agent is a software entity with the capacity to plan, reason, and execute complex actions for users with minimal human intervention. AI agents leverage advanced AI models for reasoning and informed decision-making, while utilizing tools to fetch data from external sources for real-time, grounded information. Agents typically operate within a compute runtime. The visual diagram illustrates the basic components of an agent:
Base AI Agent Components
The snippet below demonstrates how an agent’s code appears in the Python programming language:
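(A minimal, illustrative sketch assuming ADK’s Python Agent API; the model name and the tool function are placeholders rather than values from the original snippet.)

```python
from google.adk.agents import Agent

def get_inventory_levels(product: str) -> dict:
    """Placeholder tool: returns inventory data for a product."""
    return {"product": product, "units_in_stock": 120}

root_agent = Agent(
    name="inventory_agent",                     # Name
    model="gemini-2.0-flash",                   # underlying LLM
    description="Answers questions about inventory levels.",
    instruction="Use the provided tools to answer inventory questions.",
    tools=[get_inventory_levels],               # Tools
)
```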
Code snippet of an AI Agent
This agent code snippet showcases the components depicted in the first diagram: the agent has a Name, a Large Language Model (LLM), a Description, an Instruction, and Tools, all of which enable it to perform its designated functions.
To build enterprise-grade agents at scale, several factors must be considered during their ground-up development. Google Cloud has collaborated closely with its Partner ecosystem to employ cutting-edge Google Cloud products to build scalable and enterprise-ready agents.
A key consideration in agent development is the framework. Without it, developers would be compelled to build everything from scratch, including state management, tool handling, and workflow orchestration. This often results in systems that are complex, difficult to debug, insecure, and ultimately unscalable. Google Cloud Agent Development Kit (ADK) provides essential scaffolding, tools, and patterns for efficient and secure enterprise agent development at scale. It offers developers the flexibility to customize agents to suit nearly every applicable use case.
Agent development with any framework, especially multi-agent architectures in enterprises, necessitates robust compute resources and scalable infrastructure. This includes strong security measures, comprehensive tracing, logging, and monitoring capabilities, as well as rigorous evaluation of the agent’s decisions and output.
Furthermore, agents typically lack inherent memory, meaning they cannot recall past interactions or maintain context for effective operation. While frameworks like ADK offer ephemeral memory storage for agents, enterprise-grade agents demand persistent memory. This persistent memory is vital for equipping agents with the necessary context to enhance their performance and the quality of their output.
Google Cloud’s Vertex AI Agent Engine provides a secure runtime for agents that manages their lifecycle, orchestrates tools, and drives reasoning. It features built-in security, observability, and critical building blocks such as a memory bank, session service, and sandbox. Agent Engine is accessible to SIs and customers on Google Cloud. Alternative options for running agents at scale include Cloud Run or GKE.
Customers often opt for these alternatives when they already have existing investments in Cloud Run or GKE infrastructure on Google Cloud, or when they require configuration flexibility concerning compute, storage, and networking, as well as flexible cost management. However, when choosing Cloud Run or GKE, functions like memory and session management must be built and managed from the ground up.
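For orientation, deploying an ADK agent to Agent Engine with the Vertex AI SDK can look roughly like the sketch below; module paths and arguments have shifted across SDK releases, so treat the specifics as assumptions to verify against current documentation:

```python
import vertexai
from vertexai import agent_engines
from vertexai.preview import reasoning_engines
from google.adk.agents import Agent

# Project, location, and staging bucket are placeholders.
vertexai.init(
    project="my-project",
    location="us-central1",
    staging_bucket="gs://my-staging-bucket",
)

root_agent = Agent(
    name="hello_agent",
    model="gemini-2.0-flash",
    instruction="Answer user questions concisely.",
)

# Wrap the ADK agent and deploy it as a managed Agent Engine runtime.
app = reasoning_engines.AdkApp(agent=root_agent, enable_tracing=True)
remote_app = agent_engines.create(
    agent_engine=app,
    requirements=["google-cloud-aiplatform[adk,agent_engines]"],
)
```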
Model Context Protocol (MCP) is a crucial element for modern AI agent architectures. This open protocol standardizes how applications provide context to LLMs, thereby improving agent responses by connecting agents and underlying AI models to various data sources and tools. It’s important to note that Agents also communicate with enterprise systems using APIs, which are referred to as Tools when employed with agents. MCP enables agents to access fresh external data.
When developing enterprise agents at scale, it is recommended to deploy the MCP servers separately on a platform such as Cloud Run or GKE on Google Cloud, with agents running on Agent Engine configured as clients. The sample architecture illustrates the recommended deployment model for MCP integration with ADK agents:
AI agent tool integration with MCP
The reference architecture demonstrates how ADK-built agents can integrate with MCP to connect data sources and provide context to underlying LLM models. MCP utilizes Get, Invoke, List, and Call functions to enable tools to connect agents to external data sources. In this scenario, the agent can interact with a graph database through application APIs using MCP, allowing the agent and the underlying LLM to access up-to-date data for generating meaningful responses.
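On the client side, an ADK agent can attach an MCP toolset pointing at a remotely hosted MCP server, along the lines of the sketch below; the server URL is a placeholder, and the exact import path and connection-parameter class names vary across ADK releases:

```python
from google.adk.agents import Agent
# Import path and class names vary by ADK version; verify against the
# release you use (SseServerParams may be named differently in newer ones).
from google.adk.tools.mcp_tool.mcp_toolset import MCPToolset, SseServerParams

# The MCP server is assumed to run separately, e.g. on Cloud Run.
inventory_tools = MCPToolset(
    connection_params=SseServerParams(url="https://mcp-server.example.com/sse"),
)

agent = Agent(
    name="data_agent",
    model="gemini-2.0-flash",
    instruction="Use the MCP tools to look up inventory data.",
    tools=[inventory_tools],
)
```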
Furthermore, when building multi-agent architectures that demand interoperability and communication among agents from different systems, a key consideration is how to facilitate Agent-to-Agent communication. This addresses complex use cases that require workflow execution across various agents from different domains.
Google Cloud launched the Agent-to-Agent Protocol (A2A) with native support within Agent Engine to tackle the challenge of inter-agent communication at scale. Learn how to implement A2A from this blog.
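For context, an A2A agent card is the machine-readable document an agent publishes to advertise its capabilities. A simplified, illustrative card, expressed here as a Python dict with placeholder values, might look like this:

```python
# Simplified, illustrative A2A agent card; only a subset of the fields
# defined by the A2A specification is shown, and all values are placeholders.
agent_card = {
    "name": "order_agent",
    "description": "Handles inventory checks and order placement.",
    "url": "https://agents.example.com/order",
    "version": "1.0.0",
    "capabilities": {"streaming": True},
    "skills": [
        {
            "id": "check_inventory",
            "name": "Check inventory",
            "description": "Returns current stock levels for a product.",
        }
    ],
}
```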
Google Cloud has collaborated with SIs on agentic architecture and design considerations to build multiple agents, assisting clients in addressing various use cases across industry domains such as Retail, Manufacturing, Healthcare, Automotive, and Financial Services. The reference architecture below consolidates these considerations.
Reference architecture – Agentic AI system with ADK, MCP, A2A and Agent Engine
This reference architecture depicts an enterprise-grade agent built on Google Cloud to address a supply chain use case. In this architecture, all agents are built with the ADK framework and deployed on Agent Engine. Agent Engine provides a secure compute runtime with authentication, context management using managed sessions and memory, and quality assurance through Example Store and Evaluation Services, while also offering observability into the deployed agents. Agent Engine delivers all these features and many more as a managed service at scale on Google Cloud.
This architecture outlines an agentic supply chain featuring an orchestration agent (Root) and three dedicated sub-agents: Tracking, Distributor, and Order agents. Each of these agents is powered by Gemini. For optimal performance and tailored responses, especially in specific use cases, we recommend tuning your model with domain-specific data before integrating it with an agent. Model tuning can also help optimize responses for conciseness, potentially reducing token size and lowering operational costs.
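A minimal sketch of how this hierarchy could be expressed with ADK sub-agents within a single deployment (when agents are deployed separately, A2A handles the routing instead, as described next); names, models, and instructions are placeholders:

```python
from google.adk.agents import Agent

# Sub-agents (tools elided); descriptions let the Root agent route requests.
tracking_agent = Agent(name="tracking_agent", model="gemini-2.0-flash",
                       description="Monitors placed orders until delivery.")
distributor_agent = Agent(name="distributor_agent", model="gemini-2.0-flash",
                          description="Places orders with suppliers.")
order_agent = Agent(name="order_agent", model="gemini-2.0-flash",
                    description="Handles inventory and order operations.")

root_agent = Agent(
    name="root_agent",
    model="gemini-2.0-flash",
    instruction="Route each supply chain request to the most suitable sub-agent.",
    sub_agents=[tracking_agent, distributor_agent, order_agent],
)
```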
For instance, a user might send a request such as “show me the inventory levels for men’s backpacks.” The Root agent receives this request and routes it to the Order agent, which is responsible for inventory and order operations. This routing is seamless because the A2A protocol utilizes agent cards to advertise the capabilities of each respective agent. A2A is configured with a few steps as a wrapper for your agents for Agent Engine deployment.
In this example, inventory and order details are stored in BigQuery. Therefore, the agent uses its tool configuration to leverage the MCP server to fetch the inventory details from the BigQuery data warehouse. The response is then returned to the underlying LLM, which generates a formatted natural language response and provides the inventory details for men’s backpacks to the Root agent and subsequently to the user. Based on this response, the user can, for example, place an order to replenish the inventory.
When such a request is made, the Root agent routes it to the Distributor agent. This agent possesses knowledge of all suppliers who provide stock to the business. Depending on the item being requested, the agent will use its tools to initiate an MCP server connection to the correct external API endpoints for the respective supplier to place the order. If the suppliers have agents configured, the A2A protocol can also be utilized to send the request to the supplier’s agent for processing. Any acknowledgment of the order is then sent back to the Distributor agent.
In this reference architecture, when the Distributor agent receives acknowledgment, A2A enables the agent to detect the presence of a Tracking agent that monitors new orders until delivery. The Distributor agent will pass the order details to the Tracking agent and also send updates back to the user. The Tracking agent will then send order updates to the user via messaging, utilizing the public API endpoint of the supplier. This is merely one example of a workflow that could be built with this reference architecture.
This modular architecture can be adapted to solve various use cases with Agentic AI built with ADK and deployed to Agent Engine.
The reference architecture allows this multi-agent system to be consumed via a chat interface through a website or a custom-built user interface. It is also possible to integrate this agentic AI architecture with Google Cloud Gemini Enterprise.
Learn how enterprises can start by using Gemini Enterprise as the front door to Google Cloud AI in this blog from Alphabet CEO Sundar Pichai. This approach helps enterprises start small with low-code, out-of-the-box agents. As they mature, they can implement complex use cases with advanced, high-code AI agents using this reference architecture.
Getting started
This blog post has explored the design patterns for building intelligent enterprise AI agents. For enterprise decision makers, use the five essential elements for implementing agentic solutions to guide your strategy and decision-making when running enterprise agents at scale.
We encourage you to embark on this journey today by collaborating with the Google Cloud Partner Ecosystem to understand your enterprise landscape and identify complex use cases that can be effectively addressed with AI agents. Use these design patterns as your guide, and leverage the ADK to transform your enterprise use case into a powerful, scalable solution that delivers tangible business value on Agent Engine with Google Cloud.