Amazon ElastiCache now supports Global Datastore in the Asia Pacific (Hong Kong, Hyderabad, Jakarta, Malaysia, Melbourne, Thailand), Africa (Cape Town), Canada West (Calgary), Europe (Milan, Spain, Zurich), Israel (Tel Aviv), Mexico (Central), and Middle East (Bahrain, UAE) Regions. Global Datastore is a feature of ElastiCache that provides fully managed, fast, reliable, and secure cross-Region replication. Using Global Datastore, you can write to your ElastiCache cluster in one Region and have the data available for read in two other cross-Region replica clusters, thereby enabling low-latency reads and disaster recovery across Regions.
Customers use Global Datastore for real-time applications with a global footprint, as it provides cross-Region replication with latency of typically under one second, increasing application responsiveness by providing geo-local reads closer to end users. In the unlikely event of regional degradation, one of the healthy cross-Region replica clusters can be promoted to become the primary cluster with full read and write capabilities. Once initiated, the promotion typically completes in less than a minute, allowing applications to remain available. To secure cross-Region data transfer traffic, Global Datastore uses encryption in transit.
To get started, you can set up a Global Datastore using an existing cluster, or create new clusters and designate a primary (active) cluster. Creating a Global Datastore takes only a few clicks in the AWS Management Console for ElastiCache, or it can be automated with the latest AWS SDK or AWS CLI. For pricing and regional availability, refer to the Amazon ElastiCache pricing page and our documentation.
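As a minimal sketch of the SDK path, a Global Datastore could be created with the AWS SDK for Python (Boto3) along these lines; the cluster names, Regions, and the AWS-generated Global Datastore ID prefix shown are placeholders:
Python
import boto3

# Promote an existing replication group in the primary Region to anchor
# a new Global Datastore (names and Regions are illustrative).
primary = boto3.client("elasticache", region_name="us-east-1")
primary.create_global_replication_group(
    GlobalReplicationGroupIdSuffix="my-global-store",
    PrimaryReplicationGroupId="my-primary-cluster",
)

# Attach a secondary (read-only) cluster in a newly supported Region,
# e.g. Asia Pacific (Hong Kong). The full Global Datastore ID includes
# an AWS-generated prefix returned by the call above.
secondary = boto3.client("elasticache", region_name="ap-east-1")
secondary.create_replication_group(
    ReplicationGroupId="my-secondary-cluster",
    ReplicationGroupDescription="Read replica for my-global-store",
    GlobalReplicationGroupId="ldgnf-my-global-store",  # placeholder ID
)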
Today, AWS End User Messaging introduces new features to help developers combat artificially inflated traffic (AIT), also known as SMS pumping. AIT occurs when bad actors use automated systems or bots to trigger large volumes of SMS messages, leading to unexpected charges that can cost businesses millions of dollars each year.
End User Messaging SMS Protect now allows developers to configure AIT detection rules for entire countries or specific messaging use cases. This granular control helps developers more accurately identify potential abuse while allowing legitimate messages to be delivered. End User Messaging now identifies and blocks messages with potential pumping risk, with results shown in in-console dashboards, CloudWatch metrics, and SMS events.
These new AIT mitigations are available in all AWS Regions where AWS End User Messaging is offered. SMS Protect AIT monitoring and filtering is priced at $0.01 per message in addition to standard SMS fees. Customers are not charged per-message SMS fees for messages blocked by SMS Protect.
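As a rough illustration, country-level protect rules might be configured with Boto3's pinpoint-sms-voice-v2 client along the following lines; the country codes and rule values are placeholders, and the exact request shape should be checked against the current API reference:
Python
import boto3

sms = boto3.client("pinpoint-sms-voice-v2")

# Create a protect configuration to hold the AIT rules (sketch; check
# the current API reference for any required fields).
config = sms.create_protect_configuration()
config_id = config["ProtectConfigurationId"]

# Block SMS to one country and monitor another (codes are placeholders).
sms.update_protect_configuration_country_rule_set(
    ProtectConfigurationId=config_id,
    NumberCapability="SMS",
    CountryRuleSetUpdates={
        "GB": {"ProtectStatus": "BLOCK"},
        "DE": {"ProtectStatus": "MONITOR"},
    },
)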
At Next ’25, we introduced several new innovations within BigQuery, the autonomous data to AI platform. BigQuery ML provides a full range of AI and ML capabilities, enabling you to easily build generative AI and predictive ML applications with BigQuery. The new AI and ML capabilities from BigQuery ML include:
a new state-of-the-art pre-trained forecasting model (TimesFM) which drastically simplifies forecasting problems
support for generating or extracting structured data with large language models (LLMs)
a set of new row-wise inference functions enabling you to mix gen AI processing with standard SQL
expanded model choice with Gemini and OSS models
the general availability of the Contribution Analysis feature, useful for explaining changes in your business metrics
Let us explore these new capabilities.
1. TimesFM forecasting model in BigQuery
Accurate time series forecasting is essential for many business scenarios such as planning, supply chain management, and resource allocation. BigQuery now embeds TimesFM, a state-of-the-art (SOTA) pre-trained model from Google Research, enabling powerful forecasting via the simple AI.FORECAST function. Trained on over 100 billion real-world time-points, TimesFM provides impressive zero-shot forecasting accuracy across various real world domains and at different granularities without requiring you to train or tune on your data.
Key benefits of TimesFM in BigQuery include:
Managed and scalable: A fully managed, highly scalable forecasting engine within BigQuery.
Easy forecasting: Generate forecasts for one or millions of time series in a single query – no model training required.
Here’s a basic example of creating a forecast using the new AI.FORECAST function with TimesFM:
SQL
SELECT * FROM AI.FORECAST(
TABLE dataset.table,
data_col => "data",
timestamp_col => "timestamp",
model => "TimesFM 2.0",
horizon => 30
)
This query forecasts the “data” column for the next 30 time units, using “timestamp” as the time identifier. Please see the documentation for more details.
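The same forecast can also be run from Python with the google-cloud-bigquery client; a minimal sketch, assuming a hypothetical dataset and table of your own:
Python
from google.cloud import bigquery

client = bigquery.Client()

# Forecast the next 30 time units for a hypothetical mydataset.mytable.
sql = """
SELECT *
FROM AI.FORECAST(
  TABLE mydataset.mytable,
  data_col => 'data',
  timestamp_col => 'timestamp',
  model => 'TimesFM 2.0',
  horizon => 30)
"""

for row in client.query(sql).result():
    print(dict(row))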
2. Structured data extraction and generation with LLMs
Extracting structured information consistently from unstructured data such as customer reviews, emails, and logs can be complex. BigQuery’s new AI.GENERATE_TABLE function simplifies structured data extraction and generation using the constrained decoding capabilities of LLMs. The function takes a model, a table of input data, and an output_schema parameter, and returns a table whose schema is determined by that parameter.
Here’s how you can use AI.GENERATE_TABLE:
SQL
SELECT * FROM AI.GENERATE_TABLE(
MODEL project_id.dataset.model,
(SELECT medical_transcripts as prompt from table),
STRUCT("age INT64, medications ARRAY<STRING>" AS output_schema)
)
In this example, the output table has ‘age’ and ‘medications’ columns — no complex parsing required. The output is written as a BigQuery temporary table. To materialize the results to a permanent table, the above query can be used in a DDL statement:
CREATE TABLE project_id.dataset.my_structured_table
AS <AI.GENERATE_TABLE subquery>
3. Row-wise AI functions for LLM inference
The first wave of BigQuery’s LLM functions focused on table-valued functions (TVFs) that output entire tables. We are now introducing row-wise AI functions for LLM inference, enabling more flexible and expressive data manipulation and analysis. These scalar functions enhance the usability of LLMs within BigQuery, as they can be used anywhere a value is needed, such as in SELECT, WHERE, JOIN, and GROUP BY clauses. Let’s go through some of the capabilities we are adding:
a) Basic text generation with AI.GENERATE
First, let’s see how the new AI.GENERATE() function can be used for convenient row-wise LLM inference:
SQL
SELECT
city,
AI.GENERATE(
('Give a short, one sentence description of ', city),
connection_id => 'us.test_connection',
endpoint => 'gemini-2.0-flash').result
FROM mydataset.cities;
b) Structured output with AI.GENERATE
In addition, the structured output generation capabilities introduced above also extend to row-wise AI functions. In this example, the query generates state capitals for a list of states, using the output_schema argument to set two custom fields in the output struct — state and capital:
SQL
SELECT
state,
AI.GENERATE(
('What is the capital of ', state, '?'),
connection_id => 'us.example_connection',
endpoint => 'gemini-2.0-flash',
output_schema => 'state STRING, capital STRING').capital
FROM mydataset.states;
c) Type-specific functions (e.g., AI.GENERATE_BOOL)
For common tasks requiring specific data types like boolean, integer, or float, BigQuery now offers simple, type-specific functions. For instance, you can use AI.GENERATE_BOOL for classification or validation tasks:
SQL
SELECT city.name, AI.GENERATE_BOOL(
("Is", city.name, "in the state of WA?"),
connection_id => "us.example_connection",
endpoint => 'gemini-2.0-flash').result
FROM city
Additional type-specific functions, namely AI.GENERATE_INT and AI.GENERATE_DOUBLE, are also available for generating integer and floating-point results. Please see the documentation for more details.
4. Expanded model choice: Gemini, OSS and third-party
BigQuery ML allows you to use LLMs to perform tasks such as entity extraction, sentiment analysis, translation, text generation, and more on your data using familiar SQL syntax. In addition to first-party Gemini models, BigQuery supports inference with open-source and third-party models, which comes in two flavors:
Customer-managed endpoints for open source models (previously announced): You can host any open source model of your choice on a Vertex AI Model Garden endpoint and then use it from BigQuery.
Model-as-a-service integrations: Access fully managed model endpoints directly through BigQuery. This already includes models like Anthropic’s Claude, and we are excited to announce newly added support for Llama and Mistral models, further expanding the model choice available to developers.
5. Contribution analysis now generally available
Businesses constantly need to answer questions like “Why did our sales drop last month?” or “For what user, device, and demographics combination was our marketing campaign most effective?” Answering these “why” questions accurately is vital but often involves complex manual analysis. The BigQuery contribution analysis feature automates this analysis, helping you pinpoint the key factors (or combinations of factors) responsible for the most significant changes in a metric between the control and test groups you define.
Now generally available, the BigQuery ML contribution analysis release includes enhancements focused on improved interpretability and performance, including:
A new summable by category metric to analyze the sum of a numerical measure of interest normalized by a categorical variable
Top-K Insights by Apriori Support option to automatically fetch k insights with the largest segment size
A redundant insight pruning option, which improves result readability by returning only unique insights
Let’s say you want to understand what drove changes in the average sales per user across various vendors and payment types between the control and test data. To answer this with a contribution analysis model, you tell BigQuery which factors (dimensions) to investigate (dimension_id_cols), what metric you care about (contribution_metric), and which column identifies your test/control groups (is_test_col).
SQL
-- Define the contribution analysis task
CREATE MODEL bqml_tutorial.contribution_analysis_model
OPTIONS (
model_type = 'CONTRIBUTION_ANALYSIS',
dimension_id_cols = ['vendor', 'month', 'payment_type'],
contribution_metric = 'sum(sales)/count(distinct user_id)',
is_test_col = 'is_test_col',
top_k_insights_by_apriori_support = 25,
pruning_method = 'PRUNE_REDUNDANT_INSIGHTS'
) AS
SELECT * FROM dataset.input_data;
Once the model is created, you can use a SQL query like the following to generate insights:
SELECT * FROM ML.GET_INSIGHTS (MODEL bqml_tutorial.contribution_analysis_model);
BigQuery returns a prioritized list showing which combinations of factors (e.g., “Users paying via Amex Credit Card from Vendor”) had the most significant impact on the average sales per user between your control and test groups.
Bring AI into your data
The latest BigQuery ML updates bring powerful AI/ML capabilities directly into your data workflows. Between forecasting with TimesFM, automated root-cause analysis with contribution analysis, flexible row-wise LLM functions, streamlined structured data generation, and expanded model choice, you can move faster from data to insights and impactful outcomes.
AI is fundamentally transforming the compute landscape, demanding unprecedented advances in data center infrastructure. At Google, we believe that physical infrastructure — the power, cooling, and mechanical systems that underpin everything — isn’t just important, but critical to AI’s continued scaling.
We have a long-standing partnership with the Open Compute Project (OCP) that has been instrumental in driving industry collaboration and open innovation in infrastructure. At the 2025 OCP EMEA Summit today, we discussed the power delivery transformation from 48 volts direct current (VDC) to the new +/-400 VDC, which will enable IT racks to scale from 100 kilowatts up to 1 megawatt. We also shared that we’ll contribute our fifth-generation cooling distribution unit, Project Deschutes, to OCP, helping to accelerate adoption of liquid cooling industry-wide.
Transforming power delivery with 1 MW per IT rack
Google has a long history of advancing data center power delivery. Almost 10 years ago, we championed the adoption of 48 VDC inside the IT rack to significantly increase power distribution efficiency and reduce losses compared to typical 12 VDC solutions. The industry responded to our call to collaborate on this technology, and the resulting architecture has worked well, scaling IT racks from 10 kilowatts to 100 kilowatts.
The AI era requires even greater power delivery capabilities for two distinct reasons. The first is simply that ML will require more than 500 kW per IT rack before 2030. The second is the densification of each IT rack, where every millimeter of space in the IT rack is used for tightly interconnected “xPUs” (e.g. GPUs, TPUs, CPUs). This requires a much higher voltage DC power distribution solution, where power components and battery backup are outside of the IT rack.
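To see why the voltage jump matters, recall the basic relationship I = P / V: at a fixed power, raising the distribution voltage cuts conductor current, and resistive loss falls with the square of the current. A quick sketch with nominal voltages (illustrative, not exact rack specifications):
Python
# Conductor current needed to deliver 1 MW at different distribution
# voltages; +/-400 VDC is 800 V pole to pole. Values are illustrative.
P = 1_000_000  # watts

for volts in (48, 400, 800):
    amps = P / volts
    print(f"{volts:>4} V -> {amps:>9,.0f} A")

# 48 V needs ~20,800 A, while 800 V needs ~1,250 A: roughly 17x less
# current, and ~280x lower I^2*R loss in the same conductors.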
We are excited to introduce +/-400 VDC power delivery that can support up to 1 MW per rack. This is about much more than simply increasing power delivery capacity: selecting 400 VDC as the nominal voltage lets us leverage the supply chain established by electric vehicles (EVs), for greater economies of scale, more efficient manufacturing, and improved quality, to name a few benefits. As part of the Mt Diablo project, we are collaborating with Meta and Microsoft at OCP to standardize the electrical and mechanical interfaces, and the 0.5 draft of the specification will be available for industry feedback in May.
The first embodiment of this work is an AC-to-DC sidecar power rack that disaggregates power components from the IT rack. This solution improves end-to-end efficiency by ~3% while enabling the entire IT rack to be used for xPUs. Longer term, we are exploring directly distributing higher-voltage DC power within the data center and to the rack, for even greater power density and efficiency.
+/-400 VDC power delivery: AC-to-DC sidecar power rack
The liquid cooling imperative
The dramatic increase in chip power consumption — from 100W chips to accelerators exceeding 1000W — has made advanced thermal management essential. Packing more powerful chips into racks also creates significant challenges for cooling density. Liquid cooling has emerged as the clear solution, given its superior thermal and hydraulic properties. Water can transport approximately 4000 times more heat per unit volume than air for a given temperature change, while the thermal conductivity of water is roughly 30 times greater than air.
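As a back-of-the-envelope check on that comparison, multiplying standard textbook density and specific-heat values (assumed, near room temperature) puts water’s volumetric heat capacity a few thousand times above air’s, in line with the figure cited above:
Python
# Heat carried per unit volume per kelvin of temperature rise: rho * cp.
# Property values are textbook approximations, not Google design data.
rho_water, cp_water = 997.0, 4186.0   # kg/m^3, J/(kg*K)
rho_air, cp_air = 1.2, 1005.0         # kg/m^3, J/(kg*K)

ratio = (rho_water * cp_water) / (rho_air * cp_air)
print(f"Water carries ~{ratio:,.0f}x more heat per unit volume per kelvin")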
At Google, we’ve deployed liquid cooling at gigawatt scale across more than 2,000 TPU Pods over the past seven years with remarkable uptime, consistently at about 99.999%. Google first used liquid cooling in TPU v3, deployed in 2018. Liquid-cooled ML servers have nearly half the geometrical volume of their air-cooled counterparts because they replace bulky heatsinks with compact cold plates. This allowed us to double chip density and quadruple the size of our liquid-cooled TPU v3 supercomputer compared to the air-cooled TPU v2 generation.
We’ve continued to refine this technology generation over generation, from TPU v3 and TPU v4, through TPU v5, and most recently, Ironwood. Our implementation utilizes in-row coolant distribution units (CDUs) with redundant components and uninterruptible power supplies (UPS) for high availability. These CDUs isolate the rack’s liquid loop from the facility loop, providing a controlled, high-performance cooling system delivered via manifolds, flexible hoses, and cold plates that are directly attached to the high-power chips. In our CDU architecture, named Project Deschutes, the pump and heat exchanger unit is redundant, which is what has enabled us to consistently achieve the above-mentioned fleet-wide CDU availability of ~99.999% since 2020.
We will contribute the fifth-generation Project Deschutes CDU, currently in development, to OCP later this year. This contribution, including system details, specifications, and best practices, is intended to help accelerate the industry’s adoption of liquid cooling at scale. Our insights are drawn from nearly a decade of designing and deploying liquid cooling across four generations of TPUs, and encompass:
Design for high cooling performance
Manufacturing quality
Reliability and uptime
Deployment velocity
Serviceability and operational excellence
Supply ecosystem advancements
Project Deschutes CDU: 4th gen in deployment, 5th gen in concept
Get ready for the next generation of AI
We’re encouraged by the significant strides the industry has made in power delivery and liquid cooling. However, with the accelerating pace of AI hardware development, it’s clear that we must collectively quicken our pace to prepare data centers for what’s next. We’re particularly excited about the potential for rapid industry adoption of +/-400 VDC, facilitated by the upcoming Mt Diablo specification. We also strongly encourage the industry to adopt the Project Deschutes CDU design and leverage our extensive liquid cooling learnings. Together, by embracing these advancements and fostering deeper collaboration, we believe the most impactful innovations are still ahead.
The rise of AI is revolutionizing data management platforms: advanced automation, built-in data intelligence, and AI-powered data management are changing how organizations handle traditional tasks like data ingestion, processing, and governance.
We’re excited to announce that Google was named a Leader in The Forrester Wave™: Data Management for Analytics Platforms, Q2 2025 report. In the report, Google received 5 out of 5, the highest score possible, across 13 different criteria. We believe this is a testament to our strengths in several key areas, particularly in delivering agentic experiences that automate manual tasks and accelerate gen AI use cases, built-in intelligence to unlock new insights from structured and unstructured data, real-time capabilities driving insights to action, and a secure and governed multimodal data foundation with governance across the data-to-AI lifecycle.
According to the report:
Google’s distinctive and forward-thinking vision is to provide a unified, agentic, intelligent, and seamlessly integrated data platform that blends data management, advanced analytics, and AI capabilities at scale. The platform continues to evolve rapidly, focusing on advanced automation, open standards, global scale, self-service, and deeper integration with other Google services. The vendor’s roadmap is exceptionally well-defined, delivering a powerful strategic direction and alignment with AI positioned at its core.
Google placed furthest on Strength of Strategy and received above-average customer feedback in the evaluation, denoted by the halo around Google’s circle. Customers such as Dun & Bradstreet, Shopify, General Mills, and many more choose BigQuery for its autonomous data and AI capabilities when building their data management platforms. Let’s take a closer look at the capabilities that differentiate Google Cloud’s data platform.
Agentic and AI-assisted capabilities to power your analytics
Data management isn’t just about storing and querying data; it’s also about intelligent automation and assistance. As highlighted in our recent announcements from Google Cloud Next 25, BigQuery has evolved into an autonomous data-to-AI platform, where specialized data agents, advanced engines, and business users can all operate on a self-managing multimodal data foundation built for processing and activating all types of data. With assistive capabilities powered by gen AI and integrations with Vertex AI for model building and deployment, you can reduce the complexities of data management and smooth the path from raw data to actionable AI-driven insights.
BigQuery’s AI-powered data management capabilities are designed for users of all skill levels. Data analysts can use natural language to query data, generate SQL, and summarize results. Data engineers can automate manual tasks like data preparation, building data pipelines, and performing anomaly detection to accelerate analytics workflows. Data scientists can use AI-driven notebook experiences and new engines to process complex data and support advanced analyses in real time.
A multimodal data foundation with unified governance
BigQuery helps unify analytics across diverse data types by allowing data teams to build on an open lakehouse foundation. It combines highly performant native data management capabilities with support for open formats like Apache Iceberg, Delta, and Hudi. Multimodal support lets you store and analyze structured and unstructured data within the same table, streamlining complex analytics workflows. Finally, BigQuery’s universal catalog lets you work across SQL, Spark, AI, BI, and third-party engines, all with a flexible and open data lakehouse architecture, supporting interoperability.
Beyond the universal catalog, BigQuery data governance (powered by Dataplex) provides a unified experience for discovering, managing, monitoring, and governing data across data lakes, warehouses, and AI models. It also enables consistent policy enforcement, automated data quality checks, and comprehensive lineage tracking. Combined with a robust security infrastructure and fine-grained access controls, it helps you manage your data and AI assets with confidence, supporting compliance and building trust. Features like managed disaster recovery, enhanced workload management for aligning budget with performance needs, and flexible pricing with spend-based commitments further reinforce enterprise readiness.
Built-in intelligence for real-time insights
BigQuery enables your teams to build and deploy machine learning models using their existing SQL skills. This helps to eliminate complexity and accelerates the adoption of AI across the organization. BigQuery’s integration with advanced AI models helps to extract insights from multimodal data in documents, videos, and images. Scalable vector search supports intelligent recommendations, while the new BigQuery AI query engine allows analysts to use familiar SQL and LLMs for real-world context when analyzing unstructured data.
Real-time data capabilities are important for bringing fresh data to your AI models. BigQuery is designed from the ground up to support high-throughput streaming ingestion, allowing data to be analyzed as soon as it arrives. Real-time data combined with built-in machine learning and AI enables use cases like real-time fraud detection, dynamic personalization, operational monitoring, and immediate response to changing market conditions. Combining real-time data pipelines with Vertex AI allows you to build and deploy models that react instantly, turning real-time data into real-time intelligent action.
Google is your partner for data to AI transformation
Google’s recognition as a Leader in The Forrester Wave™: Data Management For Analytics Platforms, Q2 2025 validates our strategy and execution in delivering a comprehensive, AI-powered platform. Our focus on AI-driven assistance, a multimodal data foundation, and real-time intelligence helps reduce manual data management tasks, so you can accelerate insights and innovate faster.
As we evolve BigQuery into an autonomous data-to-AI platform, we are committed to helping you navigate the complexities of the modern data landscape and lead with data and AI. Thank you, our customers and partners, for choosing BigQuery to power your data management and analytics. Learn more about BigQuery today by visiting our website. Read the full Forrester Wave™: Data Management For Analytics Platforms, Q2 2025 report here.
Forrester does not endorse any company, product, brand, or service included in its research publications and does not advise any person to select the products or services of any company or brand based on the ratings included in such publications. Information is based on the best available resources. Opinions reflect judgment at the time and are subject to change. For more information, read about Forrester’s objectivity here.
The traditional drug discovery process involves massive capital investments, prolonged timelines, and is plagued with daunting failure rates. From initial research to obtaining regulatory approval, bringing a new drug to market can take decades. During this time, many drug candidates that had seemed very promising fail to deliver, either due to inefficacy or safety concerns. Only a small fraction of candidates successfully make it through clinical trials and regulatory hurdles.
Enter SandboxAQ, which is helping researchers explore vast chemical spaces, gain deep insights into molecular interactions, and predict biological outcomes with precision. It does so with cutting-edge computational approaches such as active learning, absolute free energy perturbation solution (AQFEP), generative AI, structural analysis, and predictive data analytics, ultimately reducing drug discovery and development timelines. And it does all this on a cloud-native foundation.
Drug design involves an iterative cycle of designing, synthesizing, and testing molecules referred to as the Design-Make-Test cycle. Many customers approach SandboxAQ during the design phase, often when their computational methods are falling short. By improving and accelerating this part of the cycle, SandboxAQ helps medicinal chemists bring innovative and effective molecules to market. For example, in a project related to neurodegenerative disease, SandboxAQ’s approach expanded chemical space from 250,000 to 5.6 million molecules, achieving a 30-fold increase in hit rate and dramatically accelerating the discovery of candidate molecules.
Cloud-native development for scientific insight
SandboxAQ’s software relies on large-scale computation. To maximize flexibility and scale, the company uses a cloud strategy that includes Google Cloud infrastructure and tools.
The technologies in large-scale virtual screening campaigns need to be agile and scale cost-effectively. Specifically, SandboxAQ engineers need to be able to quickly iterate on scientific code, immediately run that code at scale cost-effectively, and store and organize all of the data it produces.
SandboxAQ achieved a significant boost in efficiency and scalability with Google Cloud infrastructure. They scaled their computational throughput by 100X to leverage tens of thousands of virtual machines (VMs) in parallel. They also improved utilization by reducing idle time by 90%. By consolidating development and deployment on Google Cloud, SandboxAQ streamlined its workflows, from code development and testing to large-scale batch processing and machine-learning model training.
All of SandboxAQ’s development and deployment takes place in the cloud. Code and data live in cloud-based services, and development is done on a cloud-based platform that provides scientists and engineers with self-service VMs with standardized, centrally maintained environments and tools. This is important, because scientific code often requires heavy-duty computing hardware. Scientists have access to hefty 96-core machines, or instances with large GPUs. They can also create new machines with alternate configurations or CPU types, enabling low-friction testing and development across heterogeneous resources.
SandboxAQ scientists and developers manage and access these Bench machines using the company’s `bench` client. They can connect to machines via SSH or use any number of managed tools, for example a browser-based VNC service for instant remote desktop, or JupyterLab for a familiar notebook development flow.
As code becomes ready to run at larger scale, researchers can dispatch parameterized sets of computations as jobs through an internal SandboxAQ tool powered by Batch, a fully managed service to schedule, queue, and execute batch jobs on Google infrastructure. With development and batch runtime environments closely synced, changes can be quickly run at scale. Code developed on bench machines is pushed to GitHub and immediately available for batch execution. Then, as tools are reviewed and merged into `main` of the company’s monorepo, they become automatically available on scientists’ bench machines, and scientists can launch parallel jobs processing millions of molecules on any kind of Google Cloud VM resource in any global zone, using either on-demand or Spot VMs.
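As an illustration of this pattern, a screening job might be submitted with the google-cloud-batch Python client roughly as follows; the project, script, machine type, and sizing values are invented for the sketch:
Python
from google.cloud import batch_v1

client = batch_v1.BatchServiceClient()

# Each task processes one shard of molecules, selected by $BATCH_TASK_INDEX.
runnable = batch_v1.Runnable(
    script=batch_v1.Runnable.Script(
        text="python score_molecules.py --shard=$BATCH_TASK_INDEX"
    )
)

task_group = batch_v1.TaskGroup(
    task_spec=batch_v1.TaskSpec(runnables=[runnable], max_retry_count=2),
    task_count=10_000,  # shards of the molecule library
    parallelism=1_000,  # VMs running concurrently
)

# Spot VMs keep large, fault-tolerant scale-outs cost-effective.
allocation = batch_v1.AllocationPolicy(
    instances=[
        batch_v1.AllocationPolicy.InstancePolicyOrTemplate(
            policy=batch_v1.AllocationPolicy.InstancePolicy(
                machine_type="e2-standard-16",
                provisioning_model=batch_v1.AllocationPolicy.ProvisioningModel.SPOT,
            )
        )
    ]
)

job = batch_v1.Job(
    task_groups=[task_group],
    allocation_policy=allocation,
    logs_policy=batch_v1.LogsPolicy(
        destination=batch_v1.LogsPolicy.Destination.CLOUD_LOGGING
    ),
)

client.create_job(
    parent="projects/my-project/locations/us-central1",
    job=job,
    job_id="virtual-screening-run",
)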
SandboxAQ’s implementation of a globally resolved transitive dependency tree enables simple package and dependency management. With this practice, Google Batch can seamlessly integrate with individual tools developed by engineers to train many instances of a model in parallel.
Machine learning is a core component of SandboxAQ’s strategy, making easy data access especially important. At the same time, SandboxAQ’s Drug Discovery team also works with clients who have sensitive data. To secure customers’ data, bench and batch workloads read and write data through a unified interface managed via IAM, allowing granular control of different data sources within the organization.
Meanwhile, Google Cloud services like Cloud Logging, Cloud Monitoring, Compute Engine and Cloud Run make it simple to develop tools to monitor these workloads, easily surface logs to SandboxAQ scientists, and comb through huge amounts of output data. As new features are tested or bugs show up, changes are made immediately available to the scientific team, without having to wrangle infrastructure. Then, as code becomes stable, they can incorporate it into downstream production applications, all in a centrally secured, unified way on Google Cloud.
In short, having a unified development, batch compute, and production environment on Google Cloud reduces the friction SandboxAQ faces to develop new workloads and run them at scale. With shared environments for scientific workload development and engineering, SandboxAQ makes it quick and easy for customers to move from experimentation to production, delivering the results customers want, fast.
SandboxAQ solution in the real world
SandboxAQ is already having a profound impact on drug discovery programs targeting a range of hard-to-treat diseases. For example, there are advanced collaborations with Professor Stanley Prusiner’s lab at the University of California, San Francisco (UCSF), Riboscience, Sanofi, and the Michael J. Fox Foundation, to name a few. With this approach built on Google Cloud, SandboxAQ has achieved a superior hit rate compared to other methods like high-throughput screening, demonstrating the transformative potential of SandboxAQ for drug discovery and bringing cures to patients faster.
The first models in the new Llama 4 herd of models—Llama 4 Scout 17B and Llama 4 Maverick 17B—are now available fully managed in Amazon Bedrock. You can power your applications with Llama 4 through Amazon Bedrock’s fully managed service via a single API. These advanced multimodal models empower you to build more tailored applications that respond to multiple types of media. Llama 4 offers improved performance at lower cost compared to Llama 3, with expanded language support for global applications. Featuring mixture-of-experts (MoE) architecture, these models deliver efficient multimodal processing for text and image inputs, improved compute efficiency, and enhanced AI safety measures.
According to Meta, the smaller Llama 4 Scout 17B model is the best multimodal model in the world in its class and more powerful than Meta’s Llama 3 models. Scout is a general-purpose model with 17 billion active parameters, 16 experts, and 109 billion total parameters that delivers state-of-the-art performance for its class. Scout significantly increases the context length from 128K tokens in Llama 3 to an industry-leading 10 million tokens. This enables many practical applications, including multi-document summarization, parsing extensive user activity for personalized tasks, and reasoning over vast code bases. Llama 4 Maverick 17B is a general-purpose model that features 128 experts, 400 billion total parameters, and a 1 million token context length. It excels in image and text understanding across 12 languages, making it suitable for versatile assistant and chat applications.
Meta’s Llama 4 models are available in Amazon Bedrock in the US East (N. Virginia) and US West (Oregon) AWS Regions. You can also access Llama 4 in US East (Ohio) via cross-region inference. To learn more, read the launch blog, product page, Amazon Bedrock pricing, and documentation. To get started with Llama 4 in Amazon Bedrock, visit the Amazon Bedrock console.
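A minimal sketch of invoking Llama 4 Scout through the Bedrock Converse API with Boto3 follows; the model ID shown is illustrative, so check the Bedrock console for the exact identifier available in your Region:
Python
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock.converse(
    modelId="us.meta.llama4-scout-17b-instruct-v1:0",  # assumed model ID
    messages=[
        {"role": "user",
         "content": [{"text": "Summarize this product announcement in two sentences."}]}
    ],
    inferenceConfig={"maxTokens": 256, "temperature": 0.5},
)

print(response["output"]["message"]["content"][0]["text"])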
At Google Cloud Next 25, we expanded the availability of Gemini in Looker, including Conversational Analytics, to all Looker platform users, redefining how line-of-business employees can rapidly gain access to trusted, data-driven insights through natural language. Because of the complexity inherent in traditional business intelligence products, which involve steep learning curves or require advanced SQL knowledge, many potential users who could benefit from BI tools simply don’t use them. But with the convergence of AI and BI, the opportunity to ask questions and chat with your data using natural language breaks down the barriers that have long stood in the way.
Conversational Analytics from Looker is designed to make BI more simple and approachable, democratizing data access, enabling users to ask data-related queries in plain, everyday language, and go beyond static dashboards that often don’t answer all potential questions. In response, users receive accurate and relevant answers derived from Looker Explores or BigQuery tables, without needing to know SQL or specific data tools.
For data analysts, this means fewer support tickets and interruptions, so they can focus on higher-priority work. Business users can now run their own data queries and get answers, enabling trusted self-service by putting the controls in the hands of the users who need answers most. Now, instead of struggling with field names and date formats, users can simply ask questions like “What were our top-performing products last quarter?” or “Show me the trend of website traffic over the past six months.” Additionally, when using Conversational Analytics with Looker Explores, users can be sure tables are joined consistently and metrics are calculated the same way every time.
With Conversational Analytics, ask questions of your data and get AI-driven insights.
Conversational Analytics in Looker is designed to be simple, helpful, and easy to use, offering:
Trusted, consistent results: Conversational Analytics only uses fields defined by your data experts in LookML. Once the fields are selected, they are deterministically translated to SQL by Looker, the same way every time.
Transparency with “How was this calculated?”: This feature provides a clear, natural language explanation of the underlying query that generated the results, presented in easy-to-understand bullet points.
A deeper dive with follow-up questions: Just like a natural conversation, users can ask follow-up questions to explore the data further. For example, users can ask to filter a result to a specific region, change the timeframe of the date filter, or switch from a bar graph to an area chart. Conversational Analytics allows for seamless iteration and deeper exploration of the data.
Hidden insights with Gemini: Once the initial query results are displayed, users can click the “Insights” button to ask Gemini to analyze the data results and generate additional insights about patterns and trends they might have otherwise missed.
Empowering data analysts and developers
With the release of Conversational Analytics, our goal is to benefit data analysts and developers in addition to line-of-business teams. The Conversational Analytics agent lets data analysts provide crucial context and instructions to Gemini, enhancing its ability to answer business users’ questions effectively and empowering analysts to map business jargon to specific fields, specify the best fields for filtering, and define custom calculations.
Analysts can further curate the experience by creating agents for specific use cases. When business users select an agent, they can feel confident that they are interacting with the right data source.
As announced at Next 25, the Conversational Analytics API will power Conversational Analytics across multiple first-party Google Cloud experiences and third-party products, including customer applications, chat apps, Agentspace, and BigQuery, bringing the benefits of natural language queries to the applications where you work every day. Later this year we’ll also bring Conversational Analytics into Looker Dashboards, allowing users to chat with their data in that familiar interface, whether inside Looker or embedded in other applications. Also, if you’re interested in solving even more complex problems while chatting with your data, you can try our new Code Interpreter (available in preview), which uses Python rather than SQL to perform advanced analysis like cohort analysis and forecasting. With the Conversational Analytics Code Interpreter, you can tackle data science tasks without learning advanced coding or statistical methods. Sign up for access here.
Expanding the reach of AI for BI
Looker Conversational Analytics is a step forward in making BI accessible to a wider audience. By removing the technical barriers and providing an intuitive, conversational interface, Looker is empowering more business users to leverage data in their daily routines. With Conversational Analytics available directly in Looker, organizations can now make data-driven insights a reality for everyone. Start using Conversational Analytics today in your Looker instance.
Written by: Casey Charrier, James Sadowski, Clement Lecigne, Vlad Stolyarov
Executive Summary
Google Threat Intelligence Group (GTIG) tracked 75 zero-day vulnerabilities exploited in the wild in 2024, a decrease from the number we identified in 2023 (98 vulnerabilities), but still an increase from 2022 (63 vulnerabilities). We divided the reviewed vulnerabilities into two main categories: end-user platforms and products (e.g., mobile devices, operating systems, and browsers) and enterprise-focused technologies, such as security software and appliances.
Vendors continue to drive improvements that make some zero-day exploitation harder, demonstrated by both dwindling numbers across multiple categories and reduced observed attacks against previously popular targets. At the same time, commercial surveillance vendors (CSVs) appear to be increasing their operational security practices, potentially leading to decreased attribution and detection.
We see zero-day exploitation targeting a greater number and wider variety of enterprise-specific technologies, although these technologies still remain a smaller proportion of overall exploitation when compared to end-user technologies. While the historic focus on the exploitation of popular end-user technologies and their users continues, the shift toward increased targeting of enterprise-focused products will require a wider and more diverse set of vendors to increase proactive security measures in order to reduce future zero-day exploitation attempts.
Scope
This report describes what Google Threat Intelligence Group (GTIG) knows about zero-day exploitation in 2024. We discuss how targeted vendors and exploited products drive trends that reflect threat actor goals and shifting exploitation approaches, and then closely examine several examples of zero-day exploitation from 2024 that demonstrate how actors use both historic and novel techniques to exploit vulnerabilities in targeted products. The following content leverages original research conducted by GTIG, combined with breach investigation findings and reporting from reliable open sources, though we cannot independently confirm the reports of every source. Research in this space is dynamic and the numbers may adjust due to the ongoing discovery of past incidents through digital forensic investigations. The numbers presented here reflect our best understanding of current data.
GTIG defines a zero-day as a vulnerability that was maliciously exploited in the wild before a patch was made publicly available. GTIG acknowledges that the trends observed and discussed in this report are based on detected and disclosed zero-days. Our analysis represents exploitation tracked by GTIG but may not reflect all zero-day exploitation.
For the full analysis, download the report: A 2024 Zero-Day Exploitation Analysis (https://services.google.com/fh/files/misc/2024-zero-day-exploitation-analysis-en.pdf)
Key Takeaways
Zero-day exploitation continues to grow gradually. The 75 zero-day vulnerabilities exploited in 2024 follow a pattern that has emerged over the past four years. While individual year counts have fluctuated, the average trendline indicates that the rate of zero-day exploitation continues to grow at a slow but steady pace.
Enterprise-focused technology targeting continues to expand. GTIG continued to observe an increase in adversary exploitation of enterprise-specific technologies throughout 2024. In 2023, 37% of zero-day vulnerabilities targeted enterprise products. This jumped to 44% in 2024, primarily fueled by the increased exploitation of security and networking software and appliances.
Attackers are increasing their focus on security and networking products. Zero-day vulnerabilities in security software and appliances were a high-value target in 2024. We identified 20 security and networking vulnerabilities, which was over 60% of all zero-day exploitation of enterprise technologies. Exploitation of these products, compared to end-user technologies, can more effectively and efficiently lead to extensive system and network compromises, and we anticipate adversaries will continue to increase their focus on these technologies.
Vendors are changing the game. Vendor investments in exploit mitigations are having a clear impact on where threat actors are able to find success. We are seeing notable decreases in zero-day exploitation of some historically popular targets such as browsers and mobile operating systems.
Actors conducting cyber espionage still lead attributed zero-day exploitation. Between government-backed groups and customers of commercial surveillance vendors (CSVs), actors conducting cyber espionage operations accounted for over 50% of the vulnerabilities we could attribute in 2024. People’s Republic of China (PRC)-backed groups exploited five zero-days, and customers of CSVs exploited eight, continuing their collective leading role in zero-day exploitation. For the first time, we also attributed as many 2024 zero-days (five) to North Korean actors mixing espionage and financially motivated operations as we did to PRC-backed groups.
Looking at the Numbers
GTIG tracked 75 exploited-in-the-wild zero-day vulnerabilities that were disclosed in 2024. This number appears consistent with a consolidating upward trend we have observed over the last four years. After an initial spike in 2021, yearly counts have fluctuated but have not returned to pre-2021 levels.
While there are multiple factors involved in discovery of zero-day exploitation, we note that continued improvement and ubiquity of detection capabilities along with more frequent public disclosures have both resulted in larger numbers of detected zero-day exploitation compared to what was observed prior to 2021.
Figure 1: Zero-days by year
A higher share than in any previous year, 44% (33 vulnerabilities) of the zero-days tracked in 2024 affected enterprise technologies, continuing the growth and trends we observed last year. The remaining 42 zero-day vulnerabilities targeted end-user technologies.
Enterprise Exploitation Expands in 2024 as Browser and Mobile Exploitation Drops
End-User Platforms and Products
In 2024, 56% (42) of the tracked zero-days targeted end-user platforms and products, which we define as devices and software that individuals use in their day-to-day life, although we acknowledge that enterprises also often use these. All of the vulnerabilities in this category were used to exploit browsers, mobile devices, and desktop operating systems.
Zero-day exploitation of browsers and mobile devices fell drastically, decreasing by about a third for browsers and by about half for mobile devices compared to what we observed last year (17 to 11 for browsers, and 17 to 9 for mobile).
Chrome was the primary focus of browser zero-day exploitation in 2024, likely reflecting the browser’s popularity among billions of users.
Exploit chains made up of multiple zero-day vulnerabilities continue to be almost exclusively (~90%) used to target mobile devices.
Third-party components continue to be exploited in Android devices, a trend we discussed in last year’s analysis. In 2023, five of the seven zero-days exploited in Android devices were flaws in third-party components. In 2024, three of the seven zero-days exploited in Android were found in third-party components. Third-party components are likely perceived as lucrative targets for exploit development since they can enable attackers to compromise many different makes and models of devices across the Android ecosystem.
2024 saw an increase in the total number of zero-day vulnerabilities affecting desktop operating systems (OSs) (22 in 2024 vs. 17 in 2023), indicating that OSs continue to be a strikingly large target. The proportional increase was even greater, with OS vulnerabilities making up just 17% of total zero-day exploitation in 2023, compared to nearly 30% in 2024.
Microsoft Windows exploitation continued to increase, climbing from 13 zero-days in 2022, to 16 in 2023, to 22 in 2024. As long as Windows remains a popular choice both in homes and professional settings, we expect that it will remain a popular target for both zero-day and n-day (i.e. a vulnerability exploited after its patch has been released) exploitation by threat actors.
Figure 2: Zero-days in end-user products in 2023 and 2024
Enterprise Technologies
In 2024, GTIG identified the exploitation of 33 zero-days in enterprise software and appliances. We consider enterprise products to include those mainly used by businesses or in a business environment. While the absolute number is slightly lower than what we saw in 2023 (36 vulnerabilities), the proportion of enterprise-focused vulnerabilities rose from 37% in 2023 to 44% in 2024. Twenty of the 33 enterprise-focused zero-days targeted security and network products, a slight increase from the 18 observed in this category in 2023 and a roughly nine-percentage-point increase as a share of total zero-days for the year.
The variety of targeted enterprise products continues to expand across security and networking products, with notable targets in 2024 including Ivanti Cloud Services Appliance, Palo Alto Networks PAN-OS, Cisco Adaptive Security Appliance, and Ivanti Connect Secure VPN. Security and network tools and devices are designed to connect widespread systems and devices with high permissions required to manage the products and their services, making them highly valuable targets for threat actors seeking efficient access into enterprise networks. Endpoint detection and response (EDR) tools are not usually equipped to work on these products, limiting available capabilities to monitor them. Additionally, exploit chains are not generally required to exploit these systems, giving extensive power to individual vulnerabilities that can single-handedly achieve remote code execution or privilege escalation.
Over the last several years, we have also tracked a general increase in the number of enterprise vendors targeted. In 2024, we identified 18 unique enterprise vendors targeted by zero-days. While this number is slightly lower than the 22 observed in 2023, it remains higher than all prior years’ counts, and enterprise vendors made up a strikingly large share of all targeted vendors: 18 of the 20 total vendors targeted in 2024, compared with 22 of 23 in 2023.
Figure 3: Number of unique enterprise vendors targeted
The proportion of zero-days exploited in enterprise devices in 2024 reinforces a trend that suggests that attackers are intentionally targeting products that can provide expansive access and fewer opportunities for detection.
Exploitation by Vendor
The vendors affected by multiple 2024 zero-day vulnerabilities generally fell into two categories: big tech (Microsoft, Google, and Apple) and vendors who supply security and network-focused products. As expected, big tech took the top two spots, with Microsoft at 26 and Google at 11. Apple slid to the fourth most frequently exploited vendor this year, with detected exploitation of only five zero-days. Ivanti was third most frequently targeted with seven zero-days, reflecting increased threat actor focus on networking and security products. Ivanti’s placement in the top three reflects a new and crucial change, where a security vendor was targeted more frequently than a popular end-user technology-focused vendor. We discuss in a following section how PRC-backed exploitation has focused heavily on security and network technologies, one of the contributing factors to the rise in Ivanti targeting.
We note that exploitation is not necessarily reflective of a vendor’s security posture or software development processes, as targeted vendors and products depend on threat actor objectives and capabilities.
Types of Exploited Vulnerabilities
Threat actors continued to utilize zero-day vulnerabilities primarily for the purposes of gaining remote code execution and elevating privileges. In 2024, these consequences accounted for over half (42) of total tracked zero-day exploitation.
Three vulnerability types were most frequently exploited. Use-after-free vulnerabilities have maintained their prevalence over many years, with eight in 2024, and are found in a variety of targets including hardware, low-level software, operating systems, and browsers. Command injection (also at eight, including OS command injection) and cross-site scripting (XSS) (six) vulnerabilities were also frequently exploited in 2024. The code and command injection vulnerabilities were observed almost entirely in networking and security software and appliances, reflecting attackers’ intent to use these flaws to gain control over larger systems and networks. The XSS vulnerabilities were used to target a variety of products, including mail servers, enterprise software, browsers, and an OS.
All three of these vulnerability types stem from software development errors, and preventing them requires meeting higher programming standards. Safe and preventative coding practices, including but not limited to code reviews, updating legacy codebases, and using up-to-date libraries, can appear to hinder production timelines. However, the patches that follow exploitation prove that these security exposures could have been prevented in the first place with proper intention and effort, ultimately reducing the overall effort needed to maintain a product or codebase.
Who Is Driving Exploitation
Figure 4: 2024 attributed zero-day exploitation
Due to the stealthy access zero-day vulnerabilities can provide into victim systems and networks, they remain a highly sought-after capability for threat actors. GTIG tracked a variety of threat actors exploiting zero-days in a variety of products in 2024, consistent with our previous observation that zero-day exploitation has diversified in both the platforms targeted and the actors exploiting them. We attributed the exploitation of 34 zero-day vulnerabilities in 2024, just under half of the 75 total we identified. While the proportion of exploitation we could attribute to a threat actor dipped slightly from our 2023 analysis, it is still significantly higher than the ~30% we attributed in 2022. This reinforces our previous observation that platforms’ investments in exploit mitigations are making zero-days harder to exploit, even as the security community slowly improves its ability to identify that activity and attribute it to threat actors.
Consistent with trends observed in previous years, we attributed the highest volume of zero-day exploitation to traditional espionage actors, nearly 53% (18 vulnerabilities) of total attributed exploitation. Of these 18, we attributed the exploitation of 10 zero-days to likely nation-state-sponsored threat groups and eight to CSVs.
CSVs Continue to Increase Access to Zero-Day Exploitation
While we still expect government-backed actors to continue their historic role as major players in zero-day exploitation, CSVs now contribute a significant volume of zero-day exploitation. Although the total count and proportion of zero-days attributed to CSVs declined from 2023 to 2024, likely in part due to their increased emphasis on operational security practices, the 2024 count is still substantially higher than the count from 2022 and years prior. Their role further demonstrates the expansion of the landscape and the increased access to zero-day exploitation that these vendors now provide other actors.
In 2024, we observed multiple exploitation chains using zero-days developed by forensic vendors that required physical access to a device (CVE-2024-53104, CVE-2024-32896, CVE-2024-29745, CVE-2024-29748). These bugs allow attackers to unlock the targeted mobile device with custom malicious USB devices. For instance, GTIG and Amnesty International’s Security Lab discovered and reported on CVE-2024-53104 in exploit chains developed by forensic company Cellebrite and used against the Android phone of a Serbian student and activist by Serbian security services. GTIG worked with Android to patch these vulnerabilities in the February 2025 Android security bulletin.
PRC-Backed Exploitation Remains Persistent
PRC threat groups remained the most consistent government-backed espionage developers and users of zero-days in 2024. We attributed nearly 30% (five vulnerabilities) of traditional espionage zero-day exploitation to PRC groups, including the exploitation of zero-day vulnerabilities in Ivanti appliances by UNC5221 (CVE-2023-46805 and CVE-2024-21887), which GTIG reported on extensively. During this campaign, UNC5221 chained multiple zero-day vulnerabilities together, highlighting these actors’ willingness to expend resources to achieve their apparent objectives. All five of the vulnerabilities we attributed to PRC groups exclusively targeted security and networking technologies, continuing a trend we have observed from PRC groups for several years across all their operations, not just zero-day exploitation.
North Korean Actors Mix Financially Motivated and Espionage Zero-Day Exploitation
For the first time since we began tracking zero-day exploitation in 2012, North Korean state actors tied with PRC-backed groups in 2024 for the highest total number of attributed zero-days exploited (five vulnerabilities). North Korean groups are notorious for overlaps in targeting scope; tactics, techniques, and procedures (TTPs); and tooling, which demonstrate how various intrusion sets support the operations of other activity clusters and mix traditional espionage operations with attempts to fund the regime. Their zero-day exploitation in 2024 marks a significant increase in these actors’ investment in this capability. North Korean threat actors exploited two zero-day vulnerabilities in Chrome and three in Windows products.
In October 2024, it was publicly reported that APT37 exploited a zero-day vulnerability in Microsoft products. The threat actors reportedly compromised an advertiser to serve malicious advertisements to South Korean users that would trigger zero-click execution of CVE-2024-38178 to deliver malware. Although we have not yet corroborated the group’s exploitation of CVE-2024-38178 as reported, we have observed APT37 previously exploit Internet Explorer zero-days to enable malware distribution.
North Korean threat actors also reportedly exploited a zero-day vulnerability in the Windows AppLocker driver (CVE-2024-21338) to gain kernel-level access and turn off security tools. This technique abuses a legitimate, trusted, but vulnerable driver that is already installed on the device to bypass kernel-level protections, giving threat actors an effective means to evade and disable EDR systems.
Non-State Exploitation
In 2024, we linked almost 15% (five vulnerabilities) of attributed zero-days to non-state financially motivated groups, including a suspected FIN11 cluster’s exploitation of a zero-day vulnerability in multiple Cleo managed file transfer products (CVE-2024-55956) to conduct data theft extortion. This marks the third year of the last four (2021, 2023, and 2024) in which FIN11 or an associated cluster has exploited a zero-day vulnerability in its operations, almost exclusively in file transfer products. Despite the otherwise varied cast of financially motivated threat actors exploiting zero-days, FIN11 has consistently dedicated the resources and demonstrated the expertise to identify or acquire, and then exploit, these vulnerabilities across multiple vendors’ products.
We attributed two additional zero-days exploited in 2024 to non-state groups with mixed motivations, which conduct financially motivated activity in some operations and espionage in others. Both vulnerabilities (CVE-2024-9680 and CVE-2024-49039, detailed in the next section) were exploited as zero-days by CIGAR (also tracked as UNC4895, and publicly reported as RomCom), a group that conducts financially motivated operations alongside espionage likely on behalf of the Russian government, an assessment based in part on its highly specific targeting of Ukrainian and European government and defense organizations.
A Zero-Day Spotlight on CVE-2024-44308, CVE-2024-44309, and CVE-2024-49039: A look into zero-days discovered by GTIG researchers
Spotlight #1: Stealing Cookies with WebKit
On Nov. 12, 2024, GTIG detected a potentially malicious piece of JavaScript code injected at https://online.da.mfa.gov[.]ua/wp-content/plugins/contact-form-7/includes/js/index.js?ver=5.4. The JavaScript was loaded directly from the main page of the website of the Diplomatic Academy of Ukraine, online.da.mfa.gov.ua. Upon further analysis, we discovered that the JavaScript was a WebKit exploit chain specifically targeting macOS users on Intel hardware.
The exploit consisted of a WebKit remote code execution (RCE) vulnerability (CVE-2024-44308) that leveraged a logic error in the Just-In-Time (JIT) compiler, followed by a data isolation bypass (CVE-2024-44309). The RCE employed simple, long-established, and publicly documented JavaScriptCore exploitation techniques, namely:
Setting up addrof/fakeobj primitives using the vulnerability
Leaking StructureID
Building a fake TypedArray to gain arbitrary read/write
JIT compiling a function to obtain an RWX memory mapping where shellcode can be written and executed
The shellcode traversed a set of pointers and vtables to find and call WebCookieJar::cookieRequestHeaderFieldValue with an empty firstPartyForCookies parameter, allowing the threat actor to access cookies of any arbitrary website passed as the third parameter to cookieRequestHeaderFieldValue.
The end goal of the exploit was to collect users’ cookies in order to access login.microsoftonline.com. The cookie values were appended directly to a GET request sent to https://online.da.mfa.gov.ua/gotcookie?.
This is not the first time we have seen threat actors stay within the browser to collect users’ credentials. In March 2021, a targeted campaign used a zero-day against WebKit on iOS to turn off Same-Origin-Policy protections in order to collect authentication cookies from several popular websites. In August 2024, a watering hole on various Mongolian websites used Chrome and Safari n-day exploits to exfiltrate users’ credentials.
While it is unclear why this abbreviated approach was taken as opposed to deploying full-chain exploits, we identified several possibilities, including:
The threat actor was unable to obtain all the pieces needed for a full exploit chain. In this case, the exploit likely targeted only the MacIntel platform because the actor lacked a Pointer Authentication Code (PAC) bypass for users on Apple Silicon devices; a PAC bypass is required to make the arbitrary calls needed for their data isolation bypass.
A full exploit chain was too expensive, especially when the chain is meant to be used at relatively large scale. This applies particularly to watering hole attacks, where the chances of detection are high and the zero-day vulnerability and exploit might consequently be burned quickly.
Stealing credentials is sufficient for their operations and the information they want to collect.
This trend is also observed beyond the browser environment, wherein third-party mobile applications (e.g., messaging applications) are targeted, and threat actors are stealing the information only accessible within the targeted application.
Spotlight #2: CIGAR Local Privilege Escalations
CIGAR’s Browser Exploit Chain
In early October 2024, GTIG independently discovered a fully weaponized exploit chain for the Firefox and Tor browsers employed by CIGAR. CIGAR is a dual financial- and espionage-motivated threat group assessed to be running both types of campaigns in parallel, often simultaneously. In 2023, we observed CIGAR utilizing an exploit chain in Microsoft Office (CVE-2023-36884) as part of an espionage campaign targeting attendees of the Ukrainian World Congress and NATO Summit; in the October 2024 campaign, however, the use of the Firefox exploit appears more in line with the group’s financial motives.
Our analysis, which broadly matched ESET’s findings, indicated that the browser RCE used is a use-after-free vulnerability in the Animation timeline. The vulnerability, known as CVE-2024-9680, was an n-day at the time of discovery by GTIG.
Upon further analysis, we identified that the embedded sandbox escape, which also served as a local privilege escalation to NT/SYSTEM, exploited a previously unknown vulnerability. We reported it to Mozilla and Microsoft, and it was later assigned CVE-2024-49039.
Double-Down on Privilege Escalation: from Low Integrity to SYSTEM
Firefox uses security sandboxing to introduce an additional security boundary and mitigate the effects of malicious code achieving code execution in content processes. Therefore, to achieve code execution on the host, an additional sandbox escape is required.
The in-the-wild CVE-2024-49039 exploit, which contained the PDB string C:\etalon\PocLowIL@Output\PocLowIL.pdb, could achieve both a sandbox escape and privilege escalation. It abused two distinct issues to escalate privileges from Low Integrity Level (IL) to SYSTEM: the first allowed it to access the WPTaskScheduler RPC interface (UUID: {33d84484-3626-47ee-8c6f-e7e98b113be1}), normally not reachable from a sandboxed Firefox content process, via the “less-secure endpoint” ubpmtaskhostchannel created in ubpm.dll; the second stemmed from insufficient Access Control List (ACL) checks in the WPTaskScheduler.dll RPC server, which allowed an unprivileged user to create and execute scheduled tasks as SYSTEM.
1. Securing the endpoint: In WPTaskScheduler::TsiRegisterRPCInterface, the third argument to RpcServerUseProtseq is a non-NULL security descriptor (SD).
This SD should prevent the Firefox “Content” process from accessing the WPTaskScheduler RPC endpoint. However, a lesser-known “feature” of RPC is that endpoints are multiplexed: if a less secure endpoint exists in the same process, an interface can be reached indirectly through that other endpoint (with its more permissive ACL). This is what the exploit does: instead of accessing RPC through the ALPC port that WPTaskScheduler.dll sets up, it resolves the interface indirectly via ubpmtaskhostchannel. ubpm.dll uses a NULL security descriptor when initializing the interface, relying instead on the UbpmpTaskHostChannelInterfaceSecurityCb callback for ACL checks:
Figure 5: NULL security descriptor used when creating “ubpmtaskhostchannel” RPC endpoint in ubpm.dll::UbpmEnableTaskHostChannelRpcInterface, exposing a less secure endpoint for WPTaskScheduler interface
2. Securing the interface: In the same WPTaskScheduler::TsiRegisterRPCInterface function, an overly permissive security descriptor was passed to RpcServerRegisterIf3. As the listing below shows, the CVE-2024-49039 patch addressed this by introducing a more locked-down SD.
Figure 6: Patched WPTaskScheduler.dll introduces a more restrictive security descriptor when registering an RPC interface
3. Ad-hoc Security: Implemented in WPTaskScheduler.dll::CallerHasAccess and called prior to enabling or executing any scheduled task, this function checks whether the calling user is attempting to execute a task they created or one they should otherwise be able to access, but it performs no additional checks to block calls originating from an unprivileged user.
The patch for CVE-2024-49039 addresses the issue by applying a more restrictive ACL to the interface; however, the less secure endpoint described in “1. Securing the endpoint” remains, and a restricted token process is still able to access the endpoint.
Unidentified Actor Using the Same Exploits
In addition to CIGAR, we discovered another, likely financially motivated, group using exactly the same exploits (albeit with a different payload) while CVE-2024-49039 was still a zero-day. This actor used a watering hole on a legitimate but compromised cryptocurrency news website that redirected visitors to an attacker-controlled domain hosting the same CVE-2024-9680 and CVE-2024-49039 exploits.
Outlook and Implications
Defending against zero-day exploitation remains a race of strategy and prioritization. Not only are zero-day vulnerabilities becoming easier to procure, but attackers finding value in new types of technology may strain vendors with less security experience. While organizations have historically had to prioritize patching based on their own threat models and attack surfaces, broader trends, together with lessons learned from major vendors’ mitigation efforts, can now inform a more targeted approach.
We expect zero-day vulnerabilities to maintain their allure to threat actors as opportunities for stealth, persistence, and detection evasion. While we observed trends regarding improved vendor security posture and declining exploitation numbers for certain historically popular products—particularly mobile devices and browsers—we anticipate that zero-day exploitation will continue to rise steadily. Given the ubiquity of operating systems and browsers in daily use, big tech vendors are consistently high-interest targets, and we expect this to continue. Phones and browsers will almost certainly remain popular targets, although enterprise software and appliances will likely see a continued rise in zero-day exploitation. Big tech companies have been victims of zero-day exploitation before and will continue to be targeted. This experience, along with the resources required to build more secure products and to detect vulnerabilities responsibly, allows larger companies to approach zero-days as a more manageable problem.
For newly targeted vendors, and for those whose products are part of the growing set of targeted enterprise technologies, security practices and procedures should evolve to consider how successful exploitation of these products could bypass typical protection mechanisms. Preventing successful exploitation will rely heavily on these vendors’ ability to enforce proper and safe coding practices. We continue to see the same types of vulnerabilities exploited over time, indicating patterns in which weaknesses attackers seek out and find most beneficial to exploit. The continued existence and exploitation of similar issues makes finding zero-days easier: threat actors know what to look for and where exploitable weaknesses are most pervasive.
Vendors should account for this shift in threat activity and address gaps in configurations and architectural decisions that could permit exploitation of a single product to cause irreparable damage. This is especially true for highly valuable tools with administrator access and/or widespread reach across systems and networks. Best practices continue to represent a minimum threshold of what security standards an architecture should demonstrate, including zero-trust fundamentals such as least-privilege access and network segmentation. Continuous monitoring should occur where possible in order to restrict and end unauthorized access as swiftly as possible, and vendors will need to account for EDR capabilities for technologies that currently lack them (e.g., many security and networking products). GTIG recommends acute threat surface awareness and respective due diligence in order to defend against today’s zero-day threat landscape. Zero-day exploitation will ultimately be dictated by vendors’ decisions and ability to counter threat actors’ objectives and pursuits.
AWS Amplify introduces two key improvements to its backend tooling: streamlined deployment output using the AWS CDK Toolkit and a new notice system for local development. These updates optimize how developers receive deployment status information and important messages directly in their terminal when executing Amplify commands.
These enhancements improve the development experience by focusing on essential information while proactively surfacing critical notices. Frontend developers can now focus on relevant deployment information without the distraction of underlying infrastructure details. The notice system, similar to CDK’s approach, delivers important messages about potential issues, compatibility concerns, and other noteworthy items related to Amplify backends, enabling developers to address problems early in the development process.
These features are available in all AWS Regions where AWS Amplify is supported.
AWS Certificate Manager (ACM) announces automated public TLS certificates for Amazon CloudFront. CloudFront customers can now simply check a box to receive required public certificates to enable TLS when creating new CloudFront content delivery applications. ACM and CloudFront work together to automatically request, issue and associate the required public certificates with CloudFront. ACM will also automatically renew these certificates as long as the certificate is in use and traffic for the certificate domain is routed to CloudFront. Previously, to set up a similar secure CloudFront distribution, customers had to request a public certificate through ACM, validate the domain, and then associate the issued certificate with the CloudFront distribution. This option remains available to customers.
ACM uses a domain validation method commonly referred to as HTTP, or file-based, validation to both issue and renew these certificates. Domain validation ensures that ACM issues certificates only to users who are authorized to acquire a certificate for the domain. Network and certificate administrators can still use ACM to view and monitor these certificates. While ACM automatically manages the certificate lifecycle, administrators can use ACM’s certificate lifecycle CloudWatch events to monitor certificate updates and publish the information to a centralized security information and event management (SIEM) and/or enterprise resource planning (ERP) solution.
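As a sketch of that monitoring pattern, the snippet below creates an EventBridge rule that matches ACM certificate lifecycle events and forwards them to an SNS topic that could feed a SIEM. The detail-type string, rule name, and topic ARN are illustrative; confirm them against the ACM EventBridge documentation:

import boto3
import json

events = boto3.client("events")

# Match ACM certificate lifecycle events (the detail-type below is the one ACM
# documents for certificates nearing expiration; adjust for other lifecycle events).
events.put_rule(
    Name="acm-certificate-lifecycle",
    EventPattern=json.dumps({
        "source": ["aws.acm"],
        "detail-type": ["ACM Certificate Approaching Expiration"],
    }),
    State="ENABLED",
)

# Forward matched events to an SNS topic (placeholder ARN) that feeds the SIEM.
events.put_targets(
    Rule="acm-certificate-lifecycle",
    Targets=[{"Id": "siem-feed", "Arn": "arn:aws:sns:us-east-1:123456789012:cert-events"}],
)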
To learn more about this feature, please refer to our documentation. You can learn more about ACM here and CloudFront here.
Today, AWS announced support for VPC endpoints in Amazon Route 53 Profiles, allowing you to create, manage, and share private hosted zones (PHZs) for interface VPC endpoints across multiple VPCs and AWS accounts within your organization. With this enhancement, Amazon Route 53 Profiles simplifies the management of VPC endpoints by streamlining the creation and association of interface VPC endpoint managed PHZs with VPCs and AWS accounts, without requiring you to associate them manually.
Route 53 Profiles makes it easy for you to create one or more configurations for VPC-related DNS settings, such as private hosted zones and Route 53 Resolver rules, and share them across VPCs and AWS accounts. The new capability helps you centralize the management of PHZs associated with interface VPC endpoints, reducing administrative overhead and minimizing the risk of configuration errors. This feature eliminates the need for creation and manual association of PHZs for VPC endpoints with individual VPCs and accounts, saving time and effort for network administrators. Additionally, it improves security and consistency by providing a centralized approach to managing DNS resolution for VPC endpoints across an organization’s AWS infrastructure.
Route 53 Profiles support for VPC endpoints is now available in the AWS Regions mentioned here. To learn more about the capability and how it can benefit your organization, visit the Amazon Route 53 documentation. You can get started by accessing the Amazon Route 53 console in your AWS Management Console or through AWS CLI. To learn more about pricing of Route 53 Profiles, see here.
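For a programmatic start, here is a minimal sketch using the AWS SDK for Python; the Profile name and VPC ID are placeholders, and the field names should be checked against the route53profiles API reference:

import boto3

r53profiles = boto3.client("route53profiles")

# Create a Profile to hold shared DNS configuration, including
# interface VPC endpoint managed PHZs.
profile = r53profiles.create_profile(
    Name="shared-endpoint-dns",
    ClientToken="example-idempotency-token",
)
profile_id = profile["Profile"]["Id"]

# Associate the Profile with a VPC so the Profile's DNS settings
# apply to resources in that VPC.
r53profiles.associate_profile(
    Name="shared-endpoint-dns-to-app-vpc",
    ProfileId=profile_id,
    ResourceId="vpc-0123456789abcdef0",  # placeholder VPC ID
)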
Writer’s enterprise-grade foundation models, Palmyra X5 and X4, are now available as fully managed, serverless models in Amazon Bedrock. AWS is the first cloud provider to offer fully managed models from Writer, allowing organizations to leverage enterprise AI capabilities with serverless scalability and cost optimization.
Palmyra X5 has a one million token context window, while Palmyra X4 features a 128,000 token context window, both designed for sophisticated business applications. These models, top-ranked on Stanford’s HELM benchmark, excel at complex tasks including advanced reasoning, multi-step tool-calling, and built-in RAG (Retrieval-Augmented Generation). Both models support multiple languages including English, Spanish, French, German, and Chinese, making them ideal for global enterprise deployments. Organizations can use Palmyra models to automate sophisticated workflows across various industries—financial services teams can analyze extensive market research and regulatory documents, healthcare providers can process medical documentation and analyze research papers, and technology companies can generate and validate code at scale. With Amazon Bedrock, these capabilities are available with automatic resource scaling and on-demand pricing.
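As a minimal sketch of calling one of these models through the Bedrock Converse API with the AWS SDK for Python (the model identifier below is illustrative; check the Amazon Bedrock console for the exact Palmyra X5 model ID available in your Region):

import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-west-2")

# Send a single-turn request to Palmyra X5 (placeholder model ID).
response = bedrock.converse(
    modelId="us.writer.palmyra-x5-v1:0",
    messages=[{
        "role": "user",
        "content": [{"text": "Summarize the key risks in this market research excerpt: ..."}],
    }],
    inferenceConfig={"maxTokens": 512, "temperature": 0.2},
)

# Print the model's reply text.
print(response["output"]["message"]["content"][0]["text"])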
Today, AWS announced a new AWS Client VPN feature that monitors device networking routes, prevents VPN traffic leaks, and strengthens remote access security. The feature continuously tracks your users’ device routing tables to ensure outbound traffic flows through the VPN tunnel according to your configured settings. If the feature detects any modified networking route settings, it automatically restores the routes to your original configuration.
AWS Client VPN allows admins to configure routes on users’ devices to direct traffic through the VPN. For example, an admin might configure end users’ devices to connect to the 10.0.0.0/24 network using VPN connectivity while the rest of the traffic breaks out locally from the device. However, connected devices could deviate from the organization’s configurations, causing VPN leaks: even if you configure traffic to the 10.0.0.0/24 network to go via VPN, users or other clients running on the device can modify settings and bypass the VPN for that traffic. With this feature enabled, the VPN client continuously monitors routes and automatically corrects deviations by repairing routes back to the original configuration. This ensures the admin’s configuration is consistently applied to end users’ devices, maintaining the connection integrity of your organization.
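As an illustration of the split-tunnel setup described above, here is a hedged boto3 sketch that sends only 10.0.0.0/24 through the VPN endpoint and authorizes clients to reach it; the endpoint and subnet IDs are placeholders:

import boto3

ec2 = boto3.client("ec2")

# Route only 10.0.0.0/24 through the VPN tunnel; all other traffic
# breaks out locally from the device (split tunnel).
ec2.create_client_vpn_route(
    ClientVpnEndpointId="cvpn-endpoint-0123456789abcdef0",
    DestinationCidrBlock="10.0.0.0/24",
    TargetVpcSubnetId="subnet-0123456789abcdef0",
    Description="Corporate network via VPN",
)

# Authorize all connected clients to reach that network.
ec2.authorize_client_vpn_ingress(
    ClientVpnEndpointId="cvpn-endpoint-0123456789abcdef0",
    TargetNetworkCidr="10.0.0.0/24",
    AuthorizeAllGroups=True,
)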
This feature is available at no additional cost in all AWS Regions where AWS Client VPN is generally available.
Amazon Web Services is announcing the general availability of next-generation, high-performance storage optimized Amazon EC2 I7i instances. Powered by 3rd generation AWS Nitro SSDs, I7i instances offer up to 45TB of NVMe storage with up to 50% better real-time storage performance, up to 50% lower storage I/O latency, and up to 60% lower storage I/O latency variability compared to I4i instances. With 5th generation Intel Xeon Scalable processors at an all-core turbo frequency of 3.2 GHz, I7i instances offer the best compute and storage performance for x86-based storage optimized instances in Amazon EC2, delivering up to 23% better compute performance and more than 10% better price performance than previous generation I4i instances.
I7i instances are ideal for I/O-intensive and latency-sensitive workloads such as transactional databases, real-time and NoSQL databases, real-time analytics, AI/ML pre-processing for training, and indexing and search engines that require high random IOPS performance and real-time latency when accessing small to medium size datasets (multiple TBs). Additionally, the torn write prevention feature enables customers to eliminate database performance bottlenecks.
I7i instances are available in eleven sizes – nine virtual sizes up to 48xlarge and two bare metal options – delivering up to 100Gbps of network bandwidth and 60Gbps of Amazon Elastic Block Store (EBS) bandwidth. These instances are available today in the US East (N. Virginia, Ohio) and US West (Oregon) AWS Regions, with flexible purchase options including On-Demand and Savings Plans. To learn more, visit the I7i instances page.
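Launching an I7i instance looks like any other EC2 launch; in this sketch the AMI ID is a placeholder and i7i.4xlarge stands in for any of the nine virtual sizes:

import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")

# Launch one storage optimized I7i instance. The local NVMe SSDs are
# physically attached to the instance, so no block-device mapping is
# needed to use them.
ec2.run_instances(
    ImageId="ami-0123456789abcdef0",  # placeholder AMI
    InstanceType="i7i.4xlarge",
    MinCount=1,
    MaxCount=1,
    EbsOptimized=True,
)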
Starting today, Amazon EC2 High Memory instances with 24TB of memory (u-24tb1.112xlarge) are available in the US East (Ohio) region. Customers can start using these new High Memory instances with On Demand and Savings Plan purchase options.
Amazon EC2 High Memory instances are certified by SAP for running Business Suite on HANA, SAP S/4HANA, Data Mart Solutions on HANA, Business Warehouse on HANA, and SAP BW/4HANA in production environments. For details, see the Certified and Supported SAP HANA Hardware Directory.
For information on how to get started with your SAP HANA migration to EC2 High Memory instances, view the Migrating SAP HANA on AWS to an EC2 High Memory Instance documentation. To hear from Steven Jones, GM for SAP on AWS on what this launch means for our SAP customers, you can read his launch blog.
Today, AWS announces CloudFront SaaS Manager, a new Amazon CloudFront feature designed to efficiently manage content delivery across multiple websites for Software-as-a-Service (SaaS) providers, web development platforms, and companies with multiple brands/websites. CloudFront SaaS Manager provides a unified experience, alleviating the operational burden of managing multiple websites at scale, including TLS certificate management, DDoS protection, and observability.
CloudFront SaaS Manager introduces reusable configuration settings, eliminating redundant configurations and allowing customers to maintain consistent settings across their websites. This not only saves time but also reduces the potential for errors in configuration. With CloudFront SaaS Manager, customers benefit from optimal CDN and security defaults, ensuring high performance and secure protections following AWS best practices. Additionally, CloudFront SaaS Manager can automate requesting, issuing, and associating TLS certificates with CloudFront through a simplified AWS Certificate Manager (ACM) integration. This addresses the growing complexity in certificate management, security policy enforcement, and cross-account synchronization that companies face as their customer base expands.
At Google, we believe in empowering people and founders to use AI to tackle humanity’s biggest challenges. That’s why we’re supporting the next generation of AI leaders through our Google for Startups Accelerator: AI First programs. We announced the program in January, and today we’re proud to welcome to our accelerator community 16 UK-based startups that are using AI to drive real-world impact.
Out of hundreds of applicants, we’ve carefully selected these 16 high-potential startups to receive 1:1 guidance and support from Google, each demonstrating a unique vision for leveraging AI to address critical challenges and opportunities. This diverse cohort showcases how AI is being applied across sectors — from early cancer detection and climate resilience, to smarter supply chains and creative content generation. By joining the Google for Startups Accelerator: AI First UK program, these startups gain access to technical expertise, mentorship, and a global network to help them scale responsibly and sustainably.
“Google for Startups Accelerator: AI First provides an exceptional opportunity for us to enhance our AI expertise, accelerate the development of our data-driven products, and engage meaningfully with potential investors.” – Denise Williams, Managing Director, Dysplasia Diagnostics.
Read more about the selected startups and the founders shaping the future of AI:
Bindbridge (London) is a generative AI platform that discovers and designs molecular glues for targeted protein degradation in plants.
Building Atlas (Edinburgh) uses data and AI to support the decarbonisation of non-domestic buildings by modelling the best retrofit plans for any portfolio size.
Comply Stream (London) helps to streamline financial crime compliance operations for businesses and consumers.
Datawhisper (London) provides safe and compliant AI Agentic solutions tailored for the fintech and payments industry.
Deducta (London) is a data intelligence platform that supports global procurement teams with supply chain insights and efficiencies.
Dysplasia Diagnostics (London) develops AI-based, non-invasive, and affordable solutions for early cancer detection and treatment monitoring.
Flow.bio (London) is an end-to-end cloud platform for running large sequencing pipelines and auto-structuring bio-data for machine learning workflows.
Humble (London) enables non-technical users to build and share AI-powered apps and workflows, allowing them to automate without writing code.
Immersive Fox (London) is an AI studio for creating presenter-led marketing and communication videos directly from text.
Kestrix (London) uses thermal drones and advanced software to map and quantify heat loss from buildings and generate retrofit plans.
Measmerize (Birmingham) provides sizing advice for fashion e-commerce retailers, enabling brands to increase sales and decrease return rates.
PSi (London) uses AI to host large-scale online deliberations, enabling local governments to harness collective intelligence for effective policymaking.
Shareback (London) is an AI platform that allows employees to securely interact with GPT-based assistants trained on company, department, or project-specific data.
Sikoia (London) streamlines customer verification for financial services by consolidating data, automating tasks, and delivering actionable insights.
SmallSpark (Cardiff) enables low power AI at the edge, simplifying the deployment, management, and optimization of ML models on embedded devices.
Source.dev (London) simplifies the software development lifecycle for smart devices, to help accelerate innovation and streamline software updates.
“Through the program, we aim to leverage Google’s expertise and cutting-edge AI infrastructure to supercharge our growth on all fronts.” – Lauren Ladd, Founder, Shareback
These 16 startups reflect the diversity and depth of AI innovation happening across the UK. Each company will receive technical mentorship, strategic guidance, and access to strategic connections from Google, and will continue to receive hands-on support via our alumni network after the program wraps in July.
Congratulations to this latest cohort! To learn more about applying for an upcoming Google for Startups program, visit the program page here.
In 2023, the Waze platform engineering team transitioned to Infrastructure as Code (IaC) using Google Cloud’s Config Connector (KCC) — and we haven’t looked back since. We embraced Config Connector, an open-source Kubernetes add-on, to manage Google Cloud resources through Kubernetes. To streamline management, we also leverage Config Controller, a hosted version of Config Connector on Google Kubernetes Engine (GKE), incorporating Policy Controller and Config Sync. This shift has significantly improved our infrastructure management and is shaping our future infrastructure.
The shift to Config Connector
Previously, Waze relied on Terraform to manage resources, particularly during our dual-cloud, VM-based phase. However, maintaining state and ensuring reconciliation proved challenging, leading to inconsistent configurations and increased management overhead.
In 2023, we adopted Config Connector, transforming our Google Cloud infrastructure into Kubernetes Resource Modules (KRMs) within a GKE cluster. This approach addresses the reconciliation issues encountered with Terraform. Config Sync, paired with Config Connector, automates KRM synchronization from source repositories to our live GKE cluster. This managed solution eliminates the need for us to build and maintain custom reconciliation systems.
The shift helped us meet the needs of three key roles within Waze’s infrastructure team:
Infrastructure consumers: Application developers who want to easily deploy infrastructure without worrying about the maintenance and complexity of underlying resources.
Infrastructure owners: Experts in specific resource types (e.g., Spanner, Google Cloud Storage, Load Balancers, etc.), who want to define and standardize best practices in how resources are created across Waze on Google Cloud.
Platform engineers: Engineers who build the system that enables infrastructure owners to codify and define best practices, while also providing a seamless API for infrastructure consumers.
First stop: Config Connector
It may seem circular to define all of our Google Cloud infrastructure as KRMs within a Google Cloud service; however, KRM is actually a great representation for our infrastructure compared to existing IaC tooling.
Terraform’s reconciliation issues – state drift, version management, out-of-band changes – are a significant pain. Config Connector, through Config Sync, offers out-of-the-box reconciliation, a managed solution we prefer. Both KRM and Terraform offer templating, but KCC’s managed nature aligns with our shift to Google Cloud-native solutions and reduces our maintenance burden.
Infrastructure complexity requires generalization regardless of the tool. We can see this when we look at the Spanner requirements at Waze:
Consistent backups for all Spanner databases
Each Spanner database utilizes a dedicated Cloud Storage bucket and Service Account to automate the execution of DDL jobs.
All IAM policies for Spanner instances, databases, and Cloud Storage buckets are defined in code to ensure consistent and auditable access control.
To define these resources, we evaluated various templating and rendering tools and selected Helm, a robust CNCF package manager for Kubernetes. Its strong open-source community, rich templating capabilities, and native rendering features made it a natural fit, and we now refer to our bundled infrastructure configurations as ‘Charts.’ While KRO, which achieves a similar purpose, has since emerged, our selection process predated its availability.
Under the hood
Let’s open the hood and dive into how the system works and is driving value for Waze.
Waze infrastructure owners generically define Waze-flavored infrastructure in Helm Charts.
Infrastructure consumers use these Charts with simplified inputs to generate infrastructure (demo).
Infrastructure code is stored in repositories, enabling validation and presubmit checks.
Code is uploaded to Artifact Registry, where Config Sync and Config Connector align Google Cloud infrastructure with the code definitions.
This diagram represents a single “data domain”: a collection of bounded services, databases, networks, and data. Many tech orgs today maintain several such domains, such as Prod, QA, Staging, and Development.
Approaching our destination
So why does all of this matter? Adopting this approach allowed us to move from Infrastructure as Code to Infrastructure as Software. By treating each Chart as a software component, our infrastructure management goes beyond simple code declaration. Now, versioned Charts and configurations enable us to leverage a rich ecosystem of software practices, including sophisticated release management, automated rollbacks, and granular change tracking.
Here’s where we apply this in practice: our configuration inheritance model minimizes redundancy. Resource Charts inherit settings from Projects, which inherit from Bootstraps. All three are defined as Charts. Consequently, Bootstrap configurations apply to all Projects, and Project configurations apply to all Resources.
Every change to our infrastructure – from changes on existing infrastructure to rolling out new resource types – can be treated like a software rollout.
Now that all of our infrastructure is treated like software, we can see what this does for us system-wide.
Reaching our destination
In summary, Config Connector and Config Controller have enabled Waze to achieve true Infrastructure as Software, providing a robust and scalable platform for our infrastructure needs, along with many other benefits including:
Infrastructure consumers receive the latest best practices through versioned updates.
Infrastructure owners can iterate and improve infrastructure safely.
Platform Engineers and Security teams are confident our resources are auditable and compliant.
For data scientists and ML engineers, building analysis and models in Python is almost second nature, and Python’s popularity in the data science community has only skyrocketed with the recent generative AI boom. We believe that the future of data science is no longer just about neatly organized rows and columns. For decades, many valuable insights have been locked in images, audio, text, and other unstructured formats. And now, with the advances in gen AI, data science workloads must evolve to handle multi-modality and use new gen AI and agentic techniques.
To prepare you for the data science of tomorrow, we announced BigQuery DataFrames 2.0 last week at Google Cloud Next 25, bringing multimodal data processing and AI directly into your BigQuery Python workflows.
Extending Pandas DataFrames for BigQuery Multimodal Data
In BigQuery, data scientists frequently turn to Python to process large datasets for analysis and machine learning. However, this almost always means learning a different Python framework and rewriting the code that worked on smaller datasets. You can hardly take Pandas code that worked on 10 GB of data and get it working on a terabyte without expending significant time and effort.
Version 2.0 also strengthens the core foundation for larger-scale Python data science, then builds on that foundation with groundbreaking new capabilities that unlock the full potential of your data, both structured and unstructured.
BigQuery DataFrames adoption
We launched BigQuery DataFrames last year as an open-source Python library that scales Python data processing without having to add any new infrastructure or APIs, transpiling common Python data science APIs from Pandas and scikit-learn to various BigQuery SQL operators. Since its launch, there’s been over 30X growth in how much data it processes and, today, thousands of customers use it to process more than 100 PB every month.
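To give a feel for the API, here is a minimal sketch against a BigQuery public dataset: the code reads like Pandas, but execution is pushed down to BigQuery as SQL rather than pulling data into local memory (the table and column names come from the public usa_names dataset):

import bigframes.pandas as bpd

# Read a public table; no data is downloaded at this point.
df = bpd.read_gbq("bigquery-public-data.usa_names.usa_1910_current")

# Familiar Pandas-style transformations, compiled to BigQuery SQL.
top_names = (
    df.groupby("name")["number"]
    .sum()
    .sort_values(ascending=False)
    .head(10)
)

# Execution happens in BigQuery; only the small result is returned.
print(top_names)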
During the last year we evolved our library significantly across 50+ releases and worked closely with thousands of users. Here’s how a couple of early BigQuery DataFrames customers use this library in production.
Deutsche Telekom has standardized on BigQuery DataFrames for its ML platform.
“With BigQuery DataFrames, we can offer a scalable and managed ML platform to our data scientists with minimal upskilling.” – Ashutosh Mishra, Vice President – Data Architecture & Governance, Deutsche Telekom
Trivago, meanwhile, migrated its PySpark transformations to BigQuery DataFrames.
“With BigQuery DataFrames, data science teams focus on business logic and not on tuning infrastructure.” – Andrés Sopeña Pérez, Head of Data Infrastructure, Trivago
What’s new in BigQuery DataFrames 2.0?
This release is packed with features designed to streamline your AI and machine learning pipelines:
Working with multimodal data and generative AI techniques
Multimodal DataFrames (Preview): BigQuery DataFrames 2.0 introduces a unified dataframe that can handle text, images, audio, and more, alongside traditional structured data, breaking down the barriers between structured and unstructured data. This is powered by BigQuery’s multimodal capabilities enabled by ObjectRef, helping to ensure scalability and governance for even the largest datasets.
When working with multimodal data, BigQuery DataFrames also abstracts away many of the details of working with multimodal tables and processing multimodal data, leveraging BigQuery features behind the scenes such as embedding generation, vector search, Python UDFs, and others.
Pythonic operators for BigQuery AI Query Engine (experimental): BigQuery AI Query Engine makes it trivial to generate insights from multimodal data: now you can analyze unstructured data simply by including natural language instructions in your SQL queries. Imagine writing SQL queries that rank call transcripts in a table by ‘quality of support’ or generate a list of products with ‘high satisfaction’ based on reviews in a column. BigQuery AI Query Engine makes this possible with simple, stackable SQL.
BigQuery DataFrames offers a DataFrame interface to work with AI Query Engine. Here’s a sample:
import bigframes.pandas as bpd
from bigframes.ml import llm

gemini_model = llm.GeminiTextGenerator(model_name="gemini-1.5-flash-002")

# Get the top K products with higher satisfaction
df = bpd.read_gbq("project.dataset.transcripts_table")
result = df.ai.top_k("The reviews in {review_transcription_col} indicate higher satisfaction", model=gemini_model)

# Works with multimodal data as well.
df = bpd.from_glob_path("gs://bucket/images/*", name="image_col")
result = df.ai.filter("The main object in the {image_col} can be seen in city streets", model=gemini_model)
Gemini Code Assist for DataFrames (Preview): To keep up with the evolving user expectations around code generation, we’re also making it easier to develop BigQuery DataFrames code, using natural language prompts directly within BigQuery Studio. Together, Gemini’s contextual understanding and DataFrames-specific training help ensure smart, efficient code generation. This feature is released as part of Gemini in BigQuery.
Strengthening the core
To make the core Python data science workflow richer and faster to use, we added the following features.
Partial ordering (GA): By default, BigQuery DataFrames maintains strict ordering (as does Pandas). With 2.0, we’re introducing a relaxed ordering mode that significantly improves performance, especially for large-scale feature engineering. This “spin” on traditional Pandas ordering is tailored for the massive datasets common in BigQuery. Read more about partial ordering here.
Here’s some example code that uses partial ordering:
import bigframes.pandas as bpd
import datetime

# Enable the partial ordering mode
bpd.options.bigquery.ordering_mode = "partial"

pypi = bpd.read_gbq("bigquery-public-data.pypi.file_downloads")

# Show a preview of the previous day's downloads.
# The partial ordering mode is 4,000,000+ times more efficient in terms of billed bytes.
last_1_days = datetime.datetime.now(datetime.timezone.utc) - datetime.timedelta(days=1)
bigframes_downloads = pypi[(pypi["timestamp"] > last_1_days) & (pypi["project"] == "bigframes")]
bigframes_downloads[["timestamp", "project", "file"]].peek()
Work with Python UDF (Preview): BigQuery Python user-defined functions are now available in preview [see the documentation].
Within BigQuery DataFrames, you can now auto-scale Python function execution to millions of rows with serverless, scale-out execution. All you need to do is put a “@udf” decorator on top of a function that needs to be pushed to the server side.
Here is an example code that tokenizes comments from stackoverflow data stored in a BigQuery public table with ~90 million rows using a Python UDF:
import bigframes.pandas as bpd

# Auto-create the server-side Python UDF
@bpd.udf(packages=["tokenizer"])
def get_sentences(text: str) -> list[str]:
    from tokenizer import split_into_sentences
    return list(split_into_sentences(text))

df = bpd.read_gbq("bigquery-public-data.stackoverflow.comments")

# Invoke the Python UDF
result = df["text"].apply(get_sentences)
result.peek()
dbt Integration (Preview): For all the dbt users out there, you can now integrate BigQuery DataFrames Python into your existing dbt workflows. The new dbt Python model allows you to run BigQuery DataFrames code alongside your BigQuery SQL, unifying billing and simplifying infrastructure management. No new APIs or infrastructure to learn: just the power of Python and BigQuery DataFrames within your familiar dbt environment. [Try now]
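Here is a hedged sketch of what such a dbt Python model could look like; the model and column names are illustrative, and the “bigframes” submission-method value should be verified against the dbt adapter documentation for your version:

# models/product_ratings.py - a dbt Python model (illustrative names).

def model(dbt, session):
    # Ask dbt to run this model's Python via BigQuery DataFrames
    # (verify the exact config key and value for your dbt-bigquery version).
    dbt.config(submission_method="bigframes")

    # dbt.ref returns a DataFrame for the upstream model.
    reviews = dbt.ref("raw_reviews")

    # Standard DataFrame transformations, executed in BigQuery.
    summary = reviews.groupby("product_id", as_index=False)["rating"].mean()
    return summary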
For years, unstructured data has largely resided in silos, separate from the structured data in data warehouses. This separation restricted the ability to perform comprehensive analysis and build truly powerful AI models. BigQuery’s multimodal capabilities and BigQuery DataFrames 2.0 eliminate this divide, bringing the capabilities traditionally associated with data lakes directly into the data warehouse, enabling:
Unified data analysis: Analyze all your data – structured and unstructured – in one place, using a single, consistent Pandas-like API.
LLM-powered insights: Unlock deeper insights by combining the power of LLMs with the rich context of your structured data.
Simplified workflows: Streamline your data pipelines and reduce the need for complex data movement and transformation.
Scalability and governance: Leverage BigQuery’s serverless architecture and robust governance features for all your data, regardless of format.
See BigQuery DataFrames 2.0 in Action
You can see all of these features in action in this video from Google Cloud Next ’25.
Get started today!
BigQuery DataFrames 2.0 is a game-changer for anyone working with data and AI. It’s time to unlock the full potential of your data, regardless of its structure. Start experimenting with the new features today!