Today, Amazon CloudWatch announces support for a new tag-based telemetry experience to help customers monitor their metrics and set up their alarms using AWS resource tags. This new capability simplifies monitoring cloud infrastructure at scale by automatically adapting alarms and metrics analysis as resources change. DevOps engineers and cloud administrators can now create dynamic monitoring views that align with their organizational structure using their existing AWS resource tags.
Tag-based filtering eliminates the manual overhead of updating alarms and dashboards after deployments, freeing teams to focus on innovation rather than maintenance. It provides faster, targeted insights that match how teams organize their systems. Teams can query AWS default metrics using their existing resource tags, making it easier to troubleshoot issues and maintain operational visibility while focusing on core business initiatives.
CloudWatch tag-based filtering is available in the following regions: US East (N. Virginia); US East (Ohio); US West (N. California); US West (Oregon); Asia Pacific (Tokyo); Asia Pacific (Seoul); Asia Pacific (Singapore); Asia Pacific (Sydney); Asia Pacific (Mumbai); Asia Pacific (Osaka); Canada (Central); Europe (Frankfurt); Europe (Ireland); Europe (London); Europe (Paris); Europe (Stockholm) and South America (São Paulo).
You can now preview your Amazon S3 Tables directly in the S3 console without having to write a SQL query. You can view the schema and sample rows of your tables stored in S3 Tables to better understand and gather key information about your data quickly, without any setup.
AWS X-Ray, a service that helps developers analyze and debug distributed applications by providing request tracing capabilities, now offers adaptive sampling to solve a common challenge for DevOps teams, Site Reliability Engineers (SREs), and application developers. These customers often face a difficult trade-off: setting sampling rates too low risks missing critical traces during incidents, while setting them too high unnecessarily increases observability costs during normal operations. Today, with adaptive sampling, you can automatically adjust sampling rates within user-defined limits to ensure you capture the most important traces precisely when you need them. This helps development teams reduce mean time to resolution (MTTR) during incidents by providing comprehensive trace data for root cause analysis, while maintaining cost-efficient sampling rates during normal operations.
Adaptive sampling supports two approaches, Sampling Boost and Anomaly Span Capture, which can be applied independently or combined. Customers can use Sampling Boost to temporarily increase sampling rates when anomalies are detected to capture complete traces, and Anomaly Span Capture to ensure anomaly-related spans are always captured, even when the full trace isn't sampled.
Adaptive sampling is currently available in all commercial Regions where AWS X-Ray is offered. For more information, see the X-Ray documentation, and see the CloudWatch pricing page for X-Ray pricing details.
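For context, sampling behavior is governed by X-Ray sampling rules, and adaptive sampling adjusts rates within limits you define. A minimal sketch of a baseline rule using the existing create-sampling-rule command (rule name and rates are illustrative; the announcement does not document boost configuration fields, so none are shown here):

# Baseline rule: sample 5% of requests after the first 1 request per second,
# across all services. Adaptive sampling adjusts rates within limits like these.
aws xray create-sampling-rule --sampling-rule '{
  "RuleName": "baseline-5pct",
  "Priority": 100,
  "FixedRate": 0.05,
  "ReservoirSize": 1,
  "ServiceName": "*",
  "ServiceType": "*",
  "Host": "*",
  "HTTPMethod": "*",
  "URLPath": "*",
  "ResourceARN": "*",
  "Version": 1
}'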
Allowed AMIs, the Amazon EC2 account-wide setting that enables you to limit the discovery and use of Amazon Machine Images (AMIs) within your AWS accounts, adds support for four new parameters: marketplace codes, deprecation time, creation date, and AMI names.
Previously, you could specify accounts or owner aliases that you trust in your Allowed AMIs setting. Starting today, you can use the four new parameters to define additional criteria to further reduce the risk of inadvertently launching instances with non-compliant or unauthorized AMIs. Marketplace codes can be provided to limit the use of Marketplace AMIs, the deprecation time and creation date parameters can be used to limit the use of outdated AMIs, and the AMI name parameter can be used to restrict usage to AMIs with specific naming patterns. You can also leverage Declarative Policies to configure these parameters to perform AMI governance across your organization.
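A hedged sketch of what this configuration can look like with the Allowed AMIs CLI (the ImageProviders criterion is previously documented; the field names shown for the new criteria are illustrative placeholders, not confirmed syntax):

# Allow only Amazon-provided AMIs matching a naming pattern and created within
# the last year. "ImageNames" and "CreationDateCondition" are placeholder names
# for the new criteria; check the EC2 documentation for the exact fields.
aws ec2 replace-image-criteria-in-allowed-images-settings --image-criteria '[
  {
    "ImageProviders": ["amazon"],
    "ImageNames": ["al2023-ami-*"],
    "CreationDateCondition": {"MaximumDaysSinceCreated": 365}
  }
]'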
These additional parameters are now supported in all AWS Regions, including the AWS China (Beijing) Region, operated by Sinnet, the AWS China (Ningxia) Region, operated by NWCD, and AWS GovCloud (US). To learn more, please visit the documentation.
Amazon RDS for PostgreSQL 18.0 is now available in the Amazon RDS Database Preview Environment, allowing you to evaluate the latest PostgreSQL features while leveraging the benefits of a fully managed database service. This preview environment provides you a sandbox where you can test applications and explore new PostgreSQL 18.0 capabilities before they become generally available.
PostgreSQL 18.0 includes "skip scan" support for multicolumn B-tree indexes and improves WHERE clause handling for OR and IN conditions. It introduces parallel Generalized Inverted Index (GIN) builds and updates join operations. It now supports Universally Unique Identifiers Version 7 (UUIDv7), which combines timestamp-based ordering with traditional UUID uniqueness to boost performance in high-throughput distributed systems. Observability improvements show buffer usage counts and index lookups during query execution, along with per-connection I/O utilization metrics. Please refer to the RDS PostgreSQL release documentation for more details.
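For example, two of these features can be exercised directly in SQL on a preview instance (table and index names are illustrative):

-- UUIDv7: time-ordered UUIDs, new in PostgreSQL 18
SELECT uuidv7();

-- Skip scan: a multicolumn B-tree index can now serve queries that
-- omit the leading index column
CREATE INDEX orders_region_status_idx ON orders (region, status);
EXPLAIN SELECT * FROM orders WHERE status = 'shipped';  -- may use the index via skip scan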
Amazon RDS Database Preview Environment database instances are retained for a maximum period of 60 days and are automatically deleted after the retention period. Amazon RDS database snapshots that are created in the preview environment can only be used to create or restore database instances within the preview environment. You can use the PostgreSQL dump and load functionality to import or export your databases from the preview environment.
Amazon RDS Database Preview Environment database instances are priced the same as instances in the US East (Ohio) Region.
A year ago today, Google Cloud filed a formal complaint with the European Commission about Microsoft’s anti-competitive cloud licensing practices — specifically those that impose financial penalties on businesses that use Windows Server software on Azure’s biggest competitors.
Despite regulatory scrutiny, it’s clear that Microsoft intends to keep its restrictive licensing policies in place for most cloud customers. In fact, it’s getting worse.
As part of a recent earnings call, Microsoft disclosed that its efforts to force software customers to use Azure are “not anywhere close to the finish line,” and represented one of three pillars “driving [its] growth.” As we approach the end of September, Microsoft is imposing another wave of licensing changes to force more customers to Azure by preventing managed service providers from hosting certain workloads on Azure’s competitors.
Regulators have taken notice. As part of a comprehensive investigation, the U.K.'s Competition and Markets Authority (CMA) recently found that restrictive licensing harms cloud customers, competition, economic growth, and innovation. At the same time, a growing number of regulators around the world are also scrutinizing Microsoft's anti-competitive conduct — proving that fair competition is an issue that transcends politics and borders.
While some progress has been made, restrictive licensing continues to be a global problem, locking in cloud customers, harming economic growth, and stifling innovation.
Economic, security, and innovation harms
Restrictive cloud licensing has caused an enormous amount of harm to the global economy over the last year. This includes direct penalties that Microsoft forces businesses to pay, and downstream harms to economic growth, cybersecurity, and innovation. Ending restrictive licensing could help supercharge economies around the world.
Microsoft still imposes a 400% price markup on customers who choose to move legacy workloads to competitors’ clouds. This penalty forces customers onto Azure by making it more expensive to use a competitor. A mere 5% increase in cloud pricing due to lack of competition costs U.K. cloud customers £500 million annually, according to the CMA. A separate study in the EU found restrictive licensing amounted to a billion-Euro tax on businesses.
With AI technologies disrupting the business market in dramatic ways, ending Microsoft’s anti-competitive licensing is more important than ever as customers move to the cloud to access AI at scale. Customers, not Microsoft, should decide what cloud — and therefore what AI tools — work best for their business.
The ongoing risk of inaction
Perhaps most telling of all, the CMA found that since some of the most restrictive licensing terms went into place over the last few years, Microsoft Azure has gained customers at two or even three times the rate of competitors. Less choice and weaker competition is exactly the type of "existential challenge" to Europe's competitiveness that the Draghi report warned of.
Ending restrictive licensing could help governments “unlock up to €1.2 trillion in additional EU GDP by 2030” and “generate up to €450 billion per year in fiscal savings and productivity gains,” according to a recent study by the European Centre for International Political Economy. Now is the time for regulators and policymakers globally to act to drive forward digital transformation and innovation.
In the year since our complaint to the European Commission, our message is as clear as ever: Restrictive cloud licensing practices harm businesses and undermine European competitiveness. To drive the next century of technology innovation and growth, regulators must act now to end these anti-competitive licensing practices that harm businesses.
AWS Lambda now offers Code Signing in GovCloud Regions (AWS GovCloud (US-West) and AWS GovCloud (US-East)), which allows administrators to ensure that only trusted and verified code is deployed to Lambda functions. This feature uses AWS Signer, a managed code signing service. When code is deployed, Lambda checks the signatures to confirm the code hasn’t been altered and is signed by trusted developers.
Administrators can create Signing Profiles in AWS Signer and use AWS Identity and Access Management (IAM) to manage user access. Within Lambda, they can specify allowed signing profiles for each function and configure whether to warn or reject deployments if signature checks fail.
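A minimal sketch of the end-to-end setup with the AWS CLI (profile names, function names, and ARNs are illustrative):

# 1. Create a signing profile for the Lambda platform in AWS Signer
aws signer put-signing-profile \
    --profile-name MyTrustedProfile \
    --platform-id AWSLambda-SHA384-ECDSA

# 2. Create a code signing config that rejects unsigned or untrusted artifacts
aws lambda create-code-signing-config \
    --allowed-publishers SigningProfileVersionArns=arn:aws-us-gov:signer:us-gov-west-1:111122223333:/signing-profiles/MyTrustedProfile/ABCDEFGH \
    --code-signing-policies UntrustedArtifactOnDeployment=Enforce

# 3. Attach the config to a function
aws lambda put-function-code-signing-config \
    --function-name my-function \
    --code-signing-config-arn arn:aws-us-gov:lambda:us-gov-west-1:111122223333:code-signing-config:csc-0123456789abcdef0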
Autopilot is an operational mode for Google Kubernetes Engine (GKE) that provides a fully managed environment and takes care of operational details, like provisioning compute capacity for your workloads. Autopilot allows you to spend more time on developing your own applications and less time on managing node-level details. This year, we upgraded Autopilot’s autoscaling stack to a fully dynamic container-optimized compute platform that rapidly scales horizontally and vertically to support your workloads. Simply attach a horizontal pod autoscaler (HPA) or vertical pod autoscaler (VPA) to your environment, and experience a fully dynamic platform that can scale rapidly to serve your users.
More and more customers, including Hotspring and Contextual AI, understand that Autopilot can dramatically simplify Kubernetes cluster operations and enhance resource efficiency for their critical workloads. In fact, in 2024, 30% of active GKE clusters were created in Autopilot mode. The new container-optimized compute platform has also proved popular with customers, who report markedly faster provisioning times. The faster GKE provisions capacity, the more responsive your workloads become, improving your customers' experience and optimizing costs.
Today, we are pleased to announce that the best of Autopilot is now available in all qualified GKE clusters, not just dedicated Autopilot ones. Now, you can utilize Autopilot’s container-optimized compute platform and ease of operation from existing GKE clusters. It’s generally available, starting with clusters enrolled in the Rapid release channel and running GKE version 1.33.1-gke.1107000 or later. Most clusters will qualify and be able to access these new features as they roll out to the other release channels, except clusters enrolled in the Extended channel and those that use the older routes-based networking. To access these new features, enroll in the Rapid channel and upgrade your cluster version, or wait to be auto-upgraded.
Autopilot features are offered in Standard clusters via compute classes, which are a modern way to group and specify compute requirements for workloads in GKE. GKE now has two built-in compute classes, autopilot and autopilot-spot, that are pre-installed on all qualified clusters running on GKE 1.33.1-gke.1107000 or later and enrolled in the Rapid release channel. Running your workload on Autopilot’s container-optimized compute platform is as easy as specifying the autopilot (or autopilot-spot) compute class, like so:
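For example, a minimal sketch (the compute-class node selector is the documented mechanism; the Deployment itself is illustrative):

apiVersion: apps/v1
kind: Deployment
metadata:
  name: hello-autopilot
spec:
  replicas: 3
  selector:
    matchLabels:
      app: hello-autopilot
  template:
    metadata:
      labels:
        app: hello-autopilot
    spec:
      # Request the built-in Autopilot compute class
      # (use autopilot-spot for Spot pricing)
      nodeSelector:
        cloud.google.com/compute-class: autopilot
      containers:
      - name: hello
        image: us-docker.pkg.dev/google-samples/containers/gke/hello-app:1.0
        resources:
          requests:
            cpu: 250m
            memory: 512Mi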
Better still, you can make the Autopilot container-optimized compute platform the default for a namespace, a great way to save both time and money. You get efficient bin-packing, where the workload is charged for resource requests (and can even still burst!), rapid scaling, and you don’t have to plan your node shapes and sizes.
Here’s how to set Autopilot as your default for a namespace:
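A sketch, assuming the namespace-label mechanism GKE uses for default compute classes (verify the exact label key against the GKE compute class documentation):

apiVersion: v1
kind: Namespace
metadata:
  name: team-web
  labels:
    # Assumed label key for the namespace-level default compute class;
    # confirm in the GKE docs before relying on it.
    cloud.google.com/default-compute-class: autopilot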
Pod sizes for the container-optimized compute platform start at 50 milli-CPU (that's just 5% of 1 CPU core!), and can scale up to 28 vCPU. With the container-optimized compute platform you only pay for the resources your Pod requests, so you don't have to worry about system overhead or empty nodes. Pods larger than 28 vCPU, or those with specific hardware requirements, can also run in Autopilot mode on specialized compute with node-based pricing via customized compute classes.
Run AI workloads on GPUs and TPUs with Autopilot
It's easy to pair Autopilot's container-optimized compute platform with specific hardware such as GPUs, TPUs, and high-performance CPUs to run your AI workloads. You can run those workloads in the same cluster, side by side with Pods on the container-optimized compute platform. By choosing Autopilot mode for these AI workloads, you benefit from Autopilot's managed node properties, where we take a more active role in management. Furthermore, you also get our enterprise-grade privileged admission controls that require workloads to run in user-space, for better supportability, reliability, and an improved security posture.
Here’s how to define your own customized compute class that runs in Autopilot mode with specific hardware, in this example a G2 machine type with NVIDIA L4s with two priority rules:
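A sketch using the custom compute class CRD (machine shapes and names are illustrative, and the field that opts the class into Autopilot mode is not reproduced here, so consult the GKE docs for the exact spec):

apiVersion: cloud.google.com/v1
kind: ComputeClass
metadata:
  name: l4-inference
spec:
  # Two priority rules: prefer an on-demand G2 machine with one NVIDIA L4,
  # then fall back to Spot capacity with the same shape.
  priorities:
  - machineType: g2-standard-8
    gpu:
      type: nvidia-l4
      count: 1
  - machineType: g2-standard-8
    spot: true
    gpu:
      type: nvidia-l4
      count: 1
  nodePoolAutoCreation:
    enabled: true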
We’re also making compute classes work better with a new provisioning mode that automatically provisions resources for compute classes, without changing how other workloads are scheduled on existing node pools. This means you can now adopt the new deployment paradigm of compute class (including the new Autopilot-enabled compute classes) at your own pace, without affecting existing workloads and deployment strategies.
Until now, to use compute class in Standard clusters with automatic node provisioning, you needed to enable node auto-provisioning for the entire cluster. Node auto-provisioning has been part of GKE for many years, but it was previously an all-or-nothing decision — you couldn’t easily combine a manual node pool with a compute class provisioned by node auto-provisioning without potentially changing how workloads outside of the compute class were scheduled. Now you can, with our new automatically provisioned compute classes. All Autopilot compute classes use this system, so it’s easy to run workloads in Autopilot mode side-by-side with your existing deployments (e.g., on manual node pools). You can also enable this feature on any compute class starting with clusters in the Rapid channel running GKE version 1.33.3-gke.1136000 or later.
With the Autopilot mode for compute classes in Standard clusters, and the new automatic provisioning mode for all compute classes, you can now introduce compute class as an option to more clusters without impacting how any of your existing workloads are scheduled. Customers we’ve spoken to like this, as they can adopt these new patterns gradually for new workloads and by migrating existing ones, without needing to plan a disruptive switch-over.
Autopilot for all
At Google Cloud, we believe in the power of GKE’s Autopilot mode to simplify operations for your GKE clusters and make them more efficient. Now, those benefits are available to all GKE customers! To learn more about GKE Autopilot and how to enable it for your clusters, check out these resources.
Region switch in Amazon Application Recovery Controller (ARC) is now available in the Asia Pacific (New Zealand) Region. Region switch allows you to orchestrate the specific steps to operate your cross-AWS account application resources out of another AWS Region. It provides dashboards for real-time visibility into the recovery process and gathers data from across resources and accounts required for reporting to regulators and compliance teams. Region switch supports failover and failback for active/passive multi-Region approaches, and shift-away and return for active/active multi-Region approaches. When you create a Region switch plan, it is replicated to all the Regions your application operates in. This removes dependencies on the Region you are leaving for your recovery.
The role of the data scientist is rapidly transforming. For the past decade, their mission has centered on analyzing the past to run predictive models that informed business decisions. Today, that is no longer enough. The market now demands that data scientists build the future by designing and deploying intelligent, autonomous agents that can reason, act, and learn on behalf of the enterprise.
This transition moves the data scientist from an analyst to an agentic architect. But the tools of the past — fragmented notebooks, siloed data systems, and complex paths to production — create friction that breaks the creative flow.
At Big Data London, we are announcing the next wave of data innovations built on an AI-native stack, designed to address these challenges. These capabilities help data scientists move beyond analysis to action by enabling them to:
Stop wasting time context-switching. We’re delivering a single, intelligent notebook environment where you can instantly use SQL, Python, and Spark together, letting you build and iterate in one place instead of fighting your tools.
Build agents that understand the real world. We’re giving you native, SQL-based access to the messy, real-time data — like live event streams and unstructured data — that your agents need to make smart, context-aware decisions.
Go from prototype to production in minutes, not weeks. We’re providing a complete ‘Build-Deploy-Connect’ toolkit to move your logic from a single notebook into a secure, production-grade fleet of autonomous agents.
Unifying the environment for data science
The greatest challenge of data science productivity is friction. Data scientists live in a state of constant, forced context-switching: writing SQL in one client, exporting data, loading it into a Python notebook, configuring a separate Spark cluster for heavy lifting, and then switching to a BI tool just to visualize results. Every switch breaks the creative “flow state” where real discovery happens. Our priority is to eliminate this friction by creating the single, intelligent environment an architect needs to engineer, build, and deploy — not just run predictive models.
Today, we are launching fundamental enhancements to Colab Enterprise notebooks in BigQuery and Vertex AI. We’ve added native SQL cells (preview), so you can now iterate on SQL queries and Python code in the same place. This lets you use SQL for data exploration and immediately pipe the results into a BigQuery DataFrame to build models in Python. Furthermore, rich interactive visualization cells (preview) automatically generate editable charts from your data to quickly assess the analysis. This integration breaks the barrier between SQL, Python, and visualization, transforming the notebook into an integrated development environment for data science tasks.
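For instance, a query's results can be pulled straight into a BigQuery DataFrame and explored in Python. A small sketch using the open-source bigframes library against a public dataset:

import bigframes.pandas as bpd

# Read query results into a BigQuery DataFrame; computation stays in BigQuery
df = bpd.read_gbq("""
    SELECT name, SUM(number) AS total
    FROM `bigquery-public-data.usa_names.usa_1910_2013`
    GROUP BY name
""")

# Explore the result with familiar pandas-style operations
print(df.sort_values("total", ascending=False).head(10))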
But an integrated environment is only half the solution; it must also be intelligent. This is the power of our Data Science Agent, which acts as an "interactive partner" inside Colab. Recent enhancements to this agent mean it can now incorporate sophisticated tool usage (preview) within its detailed plans, including the use of BigQuery ML for training and inference, BigQuery DataFrames for analysis using Python, or large-scale Spark transformations. This means your analysis gets more advanced, your demanding workloads are more cost-effective to run, and your models get into production quicker.
In addition, we are also making our Lightning Engine generally available. The Lightning Engine accelerates Spark performance more than 4x compared to open-source Spark. And Lightning Engine is ML- and AI-ready by default, seamlessly integrating with BigQuery Notebooks, Vertex AI, and VS Code. This means you can use the same accelerated Spark runtime across your entire workflow in the tool of your choice — from initial exploration in a notebook to distributed training on Vertex AI. We're also announcing advanced support for Spark 4.0 (preview), bringing its latest innovations directly to you.
Building agents that understand the real world
Agentic architects build systems that will sense and respond to the world in real time. This requires access to data that has historically been siloed in separate, specialized systems such as live event streams and unstructured data. To address this challenge we are making real-time streams and unstructured data more accessible for data science teams.
First, to process real-time data using SQL we are announcing stateful processing for BigQuery continuous queries (preview). In the past, it was difficult to ask questions about patterns over time using just SQL on live data. This new capability changes that. It gives your SQL queries a “memory,” allowing you to ask complex, state-aware questions. For example, instead of just seeing a single transaction, you can ask, “Has this credit card’s average transaction value over the last 5 minutes suddenly spiked by 300%?” An agent can now detect this suspicious velocity pattern — which a human analyst reviewing individual alerts would miss — and proactively trigger a temporary block on the card before a major fraudulent charge goes through. This unlocks powerful new use cases, from real-time fraud detection to adaptive security agents that learn and identify new attack patterns as they happen.
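A hedged sketch of the fraud example in SQL (stateful continuous queries are in preview, so the exact syntax may differ; the schema and thresholds here are invented):

-- Continuously flag cards whose 5-minute average transaction value
-- spikes to 3x their trailing hourly average (illustrative schema).
INSERT INTO fraud.alerts (card_id, short_avg, long_avg, event_ts)
SELECT
  card_id,
  AVG(amount) OVER recent_5m   AS short_avg,
  AVG(amount) OVER trailing_1h AS long_avg,
  event_ts
FROM payments.transactions
WINDOW
  recent_5m AS (PARTITION BY card_id ORDER BY UNIX_SECONDS(event_ts)
                RANGE BETWEEN 300 PRECEDING AND CURRENT ROW),
  trailing_1h AS (PARTITION BY card_id ORDER BY UNIX_SECONDS(event_ts)
                  RANGE BETWEEN 3600 PRECEDING AND CURRENT ROW)
QUALIFY short_avg > 3 * long_avg;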
Second, we are removing the friction to build AI applications using a vector database, by helping data teams with autonomous embedding generation in BigQuery (preview) over multimodal data. Building on our BigQuery Vector Search capabilities, you no longer have to build, manage, or maintain a separate, complex data pipeline just to create and update your vector embeddings. BigQuery now takes care of this automatically as data arrives and as users search for new terms in natural language. This capability enables agents to connect user intent to enterprise data, and it’s already powering systems like the in-store product finder at Morrisons, which handles 50,000 customer searches on a busy day. Customers can use the product finder on their phones as they walk around the supermarket. By typing in the name of a product, they can immediately find which aisle a product is on and in which part of that aisle. The system uses semantic search to identify the specific product SKU, querying real-time store layout and product catalog data.
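This builds on BigQuery's existing VECTOR_SEARCH and ML.GENERATE_EMBEDDING primitives. A sketch of the query side (tables, model, and columns are invented; with autonomous embedding generation, the explicit embedding step shown here is handled for you):

-- Find the five products closest in meaning to a shopper's search term
SELECT base.product_name, base.aisle, distance
FROM VECTOR_SEARCH(
  TABLE store.products,   -- base table with a precomputed embedding column
  'embedding',
  (SELECT ml_generate_embedding_result AS embedding
   FROM ML.GENERATE_EMBEDDING(
     MODEL store.embedding_model,
     (SELECT 'gluten free pasta' AS content))),
  top_k => 5);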
Trusted, production-ready multi-agent development
When an analyst delivers a report, their job is done. When an architect deploys an autonomous application or agent, their job has just begun. This shift from notebook-as-prototype to agent-as-product introduces a critical new set of challenges: How do you move your notebook logic into a scalable, secure, and production-ready fleet of agents?
To solve this, we are providing a complete “Build-Deploy-Connect” toolkit for the agent architect. First, the Agent Development Kit (ADK) provides the framework to build, test, and orchestrate your logic into a fleet of specialized, production-grade agents. This is how you move from a single-file prototype to a robust, multi-agent system. And this agentic fleet doesn’t just find problems — it acts on them. ADK allows agents to ‘close the loop’ by taking intelligent, autonomous actions, from triggering alerts to creating and populating detailed case files directly in operational systems like ServiceNow or Salesforce.
A huge challenge until now was securely connecting these agents to your enterprise data, forcing developers to build and maintain their own custom integrations. To solve this, we launched first-party BigQuery tools directly integrated within ADK or via MCP. These are Google-maintained, secure tools that allow your agent to intelligently discover datasets, get table info, and execute SQL queries, freeing your team to focus on agent logic, not foundational plumbing. In addition, your agentic fleet can now easily connect to any data platform in Google Cloud using our MCP Toolbox. Available across BigQuery, AlloyDB, Cloud SQL, and Spanner, MCP Toolbox provides a secure, universal ‘plug’ for your agent fleet, connecting them to both the data sources and the tools they need to function.
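To make this concrete, here is a hedged sketch of an MCP Toolbox tools.yaml that exposes a parameterized BigQuery query as an agent tool (project, dataset, and tool names are invented; field names follow the open-source MCP Toolbox conventions, so verify against its documentation):

sources:
  analytics-bq:
    kind: bigquery
    project: my-project          # assumed project ID
tools:
  top_errors:
    kind: bigquery-sql
    source: analytics-bq
    description: Return the most common error codes for a service.
    statement: |
      SELECT error_code, COUNT(*) AS n
      FROM logs.app_errors       -- illustrative dataset and table
      WHERE service = @service
      GROUP BY error_code
      ORDER BY n DESC
      LIMIT 10
    parameters:
      - name: service
        type: string
        description: Service name to filter on.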
This “Build-Deploy-Connect” toolkit also extends to the architect’s own workflow. While ADK helps agents connect to data, the architect (the human developer) needs to manage this system using a new primary interface: the command line (CLI). To eliminate the friction of switching to a UI for data tasks, we are integrating data tasks directly into the terminal with our new Gemini CLI extensions for Data Cloud (preview). Through the agentic Gemini CLI, developers can now use natural language to find datasets, analyze data, or generate forecasts — for example, you can simply state gemini bq “analyze error rates for ‘checkout-service'” — and even pipe results to local tools like Matplotlib, all without leaving your terminal.
Architecting the future
These innovations transform the impact data scientists can have within the organization. Using an AI-native stack, we are now unifying the development environment in new ways, expanding data boundaries, and enabling trusted, production-ready development.
You can now automate tasks and use agents to become an agentic architect helping your organization to sense, reason, and act with intelligence. Ready to experience this transformation? Check out our new Data Science eBook with eight practical use cases and notebooks to get you started building today.
In June, Google introduced Gemini CLI, an open-source AI agent that brings the power of Gemini directly into your terminal. And today, we’re excited to announce open-source Gemini CLI extensions for Google Data Cloud services.
Building applications and analyzing trends with services like Cloud SQL, AlloyDB and BigQuery has never been easier — all from your local development environment! Whether you’re just getting started or a seasoned developer, these extensions make common data interactions such as app development, deployment, operations, and data analytics more productive and easier. So, let’s jump right in!
Using a Data Cloud Gemini CLI extension
Before you get started, make sure you have enabled the APIs and configured the IAM permissions required to access specific services.
To retrieve the newest functionality, install the latest release of the Gemini CLI (v0.6.0):
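A sketch of the install flow, assuming the npm distribution of the Gemini CLI and the gemini-cli-extensions GitHub organization these extensions are published from:

# Install or update the Gemini CLI
npm install -g @google/gemini-cli@latest

# Install a Data Cloud extension
gemini extensions install https://github.com/gemini-cli-extensions/<EXTENSION>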
Replace <EXTENSION> with the name of the service you want to use. For example, alloydb, cloud-sql-postgresql or bigquery-data-analytics.
Before starting the Gemini CLI, you’ll need to configure the extension to connect with your Google Cloud project by adding the required environment variables. The table below provides more information on the configuration required.
| Extension Name | Description | Configuration |
|----------------|-------------|---------------|
| alloydb | Create resources and interact with AlloyDB for PostgreSQL databases and data. | |
Now, you can start the Gemini CLI using the command gemini. You can view the installed extensions with the command /extensions, and list the MCP servers and tools included in an extension with the command /mcp list.
Using the Gemini CLI for Cloud SQL for PostgreSQL extension
The Cloud SQL for PostgreSQL extension lets you perform a number of actions. Some of the main ones are included below:
Create instance: Creates a new Cloud SQL instance for PostgreSQL (MySQL and SQL Server are also supported)
List instances: Lists all Cloud SQL instances in a given project
Get instance: Retrieves information about a specific Cloud SQL instance
Create user: Creates a new user account within a specified Cloud SQL instance, supporting both standard and Cloud IAM users
Curious about how to put it in action? Like any good project, start with a solid written plan of what you are trying to do. Then, you can provide that project plan to the CLI as a series of prompts, and the agent will start provisioning the database and other resources:
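For example, an opening prompt might look like this (illustrative, not taken from the original post):

> Create a Cloud SQL for PostgreSQL instance named blog-dev in project
  my-project in region us-central1, then create a database named blog
  and a user named blog_app.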
After configuring the extension to connect to the new database, the agent can generate the required tables based on the approved plan. For easy testing, you can prompt the agent to add test data.
Now the agent can use the context it has to generate an API to make the data accessible.
As you can see, these extensions make it incredibly easy to start building with Google Cloud databases!
Using the BigQuery Analytics extensions
For your analytical needs, we are thrilled to give you a first look at the Gemini CLI extension for BigQuery Data Analytics. We are also excited to give access to the Conversational Analytics API through the BigQuery Conversational Analytics extension. This is the first step in our journey to bring the full power of BigQuery directly into your local coding environment, creating an integrated and unified workflow.
With this extension you can:
Explore data: Use natural language to search for your tables.
Analyze: Ask business questions on the data and generate intelligent insights.
Dive deeper: Use conversational analytics APIs to dive deeper into the insights.
And extend: Use other tools or extensions to extend into advanced workflows like charting, reporting, code management, etc.
This initial release provides a comprehensive suite of tools to Gemini CLI:
Metadata tools: Discover and understand the BigQuery data landscape.
Query execution tool: Run any BigQuery query and get the results back, summarized to your console.
AI-powered forecasting: Leverage BigQuery's built-in AI.FORECAST function for powerful time-series predictions directly from the command line.
Deeper data insights: The "ask_data_insights" tool provides access to a server-side BigQuery agent for richer data insights.
And more …
[Note: To use the conversational analytics extension you need to enable additional APIs. Refer to documentation for additional info.]
Here is an example journey with analytics extensions:
Explore and analyze your data, e.g.:
> find tables related to PyPi downloads

✦ I found the following tables related to PyPi downloads:

 * file_downloads: projects/bigquery-public-data/datasets/pypi/tables/file_downloads
 * distribution_metadata: projects/bigquery-public-data/datasets/pypi/tables/distribution_metadata

> Using bigquery-public-data.pypi.file_downloads show me top 10 downloaded pypi packages this month

✦ Here are the top 10 most downloaded PyPI packages this month:

 1. boto3: 685,007,866 downloads
 2. botocore: 531,034,851 downloads
 3. urllib3: 512,611,825 downloads
 4. requests: 464,595,806 downloads
 5. typing-extensions: 459,505,780 downloads
 6. certifi: 451,929,759 downloads
 7. charset-normalizer: 428,716,731 downloads
 8. idna: 409,262,986 downloads
 9. grpcio-status: 402,535,938 downloads
 10. aiobotocore: 399,650,559 downloads
Run deeper insights
Use "ask_data_insights" to trigger an agent on BigQuery (via the Conversational Analytics API) to answer your questions. The server-side agent is smart enough to gather additional context about your data and offer deeper insights into your questions.
You can go further and generate charts and reports by mixing BigQuery data with your local tools. Here’s a prompt to try:
”using bigquery-public-data.pypi.file_downloads can you forecast downloads for the last four months of 2025 for package urllib3? Please plot a chart that includes actual downloads for the first 8 months, followed by the forecast for the last four months”
Get started today!
Ready to level up your Gemini CLI extensions for our Data Cloud services? Read more in the extensions documentation. Check out our templates and start building your own extensions to share with the community!
Written by: Sarah Yoder, John Wolfram, Ashley Pearson, Doug Bienstock, Josh Madeley, Josh Murchie, Brad Slaybaugh, Matt Lin, Geoff Carstairs, Austin Larsen
Introduction
Google Threat Intelligence Group (GTIG) is tracking BRICKSTORM malware activity, which is being used to maintain persistent access to victim organizations in the United States. Since March 2025, Mandiant Consulting has responded to intrusions across a range of industry verticals, most notably legal services, Software as a Service (SaaS) providers, Business Process Outsourcers (BPOs), and technology companies. The value of these targets extends beyond typical espionage missions, potentially providing data to feed development of zero-days and establishing pivot points for broader access to downstream victims.
BRICKSTORM Scanner: Get the tool at https://github.com/mandiant/brickstorm-scanner.
We attribute this activity to UNC5221 and closely related, suspected China-nexus threat clusters that employ sophisticated capabilities, including the exploitation of zero-day vulnerabilities targeting network appliances. While UNC5221 has been used synonymously with the actor publicly reported as Silk Typhoon, GTIG does not currently consider the two clusters to be the same.
These intrusions are conducted with a particular focus on maintaining long-term stealthy access by deploying backdoors on appliances that do not support traditional endpoint detection and response (EDR) tools. The actor employs methods for lateral movement and data theft that generate minimal to no security telemetry. This, coupled with modifications to the BRICKSTORM backdoor, has enabled them to remain undetected in victim environments for 393 days, on average. Mandiant strongly encourages organizations to reevaluate their threat model for appliances and conduct hunt exercises for this highly evasive actor. We are sharing an updated threat actor lifecycle for BRICKSTORM-associated intrusions, along with specific and actionable steps organizations should take to hunt for and protect themselves from this activity.
Figure 1: BRICKSTORM targeting
Threat Actor Lifecycle
The actor behind BRICKSTORM employs sophisticated techniques to maintain persistence and minimize the visibility traditional security tools have into their activities. This section reviews techniques observed across multiple Mandiant investigations, with customer details sanitized.
Initial Access
A consistent challenge across Mandiant investigations into BRICKSTORM intrusions has been determining the initial intrusion vector. In many cases, the average dwell time of 393 days exceeded log retention periods and the artifacts of the initial intrusion were no longer available. Despite these challenges, a pattern in the available evidence points to the actor’s focus on compromising perimeter and remote access infrastructure.
In at least one case, the actor gained access by exploiting a zero-day vulnerability. Mandiant has identified evidence of this actor operating from several other edge appliances early in the lifecycle, but could not find definitive evidence of vulnerability exploitation. As noted in our previous blog post from April 2025, Mandiant has identified the use of post-exploitation scripts that have included a wide range of anti-forensics functions designed to obscure entry.
Establish Foothold
The primary backdoor used by this actor is BRICKSTORM, as previously discussed by Mandiant and others. BRICKSTORM includes SOCKS proxy functionality and is written in Go, which has wide cross-platform support. This is essential to support the actor’s preference to deploy backdoors on appliance platforms that do not support traditional Endpoint Detection and Response (EDR) tools. Mandiant has found evidence of BRICKSTORM on Linux and BSD-based appliances from multiple manufacturers. Although there is evidence of a BRICKSTORM variant for Windows, Mandiant has not observed it in any investigation. Appliances are often poorly inventoried, not monitored by security teams, and excluded from centralized security logging solutions. While BRICKSTORM has been found on many appliance types, UNC5221 consistently targets VMware vCenter and ESXi hosts. In multiple cases, the threat actor deployed BRICKSTORM to a network appliance prior to pivoting to VMware systems. The actor moved laterally to a vCenter server in the environment using valid credentials, which were likely captured by the malware running on the network appliances.
Our analysis of samples recovered from different victim organizations has found evidence of active development of BRICKSTORM. While the core functionality has remained, some samples are obfuscated using Garble and some carry a new version of the custom wssoft library. Mandiant recovered one sample of BRICKSTORM with a “delay” timer built-in that waited for a hard-coded date months in the future before beginning to beacon to the configured command and control domain. Notably, this backdoor was deployed on an internal vCenter server after the victim organization had begun their incident response investigation, demonstrating that the threat actor was actively monitoring and capable of rapidly adapting their tactics to maintain persistence.
As previously reported, BRICKSTORM deployments are often designed to blend in with the target appliance, with the naming convention and even the functionality of the sample being designed to masquerade as legitimate activity. Mandiant has identified samples using Cloudflare Workers and Heroku applications for C2, as well as sslip.io or nip.io to resolve directly to C2 IP addresses. From the set of samples we’ve recovered, there has been no reuse of C2 domains across victims.
Escalate Privileges
In one investigation, Mandiant analyzed a vCenter server and found the threat actor installed a malicious Java Servlet filter for the Apache Tomcat server that runs the web interface for vCenter. A Servlet Filter is code that runs every time the web server receives an HTTP request. Normally, installing a filter requires modifying a configuration file and restarting or reloading the application; however, the actor used a custom dropper that made the modifications entirely in memory, making it very stealthy and negating the need for a restart. The malicious filter, tracked by Mandiant as BRICKSTEAL, runs on HTTP requests to the vCenter web login Uniform Resource Identifiers (URIs) /web/saml2/sso/*. If present, it decodes the HTTP Basic authentication header, which may contain a username and password. Many organizations use Active Directory authentication for vCenter, which means BRICKSTEAL could capture those credentials. Often, users who log in to vCenter have a high level of privilege in the rest of the enterprise. Previously shared hardening guidance for vSphere includes steps that can mitigate the ability of BRICKSTEAL to capture usable credentials in this scenario, such as enforcement of multi-factor authentication (MFA).
VMware vCenter is an attractive target for threat actors because it acts as the management layer for the vSphere virtualization platform and can take actions on VMs such as creating, snapshotting, and cloning. In at least two cases, the threat actor used their access to vCenter to clone Windows Server VMs for key systems such as Domain Controllers, SSO Identity Providers, and secret vaults. This is a technique that other threat actors have used. With a clone of the virtual machine, the threat actor can mount the filesystem and extract files of interest, such as the Active Directory Domain Services database (ntds.dit). Although these Windows Servers likely have security tools installed on them, the threat actor never powers on the clone so the tools are not executed. The following example shows vCenter VPXD logs of the threat actor using the local vSphere Administrator account to clone a VM.
2025-04-01 03:37:40 [vim.event.TaskEvent] [info] [VSPHERE.LOCAL\Administrator] [<vCenter inventory object>] [<unique identifier>] [Task: VirtualMachine.clone]
2025-04-01 03:37:49 [vim.event.VmBeingClonedEvent] [info] [VSPHERE.LOCAL\Administrator] [<vCenter inventory object>] [<same unique identifier>] [Cloning DC01 on esxi01, in <vCenter inventory object> to DC01-clone on esxi02, in <vCenter inventory object>]
2025-04-01 03:42:07 [vim.event.VmClonedEvent] [info] [VSPHERE.LOCAL\Administrator] [<vCenter inventory object>] [<unique identifier>] [DC01 cloned to DC01-clone on esxi02, in <vCenter inventory object>]
2025-04-01 04:05:40 [vim.event.TaskEvent] [info] [VSPHERE.LOCAL\Administrator] [<vCenter inventory object>] [<unique identifier>] [Task: VirtualMachine.destroy]
2025-04-01 04:05:47 [vim.event.VmRemovedEvent] [info] [VSPHERE.LOCAL\Administrator] [<vCenter inventory object>] [<unique identifier>] [Removed DC01-Clone on esxi02 from <vCenter inventory object>]
In one instance the threat actor used legitimate server administrator credentials to repeatedly move laterally to a system running Delinea (formerly Thycotic) Secret Server. The forensic artifacts recovered from the system were consistent with the execution of a tool, such as a secret stealer, to automatically extract and decrypt all credentials stored by the Secret Server application.
Move Laterally
Typically, at least one instance of BRICKSTORM would be the primary source of hands-on keyboard activity, with two or more compromised appliances serving as backups. To install BRICKSTORM, the actor used legitimate credentials to connect to the appliance, often with SSH. In one instance the actor used credentials known to be stored in a password vault they previously accessed. In another instance they used credentials known to be stored in a PowerShell script the threat actor previously viewed. In multiple cases the actor logged in to either the ESXi web-based UI or the vCenter Appliance Management Interface (VAMI) to enable the SSH service so they could connect and install BRICKSTORM. The following are example VAMI access events that show the threat actor connecting to VAMI and making changes to the SSH settings for vCenter.
To maintain access to victim environments, the threat actor modified the init.d, rc.local, or systemd files to ensure BRICKSTORM started on appliance reboot. In multiple cases, the actor used the sed command line utility to modify legitimate startup scripts to launch BRICKSTORM. The following are a few example sed commands executed by the actor on vCenter.
sed -i 's/export TEXTDOMAIN=vami-lighttp/export TEXTDOMAIN=vami-lighttp\n\/path\/to\/brickstorm/g' /opt/vmware/etc/init.d/vami-lighttp
sed -i '$aSETCOLOR_WARNING="echo -en `/path/to/brickstorm`\033[0;33m"' /etc/sysconfig/init
The threat actor has also created a web shell tracked by Mandiant as SLAYSTYLE on vCenter servers. SLAYSTYLE, tracked by MITRE as BEEFLUSH, is a JavaServer Pages (JSP) web shell that functions as a backdoor. It is designed to receive and execute arbitrary operating system commands passed through an HTTP request. The output from these commands is returned in the body of the HTTP response.
Complete Mission
A common theme across investigations is the threat actor’s interest in the emails of key individuals within the victim organization. To access the email mailboxes of target accounts, the threat actor made use of Microsoft Entra ID Enterprise Applications with mail.read or full_access_as_app scopes. Both scopes allow the application to access mail in any mailbox. In some cases, the threat actor targeted the mailboxes of developers and system administrators while in other cases, they targeted the mailboxes of individuals involved in matters that align with PRC economic and espionage interests.
When the threat actor exfiltrated files from the victim environment, they used the SOCKS proxy feature of BRICKSTORM to tunnel traffic from their workstation and directly access systems and web applications of interest. In multiple cases the threat actor used legitimate credentials to log in to the web interface for internal code stores and download repositories as ZIP archives. In other cases the threat actor browsed to specific directories and files on remote machines by specifying Windows Universal Naming Convention (UNC) paths.
In several cases the BRICKSTORM samples deployed by the threat actor were removed from compromised systems. In these cases, the presence of BRICKSTORM was identified through forensic analysis of backup images.
Hunting Guidance
Mandiant has previously discussed the diminishing usefulness of atomic IOCs and the need to adopt TTP-based hunting. Across BRICKSTORM investigations we have not observed the reuse of C2 domains or malware samples, which, coupled with high operational security, means these indicators quickly expire or are never observed at all. Therefore, a TTP-based hunting approach is not only an ideal practice, but a necessity to detect patterns of attack that are unlikely to be detected by traditional signature-based defenses. The following is a checklist of the minimal set of hunts Mandiant recommends organizations conduct to search for BRICKSTORM and related activities.
| Step | Hunt | Data Sources |
|------|------|--------------|
| 0 | Create or update asset inventory that includes edge devices and other appliances | N/A |
| 1 | File and backup scan for BRICKSTORM | Appliance file system, backups |
| 2 | Internet traffic from edge devices and appliances | Firewall connection logs, DNS logs, IDS/IPS, netflow |
| 3 | Access to Windows servers and desktops from appliances | EDR telemetry, Security Event Logs, Terminal Service Logs, Windows UAL |
| 4 | Access to credentials and secrets | Windows Shellbags, EDR telemetry |
| 5 | Access to M365 mailboxes using Enterprise Application | M365 UAL |
| 6 | Cloning of sensitive virtual machines | vSphere VPXD logs |
| 7 | Creation of local vCenter and ESXi accounts | VMware audit events |
| 8 | SSH enablement on vSphere platform | VMware audit events, VAMI logs |
| 9 | Rogue VMs | VMware audit events, VM inventory reports |
Create or Update Asset Inventory
Foundational to the success of any threat hunt is an asset inventory that includes devices not covered by the standard security tool stack, such as edge devices and other appliances. Because these appliances lack support for traditional security tools, an inventory is critical for developing effective compensating controls and detections. It is especially important to track the management interface addresses of these appliances, as these are the interfaces from which malware beaconing and threat actor commands will egress.
Mandiant recommends organizations take a multi-step approach to building or updating this inventory:
Known knowns: Begin with the appliance classes that all organizations use: firewalls, VPN concentrators, virtualization platforms, conferencing systems, badging, and file storage.
Known unknowns: Work across teams to brainstorm appliance classes that may be more specialized to your organization, but the security organization likely lacks visibility into.
Unknown unknowns: These are the appliances that were supposed to be decommissioned but weren’t, sales POVs, and others. Consider using network visibility tools or your existing EDR to scan for “live” IP addresses that do not show in your EDR reports. This has the added benefit of identifying unmanaged devices that should have EDR but don’t.
Figure 2: Asset inventory
File and Backup Scan for BRICKSTORM
YARA rules have proven to be the most effective method for detecting BRICKSTORM binaries on appliances. We are sharing relevant YARA rules in the appendix section of this post. YARA can be difficult to run at scale, but some backup solutions provide the ability to run YARA across the backup data store. Mandiant is aware of multiple customers who have identified BRICKSTORM through this method.
To aid organizations in hunting for BRICKSTORM activity in their environments, Mandiant released a scanner script, which can run on appliances and other Linux or BSD-based systems.
Internet Traffic from Edge Devices and Appliances
Use the inventory of appliance management IP addresses to hunt for evidence of malware beaconing in network logs. In general, appliances should not communicate with the public Internet from management IP addresses except to download updates and send crash analytics to the manufacturer.
Established outbound traffic to domains or IP addresses not controlled by the appliance manufacturer should be regarded as very suspicious and warranting forensic review of the appliance. BRICKSTORM can use DNS over HTTP (DoH), which should be similarly rare when sourced from appliance management IP addresses.
Access to Windows Systems from Appliances
The threat actor primarily accessed Windows machines (both desktops and servers) using type 3 (network) logins, although in some cases the actor also established RDP sessions. Appliances should rarely log in to Windows desktops or servers and any connections should be treated as suspicious. Some examples of false positives could include VPN appliances using a known service account to connect to a domain controller in order to perform LDAP lookups and authenticated vulnerability scanners using a well-known service account.
In addition to EDR telemetry, Terminal Services logs, and Security event logs, defenders should obtain and parse the Windows User Access Log (UAL). The UAL is stored on Windows Servers inside the directory Windows\System32\LogFiles\Sum and can be parsed using open-source tools such as SumECmd. This log source records attempted authenticated connections to Windows systems and often retains artifacts going back much longer than typical Windows event logs. Note that this log source includes successful and unsuccessful logins, but is still useful to identify suspicious activity sourced from appliances.
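For example, a hedged sketch of parsing a collected Sum directory with SumECmd (paths are illustrative):

# Copy the Sum directory off the server first, then parse it to CSV
SumECmd.exe -d C:\Cases\ServerA\Sum --csv C:\Cases\ServerA\out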
Access to Credentials and Secrets
Use the forensic capabilities of EDR tools to acquire Windows Shellbags artifacts from Windows workstations and servers. Shellbags records folder paths that are browsed by a user with the Windows Explorer application. Use an open-source parser to extract the relevant data and look for patterns of activity that are suspicious:
Access to folder paths where the initiating user is a service account, especially service accounts that are unfamiliar or rarely used
File browsing activity sourced from servers that include a Windows Universal Naming Convention (UNC) path that points to a workstation (e.g., \\bobwin7.corp.local\browsingpath)
File browsing activity to folder paths that contain credential data, such as:
AppData locations used to store session tokens (e.g., Users\<username>\.azure)
Windows credential vault (%localappdata%\Microsoft\Credentials)
Data Protection API (DPAPI) keys (%appdata%\Microsoft\Protect\<SID>)
Access to M365 Mailboxes using Enterprise Application
Mandiant has observed this actor use common techniques to conduct bulk email access and exfiltration from Microsoft 365 Exchange Online. Organizations should follow our guidance outlined in our APT29 whitepaper to hunt for these techniques. Although the white paper specifically references APT29, these techniques have become widely used by many groups. In multiple investigations the threat actor used a Microsoft Entra ID Enterprise Application with mail.read or full_access_as_app scopes to access mailboxes of key individuals in the victim organization.
To hunt for this activity, we recommend a phased approach:
Enumerate the Enterprise Applications and Application Registrations with graph permissions that can read all mail.
For each application, validate that there is at least one secret or certificate configured for it. Record the Application (client) ID
Conduct a free text search against the Unified Audit Log or the OfficeActivity table in Sentinel for the client IDs from step 2 (see the sketch after this list). This will return the MailItemsAccessed events that recorded the application accessing mail.
For each application analyze the source IP addresses and user-agent strings for discrepancies. Legitimate usage of the applications should occur from well-defined IP addresses. Additionally, look for focused interest in key personnel mailboxes across multiple days.
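A minimal sketch of step 3 in Sentinel (the client ID is a placeholder):

// Free-text search for MailItemsAccessed events recorded for a given app
OfficeActivity
| where Operation == "MailItemsAccessed"
| search "00000000-0000-0000-0000-000000000000"   // Application (client) ID from step 2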
When accessing M365 and other internet-facing services the actor has used multiple commercial VPN and proxy providers. Mandiant has found evidence of the threat actor using PIA, NordVPN, Surfshark, VPN Unlimited, and PrivadoVPN, although there is no reason for these to be the only solutions used. There is also evidence that this actor has access to a purpose-built obfuscation network built from compromised small office/home office routers. Mandiant has no knowledge of how these routers are being compromised. The exit nodes for commercial VPNs and obfuscation networks change rapidly, and sharing atomic indicators for hunting purposes is unlikely to yield results. Instead, identify the key individuals in the organization, with respect to the organization's vertical and the likely goals of the threat actor. Fetch MailItemsAccessed logs for those mailboxes for the last year or as long as retention allows. Analyze the SessionID values of the log events and look for IDs that span multiple IP addresses where the IP addresses are not in the user's typical geographic location.
Cloning of Sensitive Virtual Machines
On VMware vCenter servers, VPXD logs contain valuable information for VM management related tasks such as clone events, powering on and off a VM, and creating snapshots. The threat actor often used the VSPHERE.LOCAL\Administrator account when cloning VMs and targeted VMs that would contain credentials such as password vaults and domain controllers. The threat actor would delete the cloned VM shortly after cloning, and primarily operated between the hours of 01:00 and 10:00 UTC. Investigators should search vCenter VPXD logs for activity that matches the aforementioned criteria and confirm if the cloning activity was intended or not.
Creation of Local vCenter and ESXi Accounts
Mandiant identified evidence the threat actor created a new local account to install BRICKSTORM and then removed the account after they were done. The following logs show the threat actor using the local Administrator account to create a new local account and add it to the BashShellAdministrators group. The threat actor established an SSH connection from a compromised appliance to vCenter using the newly created account and installed the BRICKSTORM backdoor on vCenter. Shortly after, the threat actor deleted the account. Investigators should review audit logs in /var/log/audit/sso-events/audit_events.log for the creation and deletion of local accounts and validate their purpose. In one instance, the threat actor named the account with a similar naming convention as a local service account used for backups on vCenter.
2025-04-01T06:45:32 {"user":"Administrator@VSPHERE.LOCAL","client":"","timestamp":"04/01/2025 06:45:32 GMT","description":"Creating local person user '<account_name>' with details ('','','','','<account_name>@vsphere.local')","eventSeverity":"INFO","type":"com.vmware.sso.PrincipalManagement"}
2025-04-01T06:45:55 {"user":"Administrator@VSPHERE.LOCAL","client":"","timestamp":"04/01/2025 06:45:55 GMT","description":"Adding users '[{Name: <account_name>, Domain: vsphere.local}]' to local group 'Administrators'","eventSeverity":"INFO","type":"com.vmware.sso.PrincipalManagement"}
2025-04-01T06:46:23 {"user":"Administrator@VSPHERE.LOCAL","client":"","timestamp":"04/01/2025 06:46:23 GMT","description":"Updating local group 'SystemConfiguration.BashShellAdministrators' details ('Access bash shell and manage local users on nodes').","eventSeverity":"INFO","type":"com.vmware.sso.PrincipalManagement"}
2025-04-01T06:52:03 <vcenter_hostname> sshd[36952]: Postponed keyboard-interactive/pam for <account_name>@vsphere.local from <compromised_system>
2025-04-01T06:52:30 <vcenter_hostname> sudo: pam_unix(sudo:session): session opened for user root
2025-04-01T06:53:39 Creation of BRICKSTORM on vCenter
2025-04-01T06:56:18 <vcenter_hostname> sudo: pam_unix(sudo:session): session closed for user root
2025-04-01T06:56:25 <vcenter_hostname> sshd[36952]: pam_unix(sshd:session): session closed for user <account_name>@vsphere.local
2025-04-01T06:56:57 {"user":"Administrator@VSPHERE.LOCAL","client":"","timestamp":"04/01/2025 06:56:57 GMT","description":"Removing principals '[{Name: <account_name>, Domain: vsphere.local}]' from local group 'Administrators'","eventSeverity":"INFO","type":"com.vmware.sso.PrincipalManagement"}
2025-04-01T06:58:12 {"user":"Administrator@VSPHERE.LOCAL","client":"","timestamp":"04/01/2025 06:58:12 GMT","description":"Deleting principal '<account_name>'","eventSeverity":"INFO","type":"com.vmware.sso.PrincipalManagement"}
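Building on the log format shown above, a simple review script can pair account creations with deletions and flag short-lived accounts. This is a sketch that assumes each line is a timestamp followed by a JSON payload, as in the excerpt:

import json
import re
from datetime import datetime

# Pair local account creations with deletions in the SSO audit log and
# flag accounts that lived only briefly, a pattern seen in this campaign.
LINE = re.compile(r"^(\S+)\s+(\{.*\})$")
created, flagged = {}, []

with open("audit_events.log") as f:
    for raw in f:
        m = LINE.match(raw.strip())
        if not m:
            continue
        ts = datetime.fromisoformat(m.group(1))
        desc = json.loads(m.group(2)).get("description", "")
        if desc.startswith("Creating local person user"):
            created[desc.split("'")[1]] = ts
        elif desc.startswith("Deleting principal"):
            name = desc.split("'")[1]
            if name in created:
                flagged.append((name, created[name], ts - created[name]))

for name, start, lifetime in flagged:
    print(f"{name}: created {start}, deleted after {lifetime}")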
SSH Enablement on ESXi and vCenter
For ESXi servers, monitoring should be set up for SSH logins using local accounts. In most organizations, legitimate direct access to ESXi hosts over SSH is relatively rare, and in many cases the SSH server is disabled by default. Write rules to alert on log events when SSH is enabled for a vSphere platform appliance.
Rogue VMs
Organizations should review VMware audit events that track the creation and deletion of new VMs, particularly those using non-standard ISO images and operating systems. Audit events may also record the threat actor downloading archived ISO images to the datastore volumes used by vSphere.
Hardening Guidance
It is crucial to maintain an up-to-date inventory of appliances and other devices in the network that do not support the standard security tool stack. Any device in that inventory, whether internal or internet-facing, should be configured to follow a principle of least access.
Internet access: Appliances should not have unrestricted access to the internet. Work with your vendors or monitor your firewall logs to lock down internet access to only those domains or IP addresses that the appliance requires to function properly.
Internal network access: Appliances exposed to the internet should not have unrestricted access to internal IP address space. The management interface of most appliances does not need to establish connections to internal IP addresses. Work with the vendor to understand specific needs, such as LDAP queries to verify user attributes for VPN logins.
Mandiant has previously published guidance to secure the vSphere platform from threat actors. We recommend you follow the guidance, especially the forwarding of logs to a central SIEM, enabling vSphere lockdown mode, enforcing MFA for web logins, and enforcing the execInstalledOnly policy.
Organizations should assess and improve the isolation of any credential vaulting systems. In many cases, if a threat actor is able to gain access to the underlying operating system, any protected secrets can be exposed. Servers hosting credential vaulting applications should be considered Tier 0 systems and have strict access controls applied to them. Mandiant recommends organizations work with their vendors to adopt secure software practices, such as storing encryption keys in the Trusted Platform Module (TPM) of the server.
Outlook and Implications
Recent intrusion operations tied to BRICKSTORM likely serve an array of objectives, including geopolitical espionage, access operations, and intellectual property (IP) theft to enable exploit development. Based on evidence from recent investigations, the targeting of the US legal space is primarily to gather information related to US national security and international trade. Additionally, GTIG assesses with high confidence that the objective of BRICKSTORM targeting SaaS providers is to gain access to downstream customer environments or the data SaaS providers host on their customers' behalf. The targeting of technology companies presents an opportunity to steal valuable IP to further the development of zero-day exploits.
Acknowledgements
This analysis would not have been possible without the assistance from across Google Threat Intelligence Group, Mandiant Consulting and FLARE. We would like to specifically thank Nick Simonian from GTIG Research and Discovery (RAD).
Indicators of Compromise
The following indicators of compromise are available in a Google Threat Intelligence (GTI) collection. Note that Mandiant has not observed instances where the threat actor reused a malware sample and hunting for the exact indicators is unlikely to yield results.
Public sector agencies are under increasing pressure to operate with greater speed and agility, yet are often hampered by decades of legacy data. Critical information, essential for meeting tight deadlines and fulfilling mandates, frequently lies buried within vast collections of unstructured documents. This challenge of transforming institutional knowledge into actionable insight is a common hurdle on the path to modernization.
The Indiana Department of Transportation (INDOT) recently faced this exact scenario. To comply with Governor Mike Braun’s Executive Order 25-13, all state agencies were given 30 days to complete a government efficiency report, mapping all statutory responsibilities to their core purpose. For INDOT, the critical information needed to complete this report was buried in a mix of editable and static documents – decades of policies, procedures, and manuals scattered across internal sites. A manual review was projected to take hundreds of hours, making the deadline nearly impossible. This tight deadline necessitated an innovative approach to data processing and report generation.
Recognizing a complex challenge as an opportunity for transformation, INDOT’s leadership envisioned an AI-powered solution. The agency chose to build its pilot program on its existing Google Cloud environment, which allowed it to deploy Gemini’s capabilities immediately. By taking this strategic approach, the team was able to turn a difficult compliance requirement into a powerful demonstration of government efficiency.
From manual analysis to an AI-powered pilot in one week
Operating in an agile week-long sprint, INDOT’s team built an innovative workflow centered on Retrieval-Augmented Generation (RAG). This technique enhances generative AI models by grounding them in specific, private data, allowing them to provide accurate, context-aware answers.
The technical workflow began with data ingestion and pre-processing. The team quickly developed Python scripts to perform “Extract, Transform, Load” (ETL) on the fly, scraping internal websites for statutes and parsing text from numerous internal files. This crucial step cleaned and structured the data for the next stage: indexing. Using Vertex AI Search, they created a robust, searchable vector index of the curated documents, which formed the definitive knowledge base for the generative model.
With the data indexed, the RAG engine in Vertex AI could efficiently retrieve the most relevant document snippets in response to a query. This contextual information was then passed to Gemini via Vertex AI. This two-step process was critical, as it ensured the model’s responses were based solely on INDOT’s official documents, not on public internet data.
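A minimal sketch of that two-step flow using the Vertex AI SDK is shown below. The retrieval helper, project ID, and model name are illustrative stand-ins for INDOT's actual Vertex AI Search index and configuration:

import vertexai
from vertexai.generative_models import GenerativeModel

vertexai.init(project="your-project", location="us-central1")  # assumed IDs

def retrieve_snippets(query: str) -> list[str]:
    # Stand-in for step 1: query the Vertex AI Search index built from
    # the curated documents and return the most relevant snippets.
    raise NotImplementedError("wire up your search index here")

def answer(query: str) -> str:
    # Step 2: ground Gemini in the retrieved context only.
    snippets = retrieve_snippets(query)
    prompt = (
        "Answer using ONLY the context below.\n\n"
        "Context:\n" + "\n---\n".join(snippets) +
        f"\n\nQuestion: {query}"
    )
    model = GenerativeModel("gemini-1.5-pro")  # model name is illustrative
    return model.generate_content(prompt).text

Constraining the prompt to retrieved snippets is what keeps the model's answers anchored to the agency's official documents.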
Setting a new standard for government efficiency
Within an intensive, week-long effort, the team delivered a functioning pilot that generated draft reports across nine INDOT divisions with an impressive 98% fidelity – a measure of how accurately the new reports reflected the information in the original source documents. This innovative approach saved an estimated 360 hours of manual effort, freeing agency staff from tedious data collection to focus on the high-value work of refining and validating the reports. The solution enabled INDOT to become the largest Indiana state agency to submit its government efficiency report on time.
The government efficiency report was a novel experience for many on our executive team, demonstrating firsthand the transformative potential of large language models like Gemini. This project didn’t just help us meet a critical deadline; it paved the way for broader executive support of AI initiatives that will ultimately enhance our ability to serve Indiana’s transportation needs.
Alison Grand
Deputy Commissioner and Chief Legal Counsel, Indiana Department of Transportation
The AI-generated report framework was so effective that it became the official template for 60 other state agencies, powerfully demonstrating a responsible use of AI and building significant trust in INDOT as a leader in statewide policy. By building a scalable, secure RAG system on Google Cloud, INDOT not only met its tight deadline but also created a reusable model for future innovation, accelerating its mission to better serve the people of Indiana.
Join us at Google Public Sector Summit
To see Google’s latest AI innovations in action, and learn more about how Google Cloud technology is empowering state and local government agencies, register to attend the Google Public Sector Summit taking place on October 29 in Washington, D.C.
Editor’s note: Today’s post is by Syed Mohammad Mujeeb, CIO, and Arsalan Mazhar, Head of Infrastructure, of JS Bank, a prominent and rapidly growing midsize commercial bank in Pakistan with a strong national presence of over 293 branches. JS Bank, always at the forefront of technology, deployed a Google stack to modernize operations while maintaining security and compliance.
Snapshot:
JS Bank’s IT department, strained across 293 branches, was hindered by endpoint instability, a complex security stack, and a lack of device standardization. This reactive environment limited their capacity for innovation.
Through a strategic migration to a unified Google ecosystem—including ChromeOS, Google Workspace, and Google Cloud—the bank transformed its operations. The deployment of 1,500 Chromebooks resulted in a more reliable, secure, and manageable IT infrastructure. This shift cut device management time by 40% and halved daily support tickets, empowering the IT team to pivot from routine maintenance to strategic initiatives like digitization and AI integration.
Reduced IT burden: device management time cut by 40%
Daily support tickets were halved, freeing up IT time for strategic, value-added projects
Nearly 90% endpoint standardization, creating a manageable and efficient IT architecture
A simplified, powerful security posture with the built-in protection of ChromeOS and Google Workspace
At JS Bank, we pride ourselves on being technology pioneers, always bringing new technology into banking. Our slogan, “Barhna Hai Aagey,” means we are always moving onward and upward. But a few years ago, our internal IT infrastructure was holding us back. We researched and evaluated different solutions, but found the combination of ChromeOS and Google Workspace to be a perfect fit for today’s technology landscape, which is surrounded by cyber threats. When we shifted to a unified Google stack, we paved the way for a future driven by AI, innovation, and operational excellence.
Before our transformation, our legacy solution was functional, but it was a constant struggle. Our IT team was spread thin across our 293 branches, dealing with a cumbersome setup that layered numerous security tools, including antivirus and anti-malware, on top of each other. Endpoints crashed frequently, and with a mixture of older devices and some devices running Ubuntu, we lacked the standardization needed for true efficiency and security. It was a reactive environment, and our team was spending too much time on basic fixes rather than driving innovation.
We decided to make a strategic change to align with our bank’s core mission of digitization, and that meant finding a partner with an end-to-end solution. We chose Google because we saw the value in their integrated ecosystem and anticipated the future convergence of public and private clouds. We deployed 1,500 Chromeboxes across branches and fully transitioned to Google Workspace.
Today, we have achieved nearly 90% standardization across our endpoints with Chromebooks and Chromeboxes, all deeply integrated with Google Workspace. This shift has led to significant improvements in security, IT management, and employee productivity. The built-in security features of the Google ecosystem provide peace of mind, especially during periods of heightened cybersecurity threats, as we trust the platform to inherently protect us from cyberattacks. It has also simplified security protocols in branches, eliminating the need for multiple antivirus and anti-malware tools and giving our security team confidence. Moreover, the lightweight nature of the Google solutions ensures applications are available from anywhere, anytime, and simplifies deployments in branches.
To strengthen security across all corporate devices, we made Chrome our required browser. This provides foundational protections like Safe Browsing to block malicious sites, browser reporting, and password reuse alerts. For 1,500 users, we adopted Chrome Enterprise Premium. This provides features like zero-trust enterprise security, centralized management, data loss prevention (DLP) to protect against accidental data loss, secure access to applications with context-aware access restrictions, and scanning of high-risk files.
With Google, our IT architecture is now manageable. The team’s focus has fundamentally shifted from putting out fires to supporting our customers and building value. We’ve seen a change in our own employees, too; the teams who once managed our legacy systems are now eager to work within the Google ecosystem. From an IT perspective, the results are remarkable: the team required to manage the ChromeOS environment has shrunk to 40% of its previous size. Daily support tickets have been halved, freeing IT staff from hardware troubleshooting to focus on more strategic application support and enhancing their job satisfaction and career development. Our IT staff now enjoy less taxing weekends thanks to reduced work hours and a lighter operational burden.
Our “One Platform” vision comes to life
We are simplifying our IT architecture using Google’s ecosystem to achieve our “One Platform” vision. As a Google shop, we’ve deployed Chromebooks enterprise-wide and unified user access with a “One Window” application and single sign-on. Our “One Data” platform uses an Elasticsearch data lake on Google Cloud, now being connected to Google’s LLMs. This integrated platform provides our complete AI toolkit, from Gemini and NotebookLM to upcoming Document and Vision AI. By exploring Vertex AI, we are on track to become the region’s most technologically advanced bank by 2026.
Our journey involved significant internal change, but by trusting the process and our partners, we have built a foundation that is not only simpler and more secure but is also ready for the next wave of innovation. We are truly living our mission of moving onward and upward.
Today, we are announcing the availability of Route 53 Resolver Query Logging in Asia Pacific (New Zealand), enabling you to log DNS queries that originate in your Amazon Virtual Private Cloud (Amazon VPC). With query logging enabled, you can see which domain names have been queried, the AWS resources from which the queries originated – including source IP and instance ID – and the responses that were received.
Route 53 Resolver is the Amazon-provided DNS server that is available by default in all Amazon VPCs. Route 53 Resolver responds to DNS queries from AWS resources within a VPC for public DNS records, Amazon VPC-specific DNS names, and Amazon Route 53 private hosted zones. With Route 53 Resolver Query Logging, customers can log DNS queries and responses for queries originating from within their VPCs, whether those queries are answered locally by Route 53 Resolver, resolved over the public internet, or forwarded to on-premises DNS servers via Resolver Endpoints. You can share your query logging configurations across multiple accounts using AWS Resource Access Manager (RAM). You can also choose to send your query logs to Amazon S3, Amazon CloudWatch Logs, or Amazon Data Firehose.
There is no additional charge to use Route 53 Resolver Query Logging, although you may incur usage charges from Amazon S3, Amazon CloudWatch, or Amazon Data Firehose. To learn more about Route 53 Resolver Query Logging or to get started, visit the Route 53 Resolver product page or the Route 53 documentation.
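For example, a query logging configuration can be created and associated with a VPC through the Route 53 Resolver API. The boto3 sketch below uses illustrative resource names and ARNs:

import boto3

resolver = boto3.client("route53resolver")

# Create a query log config that delivers logs to a CloudWatch Logs group
# (an S3 bucket or Data Firehose stream ARN works here as well).
config = resolver.create_resolver_query_log_config(
    Name="vpc-dns-query-logs",
    DestinationArn="arn:aws:logs:REGION:123456789012:log-group:dns-queries",  # illustrative
    CreatorRequestId="example-request-1",
)["ResolverQueryLogConfig"]

# Associate the config with the VPC whose queries should be logged.
resolver.associate_resolver_query_log_config(
    ResolverQueryLogConfigId=config["Id"],
    ResourceId="vpc-0123456789abcdef0",  # illustrative VPC ID
)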
Amazon GameLift Servers now supports a new AWS Local Zone in Dallas, Texas (us-east-1-dfw-2). You can use this Local Zone to deploy GameLift Fleets with EC2 C6gn, C6i, C6in, M6g, M6i, M6in, M8g, and R6i instances. Local Zones place AWS services closer to major player population and IT centers where no AWS region exists. From the Amazon GameLift Servers Console, you can enable the Dallas Local Zone and add it to your fleets, just as you would with any other Region or Local Zone.
With this launch, game studios can run latency-sensitive workloads such as real-time multiplayer gaming, responsive AR/VR experiences, and competitive tournaments closer to players in the Dallas metro area. Local Zones help deliver single-digit millisecond latency, giving players a smoother, more responsive experience by reducing network distance between your servers and players.
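As a sketch, the Dallas Local Zone can also be added to an existing fleet programmatically with the CreateFleetLocations API (the fleet ID below is illustrative):

import boto3

gamelift = boto3.client("gamelift")

# Add the Dallas Local Zone as a remote location on an existing fleet.
gamelift.create_fleet_locations(
    FleetId="fleet-0123456789abcdef0",           # illustrative fleet ID
    Locations=[{"Location": "us-east-1-dfw-2"}],  # the new Dallas Local Zone
)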
Today, AWS eliminated the networking bandwidth burst duration limitations for Amazon EC2 I7i and I8g instances on sizes larger than 4xlarge. This update doubles the network bandwidth available at all times for these instance sizes. Previously, they had a baseline bandwidth and used a network I/O credit mechanism to burst beyond it on a best-effort basis; now they can sustain their maximum performance indefinitely. With this improvement, customers running memory- and network-intensive workloads on larger instance sizes can consistently maintain their maximum network bandwidth without interruption, delivering more predictable performance for applications that require sustained high-throughput network connectivity. This change applies only to instance sizes larger than 4xlarge; smaller instances will continue to operate with their existing baseline and burst bandwidth configurations.
Amazon EC2 I7i and I8g instances are designed for I/O-intensive workloads that require rapid, real-time data access from storage. These instances excel at handling transactional, real-time, distributed databases, including MySQL, PostgreSQL, and HBase, and NoSQL solutions like Aerospike, MongoDB, ClickHouse, and Apache Druid. They’re also optimized for real-time analytics platforms such as Apache Spark, data lakehouses, and AI LLM pre-processing for training. These instances have up to 1.5 TiB of memory and 45 TB of local instance storage. They deliver up to 100 Gbps of network bandwidth and 60 Gbps of dedicated bandwidth for Amazon Elastic Block Store (EBS).
Amazon EC2 Auto Scaling now enables customers to force cancel instance refreshes immediately, without waiting for in-progress instance launches or terminations to complete. This enhancement provides greater control over Auto Scaling group (ASG) updates, especially during emergency situations such as when needing to rapidly roll forward to a new application deployment when the current deployment is causing service disruptions. Customers can now quickly abort ongoing deployments and immediately start new instance refreshes when needed.
Instance refreshes are used to update instances within an ASG, typically when configuration changes require instance replacement. To use this feature, set the WaitForTransitioningInstances parameter to false when calling the CancelInstanceRefresh API. This enables faster cancellation of the instance refresh by bypassing the wait for any pending instance activities, such as instance lifecycle hooks.
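A minimal boto3 sketch of a force cancel followed by a fresh refresh, per the parameter named above (the ASG name is illustrative):

import boto3

autoscaling = boto3.client("autoscaling")

# Force cancel: return immediately instead of waiting for in-progress
# launches/terminations or lifecycle hooks to finish.
autoscaling.cancel_instance_refresh(
    AutoScalingGroupName="my-asg",  # illustrative ASG name
    WaitForTransitioningInstances=False,
)

# A new refresh (e.g., rolling forward to a fixed deployment) can then
# be started right away.
autoscaling.start_instance_refresh(AutoScalingGroupName="my-asg")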
This feature is available in all AWS Regions, including the AWS GovCloud (US) Regions. To get started, please visit the Amazon EC2 Auto Scaling User Guide.
Amazon DataZone is now available in AWS Asia Pacific (Hong Kong), Asia Pacific (Malaysia) and Europe (Zurich) Regions.
Amazon DataZone is a fully managed data management service to catalog, discover, analyze, share, and govern data between data producers and consumers in your organization. With Amazon DataZone, data producers populate the business data catalog with structured data assets from AWS Glue Data Catalog and Amazon Redshift tables. Data consumers search and subscribe to data assets in the data catalog and share with other collaborators working on the same business use case. Consumers can analyze their subscribed data assets with tools—such as Amazon Redshift or Amazon Athena query editors—that are directly accessed from the Amazon DataZone portal. The integrated publishing and subscription workflow provides access to auditing capabilities across projects.
For more information on AWS Regions where Amazon DataZone is available, see supported regions.
Additionally, Amazon DataZone powers governance in the next generation of Amazon SageMaker, which simplifies the discovery, governance, and collaboration of data and AI across your lakehouse, AI models, and GenAI applications. With Amazon SageMaker Catalog (built on Amazon DataZone), users can securely discover and access approved data and models using semantic search with generative AI–created metadata, or simply ask Amazon Q Developer in natural language to find their data. For more information on AWS Regions where the next generation of SageMaker is available, see supported regions. To learn more about the next generation of SageMaker, visit the product webpage.
As a Python library for accelerator-oriented array computation and program transformation, JAX is widely recognized for its power in training large-scale AI models. But its core design as a system for composable function transformations unlocks its potential in a much broader scientific landscape. Following our recent post on solving high-order partial differential equations, or PDEs, we’re excited to highlight another frontier where JAX is making a significant impact: AI-driven protein engineering.
I recently spoke with April Schleck and Nick Boyd, two co-founders of Escalante, a startup using AI to train models that predict the impact of drugs on cellular protein expression levels. Their story is a powerful illustration of how JAX’s fundamental design choices — especially its functional and composable nature — are enabling researchers to tackle multi-faceted scientific challenges in ways that are difficult to achieve with other frameworks.
A new approach to protein design
April and Nick explained that Escalante’s long-term vision is to train machine learning (ML) models that can design drugs from the ground up. Unlike fields like natural language processing, which benefit from vast amounts of public data, biology currently lacks the specific datasets needed to train models that truly understand cellular systems. Thus, their immediate focus is to solve this data problem by using current AI tools to build new kinds of lab assays that can generate these massive, relevant biological datasets.
This short-term mission puts them squarely in the field of protein engineering, which they described as a complex, multi-objective optimization problem. When designing a new protein, they aren’t just optimizing one thing: the protein needs to bind to a specific target while also being soluble, thermostable, and expressible in bacteria. Each of these properties is predicted by a different ML model (see figure below), ranging from complex architectures like AlphaFold 2 (implemented in JAX) to simpler, custom-trained models. Their core challenge is to combine all of these objectives into a single optimization loop.
This is where, as April put it, “JAX became a game-changer for us.” She noted that while combining many AI models might be theoretically possible in other frameworks, JAX’s functional nature makes it incredibly natural to integrate a dozen different ones into a single loss function (see figure below).
Easily combine multiple objectives represented by different loss terms and models
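A toy sketch of this pattern in JAX, using simple differentiable stand-ins in place of the real AlphaFold-, ESM-, Boltz-1-, and ProteinMPNN-style models:

import jax
import jax.numpy as jnp

# Stand-in scorers over a soft sequence (length x 20 logits).
def af_loss(seq):                    # linear term 1 (AlphaFold-style)
    return jnp.sum(seq ** 2)

def esm_pll(seq):                    # linear term 2 (ESM-style pseudo-LL)
    return -jnp.sum(jax.nn.log_softmax(seq, axis=-1).max(axis=-1))

def boltz_fold(seq):                 # serial composition: fold first ...
    return jnp.tanh(seq @ seq.T)

def mpnn_likelihood(structure):      # ... then score the predicted fold
    return -jnp.mean(structure)

@jax.jit
def total_loss(seq):
    # Linear combination of independent terms plus a serially composed term.
    return (1.0 * af_loss(seq)
            + 0.5 * esm_pll(seq)
            + mpnn_likelihood(boltz_fold(seq)))

grad_fn = jax.grad(total_loss)  # the whole graph differentiates end to end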
In the above code, Nick explained, models are combined in at least two different ways: some loss terms are linearly combined (e.g., the AF loss plus the ESM pseudo-log-likelihood loss), while other terms compose models serially (e.g., in the first Boltz-1 term, the sequence is first folded with Boltz-1, and the sequence likelihood is then computed after inverse folding with another model, ProteinMPNN).
To make this work, they embraced the JAX ecosystem, even translating models from PyTorch themselves — a prime example being their JAX translation of the Boltz-2 structure prediction model.
This approach gives what April called an “expressive language for protein design,” where models can be composed, added, and transformed to define a final objective. April said that the most incredible part is that this entire, complex graph of models “can be wrapped in a single jax.jit call that gives great performance” — something they found very difficult to do in other frameworks.
Instead of a typical training run that optimizes a model’s weights, their workflow inverts the process to optimize the input itself, using a collection of fixed, pre-trained neural networks as a complex, multi-objective loss function. The approach is mechanically analogous to Google’s DeepDream. Just as DeepDream takes a fixed, pre-trained image classifier and uses gradient ascent to iteratively modify an input image’s pixels to maximize a chosen layer’s activation, Escalante’s method starts with a random protein sequence. This sequence is fed through a committee of “expert” models, each one a pre-trained scorer for a different desirable property, like binding affinity or stability. The outputs from all the models are combined into a single, differentiable objective function. They then calculate the gradient of this final score with respect to the input sequence via backpropagation. An optimizer uses this gradient to update the sequence, nudging it in a direction that better satisfies the collective requirements of all the models. This cycle repeats, evolving the random initial input into a novel, optimized protein sequence that the entire ensemble of models “believes” is ideal.
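Continuing the toy sketch above, the design loop itself is a few lines of JAX plus Optax: differentiate the combined score with respect to the input and repeatedly update the sequence logits (total_loss is the function from the earlier sketch):

import jax
import jax.numpy as jnp
import optax

# Optimize the *input* (soft sequence logits), not model weights, against
# the fixed ensemble scored by total_loss.
key = jax.random.PRNGKey(0)
seq = jax.random.normal(key, (120, 20))  # random starting sequence logits

opt = optax.adam(learning_rate=1e-2)
opt_state = opt.init(seq)

@jax.jit
def step(seq, opt_state):
    loss, grads = jax.value_and_grad(total_loss)(seq)
    updates, opt_state = opt.update(grads, opt_state)
    return optax.apply_updates(seq, updates), opt_state, loss

for i in range(200):
    seq, opt_state, loss = step(seq, opt_state)
    if i % 50 == 0:
        print(f"step {i}: combined loss {loss:.3f}")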
Nick said that the choice of JAX was critical for this process. Its ability to compile and automatically differentiate complex code makes it ideal for optimizing the sophisticated loss functions at the heart of Mosaic, Escalante’s library of tools for protein design. Furthermore, the framework’s native integration with TPU hardware via the XLA compiler allowed them to easily scale these workloads.
Escalante samples many potential protein designs for a given problem by optimizing the loss function. Each sampling job might generate 1K–50K potential designs, which are then ranked and filtered; by the end of the process, they test only about 10 designs in the wet lab. This has led them to adopt a unique infrastructure pattern. Using Google Kubernetes Engine (GKE), they instantly spin up 2,000 to 4,000 spot TPUs, run their optimization jobs for about half an hour, and then shut them all down.
Nick also shared the compelling economics driving this choice. Given current spot pricing, adopting Cloud TPU v6e (Trillium) over an H100 GPU translated to a gain of 3.65x in performance per dollar for their large-scale jobs. He stressed that this cost-effectiveness is critical for their long-term goal of designing protein binders against the entire human proteome, a task that requires immense computational scale.
To build their system, they rely on key libraries within the JAX ecosystem like Equinox and Optax. Nick prefers Equinox because it feels like “vanilla JAX,” calling its concept of representing a model as a simple PyTree “beautiful and easy to reason about.” Optax, meanwhile, gives them the flexibility to easily swap in different optimization algorithms for their design loops.
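A small illustration of that “model as a PyTree” idea in Equinox, using a toy scorer rather than one of Escalante’s models:

import equinox as eqx
import jax
import jax.numpy as jnp

class ToyScorer(eqx.Module):
    linear: eqx.nn.Linear

    def __init__(self, key):
        self.linear = eqx.nn.Linear(20, 1, key=key)

    def __call__(self, seq):
        # Score each position, then pool to a scalar.
        return jnp.mean(jax.vmap(self.linear)(seq))

model = ToyScorer(jax.random.PRNGKey(0))
# The model *is* a PyTree: standard JAX transforms apply to it directly,
# and eqx.filter_grad differentiates with respect to its array leaves.
score_grads = eqx.filter_grad(lambda m, s: m(s))(model, jnp.ones((120, 20)))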
They emphasized that this entire stack — JAX’s functional core, its powerful ecosystem libraries, and the scalable TPU hardware — is what makes their research possible.
We are excited to see community contributions like Escalante’s Mosaic library, which is now available on GitHub. It’s a fantastic addition to the landscape of JAX-native scientific tools.
Stories like this highlight a growing trend: JAX is much more than a framework for deep learning. Its powerful system of program transformations, like grad and jit, makes it a foundational library for the paradigm of differentiable programming, empowering a new generation of scientific discovery. The JAX team at Google is committed to supporting and growing this vibrant ecosystem, and that starts with hearing directly from you.
Share your story: Are you using JAX to tackle a challenging problem?
Help guide our roadmap: Are there new features or capabilities that would unlock your next breakthrough?
Your feature requests are essential for guiding the evolution of JAX. Please reach out to the team to share your work or discuss what you need from JAX via GitHub.
Our sincere thanks to April and Nick for sharing their insightful journey with us. We’re excited to see how they and other researchers continue to leverage JAX to solve the world’s most complex scientific problems.