2025 08 01

GCP – Remember this: Agent state and memory with ADK

Imagine that you’re a developer, and your friend is learning to code for the first time. They’re struggling with some concepts, like Python dictionaries.

But you have an idea! What if you could design an AI agent that could help your friend learn complex topics in a conversational way? What if that agent could even be personalized to your friend’s learning style, remember your friend’s past performance, and adjust the learning plan in real time? With agent state and memory, all of this is possible. In this post, we’ll explore how, with Agent Development Kit (ADK).

The Python Tutor agent

We’ll start by designing a simple, conversational agent with ADK. This agent uses Gemini 2.5 Flash as its model, leveraging its reasoning capabilities. The agent also relies on a set of function tools that allow the agent to progress through a simple quiz on Python dictionaries.

For example, here’s the start_quiz tool, which kicks off a new quiz workflow. (More on state in a minute!)

code_block: <ListValue: [StructValue([(‘code’, ‘def start_quiz(tool_context: ToolContext) -> Dict[str, Any]:rn state = tool_context.statern # Initialize quiz statern state[“quiz_started”] = Truern state[“current_question_index”] = 0rn state[“correct_answers”] = 0rn state[“total_answered”] = 0rn state[“score_percentage”] = 0rn if quiz:rn return {rn “status”: “started”,rn “first_question”: quiz[0][0],rn “question_number”: 1,rn “total_questions”: len(quiz),rn }rn return {“status”: “error”, “error_message”: “No questions available”}’), (‘language’, ‘lang-py’), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54cb7bd2b0>)])]>

Overall, this is the quiz workflow we want our agent to take, using those tools together:

From there, we prompt our agent with system instructions to help the model reason its way through that workflow:

code_block: <ListValue: [StructValue([(‘code’, ‘QUIZ_INSTRUCTIONS = “””rnQUIZ MANAGEMENT PROCESS:rn1. **User identification**: Ask for their name if not providedrn2. **Memory check**: Search for their previous learning history using search_memory()rn3. **Personalized start**: Reference their past progress if found, or welcome new learnersrn4. **Quiz flow**:rn – When user wants to start: Use start_quiz()rn – Present questions clearly with proper formattingrn – When user answers: Use submit_answer(answer=”[user’s answer]”)rn – Provide immediate feedback:rn * If correct: Congratulate and continuern * If incorrect: Explain the concept thoroughly and continue. DO NOT GIVE THE USER A SECOND CHANCE TO ANSWER, just move on to the next question!rn * If quiz complete: Show final score and offer concept reviewrn5. **Progress tracking**: Use get_quiz_status() to monitor progressrn6. **Reset option**: Use reset_quiz() if they want to start overrn”””‘), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54cb7bdac0>)])]>

From here, we can define our ADK agent, and get cooking:

code_block: <ListValue: [StructValue([(‘code’, ‘root_agent = LlmAgent(rn model=”gemini-2.5-flash”,rn name=”python_tutor”,rn instruction=QUIZ_INSTRUCTIONS,rn tools=quiz_toolsrn)’), (‘language’, ‘lang-py’), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54cb7bdd90>)])]>

But how can we ask this agent to remember things that happen during a student’s session, like their name, or their progress through the quiz? This is where short-term memory comes into play.

Short-term memory

In the context of an AI agent, short-term memory is what the agent can remember within one session.

But what is a session? Think of a session like a phone call with a customer support representative—but you get a different rep every time you call. That representative can only remember what you’ve told them during that conversation. After you hang up the phone, all context is lost.

Short-term memory might sound like a bad thing, but it plays an important role in an AI agent. For instance, our Python Tutor agent might need to keep track of the user’s quiz progress, like the number of questions completed so far. The agent probably does not need to store that progress long-term—it might just need to store their final score.

Every user interaction with an ADK agent gets a session, and that session is managed by the ADK SessionService. Each session contains important fields, like the session ID, user ID, event history (the conversation thread), and the state.

What is session state? Think of it like the agent’s scratchpad during that “phone call” with the user. Each session’s state contains a list of key-value pairs, whose values are updated by the agent throughout the session.

ADK can write to session state a few different ways. One way is within a tool. We can use ADK’s ToolContext to get the current session state, and create or update fields:

code_block: <ListValue: [StructValue([(‘code’, ‘from google.adk.tools.tool_context import ToolContextrndef submit_answer(tool_context: ToolContext, answer: str) -> Dict[str, Any]:rn state = tool_context.statern i = state.get(“current_question_index”, 0)rn correct_answer = quiz[i][1]rn is_correct = answer.strip().lower() == correct_answer.strip().lower()rn state[“total_answered”] = state.get(“total_answered”, 0) + 1rn if is_correct:rn state[“correct_answers”] = state.get(“correct_answers”, 0) + 1’), (‘language’, ‘lang-py’), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54cb7bd220>)])]>

From there, we can instruct our Agent’s model to read from the current state fields using ADK’s key templating feature. All you have to do is wrap a state field in curly braces {}, and ADK will inject the state key’s value into your prompt, on each model call.

code_block: <ListValue: [StructValue([(‘code’, ‘prompt=”””rnINSTRUCTIONS:rn- Guide users through Python dictionaries with key concepts and examples.rn- Format code examples using markdown code blocks.rn- Use lots of friendly emojis in your responses, including for formatting.rn- Be encouraging and provide detailed explanations for incorrect answers.rnCURRENT SESSION STATE:rn- Current question index: {current_question_index}rn- Questions answered correctly: {correct_answers}rn- Total questions answered: {total_answered}rn- Current score: {score_percentage} %rn- Quiz started: {quiz_started}rn”””‘), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54c41e4ee0>)])]>

Here’s our Python Tutor agent updating those state fields in real-time:

By default, state fields only persist within the current session; once you start a new session, even as the same user, the values are gone. But ADK does have magic state key prefixes like user: and app: , which allow you to persist state key values either across all user sessions, or across all sessions with all users. These magic prefixes are useful if you have simple text settings you want to persist across sessions, like dark-mode=true.

So those are the basics of ADK’s short-term memory. But how does ADK store session and state data?

By default, the ADK web UI’s SessionService writes session data in memory. This means that if ADK’s runner crashes, or is shut down, all session data is lost. And if you’re running a scaled, production-grade agent with ADK, with multiple instances of your ADK agent, you can’t guarantee that user requests will always hit the same instance. This means that if request 1 goes to instance A, and request 2 goes to instance B, instance B won’t have the in-memory session state stored inside instance A.

So for production-grade agents, you should store session data outside of the agent’s runtime. ADK provides two ways of doing this. The first is the DatabaseSessionService: store session data in a SQL database, like SQLLite, MySQL, or PostgreSQL. This is easy to set up – all you need is a database. Then, you can pass your database’s URI into the ADK runner:

code_block: <ListValue: [StructValue([(‘code’, ‘uv run adk web –session_service_uri=”postgresql://$USERNAME:$PASSWORD@127.0.0.1:5432/pythontutor”‘), (‘language’, ”), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54c41c11f0>)])]>

From there, you can access your SQL database and see the session and state tables.

The other option is a VertexAISessionService, where you store session data in Agent Engine. This is a good option if you’re already using Agent Engine as your ADK runtime.

Long-term memory

We’ve covered how ADK stores data within a session. But what if you have data you want to persist across sessions with the same user? This is where ADK’s long-term memory comes in.

An ADK agent with long-term memory is like talking to the same customer service representative every time. And that representative has access to all key information from all your past conversations.

For instance, our Python Tutor agent might want to store quiz scores over time: how has the student improved? Long-term memories could also help the agent develop a personalized learning plan for the student: what topics has the student consistently struggled with?

ADK offers two ways to store these long-term memories. The default way is in memory, by using an InMemoryMemoryService. Here, all sessions are stored raw (with the full conversation thread), and can be retrieved by the agent in further sessions using a basic keyword search. This method is good for local development, but it has the same pitfall as the InMemorySessionService: if ADK restarts or crashes, all memories are lost forever. Another downside to this method is if you have a lot of past user sessions, and you’re retrieving the session’s event history in its raw form, you could overwhelm your agent’s model with too much context.

Luckily, ADK provides a way to store long-term memories persistently outside the ADK runtime, and that’s with a VertexAIMemoryBankService. This memory service uses Vertex AI Memory Bank (Preview) to intelligently store and retrieve memories from past user interactions.

Memory Bank uses the Gemini model to extract key information from session data, to store just the key memories for future use:

To store memories in Memory Bank, we implement a Callback in our Python Tutor agent to add the current session data to the bank:

code_block: <ListValue: [StructValue([(‘code’, ‘async def auto_save_to_memory_callback(callback_context):rn session = callback_context._invocation_context.sessionrn memory_service = VertexAiMemoryBankService(rn project=os.getenv(“GOOGLE_CLOUD_PROJECT”),rn location=os.getenv(“GOOGLE_CLOUD_LOCATION”, “us-central1”),rn agent_engine_id=agent_engine_id,rn )rn await memory_service.add_session_to_memory(session)rn…’), (‘language’, ‘lang-py’), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54c41c1460>)])]>

Then, we instruct our agent to retrieve those memories using a vector-based similarity search, by keyword.

code_block: <ListValue: [StructValue([(‘code’, ‘async def search_memory(query: str) -> list:rn memory_bank_service = VertexAiMemoryBankService(rn project=os.getenv(“GOOGLE_CLOUD_PROJECT”),rn location=os.getenv(“GOOGLE_CLOUD_LOCATION”),rn agent_engine_id=os.getenv(“AGENT_ENGINE_ID”),rn )rn search_results = await memory_bank_service.search_memory(rn app_name=”python-tutor-long-term”,rn user_id=”user”,rn query=”score”,rn )rn…’), (‘language’, ‘lang-py’), (‘caption’, <wagtail.rich_text.RichText object at 0x3e54c41c14f0>)])]>

Now, we have an updated quiz flow that can remember the user’s past quiz results:

Here’s the updated Python Tutor agent in action, using Memory Bank for long-term memory retrieval:

What’s happening under the hood, here? When our agent triggers memory generation in that after-agent Callback, Memory Bank uses Gemini to extract key information. So when the user starts a new quiz, triggering search_memory for past results, here’s what Memory Bank hands back:

Note that while Memory Bank is part of Vertex AI Agent Engine, you don’t need to run your ADK agent on Agent Engine’s Runtime to use this feature. For instance, you can run your agent on Cloud Run, and just integrate Memory Bank as your long-term memory store.

Get building!

To recap, short- and long-term memory play a key role in AI agents, allowing the agent to remember key information within and across user sessions, resulting in a more contextually-aware and personalized user experience. ADK supports a variety of session and memory storage options, from SQL databases to Vertex AI Agent Engine.

Check out the source code on GitHub to explore and deploy the Python Tutor agent yourself.

And to learn more, check out these resources.

Thanks for reading!

Image credits: Emoji Kitchen

GCP – Remember this: Agent state and memory with ADK

The Python Tutor agent

Short-term memory

Long-term memory

Get building!

Related Posts

AWS – SageMaker HyperPod now supports Managed tiered KV cache and intelligent routing

AWS – Amazon SageMaker HyperPod now supports custom Kubernetes labels and taints

AWS – Amazon Kinesis Video Streams now supports a new cost effective warm storage tier