Configure the Automatic Memory Wrapper Agent

Configure the NVIDIA NeMo Agent Toolkit automatic memory wrapper agent as a workflow or a function.

Requirements

The automatic memory wrapper agent works with any memory backend that implements the MemoryEditor interface. The following memory plugins are available:

nvidia-nat-zep-cloud - Zep Cloud memory backend (Zep NVIDIA NeMo documentation)
nvidia-nat-mem0ai - Mem0 memory backend
nvidia-nat-redis - Redis memory backend

Configuration

The automatic memory wrapper agent may be utilized as a workflow or a function.

Example 1: Automatic Memory Wrapper Agent as a Workflow

To use the automatic memory wrapper agent as a workflow:

memory:
  zep_memory:
    _type: nat.plugins.zep_cloud/zep_memory

functions:
  my_react_agent:
    _type: react_agent
    llm_name: nim_llm
    tool_names: [calculator]

workflow:
  _type: auto_memory_agent
  inner_agent_name: my_react_agent
  memory_name: zep_memory
  llm_name: nim_llm

Example 2: Automatic Memory Wrapper Agent as a Function

To use the automatic memory wrapper agent as a function:

memory:
  zep_memory:
    _type: nat.plugins.zep_cloud/zep_memory

functions:
  my_agent_with_memory:
    _type: auto_memory_agent
    inner_agent_name: my_react_agent
    memory_name: zep_memory
    llm_name: nim_llm
    description: 'A ReAct agent with automatic memory'

Configurable Options

Required Parameters:

Parameter	Description
`inner_agent_name`	Name of the agent to wrap with automatic memory
`memory_name`	Name of the memory backend (from `memory:` section)
`llm_name`	LLM to use (required by `AgentBaseConfig`)

Optional Feature Flags (all default to true):

Parameter	Default	Description
`save_user_messages_to_memory`	`true`	Automatically save user messages before agent processing
`retrieve_memory_for_every_response`	`true`	Automatically retrieve and inject memory context
`save_ai_messages_to_memory`	`true`	Automatically save agent responses after generation

Memory Backend Parameters:

search_params: Passed to memory_editor.search() for memory retrieval configuration

search_params:
  mode: "summary"  # Zep: "basic" or "summary"
  top_k: 10        # Maximum memories to retrieve

add_params: Passed to memory_editor.add_items() for memory storage configuration

add_params:
  ignore_roles: ["assistant"]  # Zep: Exclude roles from graph memory

Other Options:

description: Defaults to "Auto Memory Agent Wrapper". When configured as a function, this allows control over the tool description.
verbose: Defaults to False (useful to prevent logging of sensitive data). If set to True, the wrapper will log memory operations and intermediate steps.

How the Automatic Memory Wrapper Agent Works

The automatic memory wrapper agent intercepts agent invocations and automatically handles memory operations:

Step-by-Step Execution Flow

User Message Reception – The wrapper receives the user's input message
Memory Retrieval (if retrieve_memory_for_every_response is true)
- Searches the memory backend for relevant context
- Injects retrieved memories into the agent's context
User Message Storage (if save_user_messages_to_memory is true)
- Stores the user's message in the memory backend
Agent Invocation – The wrapped agent processes the request with memory context
Response Storage (if save_ai_messages_to_memory is true)
- Stores the agent's response in the memory backend
Response Return – Returns the agent's response to the user

Example Walkthrough

Consider a conversation with automatic memory enabled:

First Interaction:

User: "My name is Alice and I prefer Python for data analysis."
Agent: "Nice to meet you, Alice! I'll remember your preference for Python."

The wrapper automatically:

Stores the user message
Invokes the inner agent
Stores the agent response

Later Interaction:

User: "What programming language should I use for my data project?"
Agent: "Based on what you told me earlier, I recommend Python for your data analysis project since that's your preferred language."

The wrapper automatically:

Retrieves relevant memories (Alice's name and Python preference)
Injects them into the agent's context
Agent can reference past conversations naturally

Multi-Tenant Memory Isolation

The automatic memory wrapper agent provides multi-tenant support through automatic user ID extraction. User ID is NOT configured in YAML but extracted at runtime.

User ID Extraction Priority

user_manager.get_id() - For production with custom auth middleware (recommended)
X-User-ID HTTP header - For testing without middleware
"default_user" - Fallback for local development

Production: Custom Middleware

Create middleware that extracts user ID from your authentication system:

from nat.runtime.session import SessionManager

class AuthenticatedUserManager:
    def __init__(self, user_id: str):
        self._user_id = user_id

    def get_id(self) -> str:
        return self._user_id

# In your request handler
async def handle_request(request):
    # Extract from JWT, OAuth, API key, etc.
    user_id = extract_user_from_jwt(request.headers["authorization"])

    async with session_manager.session(
        user_manager=AuthenticatedUserManager(user_id=user_id),
        http_connection=request,
    ) as session:
        result = await session.run(user_input)
    return result

Testing: X-User-ID Header

For quick testing without custom middleware:

curl -X POST http://localhost:8000/chat \
  -H "Content-Type: application/json" \
  -H "X-User-ID: test_user_123" \
  -H "conversation-id: test_conv_001" \
  -d '{"messages": [{"role": "user", "content": "Hello!"}]}'

Local Development: No Authentication

Omit both user_manager and X-User-ID header to use "default_user":

nat run --config examples/agents/auto_memory_wrapper/configs/config_zep.yml

Advanced Configuration Example

Here's a comprehensive configuration showing all available options:

memory:
  zep_memory:
    _type: nat.plugins.zep_cloud/zep_memory

llms:
  nim_llm:
    _type: nim
    model_name: meta/llama-3.3-70b-instruct
    temperature: 0.7

function_groups:
  calculator:
    _type: calculator

functions:
  my_react_agent:
    _type: react_agent
    llm_name: nim_llm
    tool_names: [calculator]
    verbose: true

workflow:
  _type: auto_memory_agent
  inner_agent_name: my_react_agent
  memory_name: zep_memory
  llm_name: nim_llm
  verbose: true
  description: "A ReAct agent with automatic Zep memory"

  # Feature flags (optional - all default to true)
  save_user_messages_to_memory: true
  retrieve_memory_for_every_response: true
  save_ai_messages_to_memory: true

  # Memory retrieval configuration (optional)
  search_params:
    mode: "summary"  # Zep: "basic" (fast) or "summary" (comprehensive)
    top_k: 5         # Maximum number of memories to retrieve

  # Memory storage configuration (optional)
  add_params:
    ignore_roles: ["assistant"]  # Zep: Exclude assistant messages from graph

Wrapping Different Agent Types

The automatic memory wrapper works with any agent type:

Wrapping a ReAct Agent

functions:
  my_react_agent:
    _type: react_agent
    llm_name: nim_llm
    tool_names: [calculator]

workflow:
  _type: auto_memory_agent
  inner_agent_name: my_react_agent
  memory_name: zep_memory
  llm_name: nim_llm

Wrapping a ReWOO Agent

functions:
  my_rewoo_agent:
    _type: rewoo_agent
    llm_name: nim_llm
    tool_names: [wikipedia_search, calculator]

workflow:
  _type: auto_memory_agent
  inner_agent_name: my_rewoo_agent
  memory_name: zep_memory
  llm_name: nim_llm

Wrapping a Tool Calling Agent

functions:
  my_tool_calling_agent:
    _type: tool_calling_agent
    llm_name: nim_llm
    tool_names: [weather_tool, calculator]

workflow:
  _type: auto_memory_agent
  inner_agent_name: my_tool_calling_agent
  memory_name: zep_memory
  llm_name: nim_llm

Important Notes

User ID is runtime-only - Set via user_manager or X-User-ID header, not in configuration
Memory backends are interchangeable - Works with any implementation of MemoryEditor interface
No memory tools needed - The wrapped agent does not need explicit memory tools configured
Transparent to inner agent - The wrapped agent is unaware of memory operations

Examples

For complete working examples, refer to:

examples/agents/auto_memory_wrapper - Full example with Zep Cloud integration

For additional information on memory backends and configuration, see:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configure the Automatic Memory Wrapper Agent

Requirements

Configuration

Example 1: Automatic Memory Wrapper Agent as a Workflow

Example 2: Automatic Memory Wrapper Agent as a Function

Configurable Options

How the Automatic Memory Wrapper Agent Works

Step-by-Step Execution Flow

Example Walkthrough

Multi-Tenant Memory Isolation

User ID Extraction Priority

Production: Custom Middleware

Testing: X-User-ID Header

Local Development: No Authentication

Advanced Configuration Example

Wrapping Different Agent Types

Wrapping a ReAct Agent

Wrapping a ReWOO Agent

Wrapping a Tool Calling Agent

Important Notes

Examples

FilesExpand file tree

auto-memory-wrapper.md

Latest commit

History

auto-memory-wrapper.md

File metadata and controls

Configure the Automatic Memory Wrapper Agent

Requirements

Configuration

Example 1: Automatic Memory Wrapper Agent as a Workflow

Example 2: Automatic Memory Wrapper Agent as a Function

Configurable Options

How the Automatic Memory Wrapper Agent Works

Step-by-Step Execution Flow

Example Walkthrough

Multi-Tenant Memory Isolation

User ID Extraction Priority

Production: Custom Middleware

Testing: X-User-ID Header

Local Development: No Authentication

Advanced Configuration Example

Wrapping Different Agent Types

Wrapping a ReAct Agent

Wrapping a ReWOO Agent

Wrapping a Tool Calling Agent

Important Notes

Examples