
Feature/oracle agent spec integration #1566

Draft

afourniernv wants to merge 5 commits into NVIDIA:develop from
afourniernv:feature/oracle-agent-spec-integration

Conversation

@afourniernv

Overview

This PR represents a major refactoring and enhancement of the Agent-Spec integration work originally proposed in PR #1432 (YashaPushak:feat/agentspec). This branch addresses review feedback from @willkill07 and implements a more robust, production-ready Agent-Spec plugin that follows NeMo Agent Toolkit's standard plugin architecture patterns.

What This Branch Does

This branch implements a complete Oracle Agent-Spec integration plugin for NeMo Agent Toolkit, allowing users to run Agent-Spec YAML/JSON configurations as native NAT workflows with full observability, profiling, and evaluation capabilities.

Key Features

  1. Agent-Spec Workflow Wrapper: Converts Agent-Spec YAML/JSON configurations to LangGraph CompiledStateGraph components using pyagentspec's LangGraph adapter, then wraps them as NAT Functions following the same pattern as LanggraphWrapperFunction.

  2. Flexible Configuration Support:

    • File-based: spec_file for referencing Agent-Spec YAML/JSON files
    • Inline YAML/JSON: spec_yaml and spec_json for programmatic/dynamic configurations
    • Validator ensures exactly one source is provided
  3. NAT Tool Integration:

    • tool_names field supports referencing NAT tools via FunctionRef or FunctionGroupRef
    • Automatic tool resolution through builder.get_tools()
    • Merges NAT tools with Agent-Spec-defined tools (NAT tools take precedence)
  4. Component Reuse Support:

    • components_registry for manually mapping component IDs to NAT-managed components (e.g., LLMs)
    • Documented limitations and best practices for component reuse with embedded Agent-Spec configs
  5. Conversation History Management:

    • max_history field for trimming message history before execution
    • Uses langchain_core.messages.trim_messages to limit context size
  6. Auto-detection Features:

    • Automatically detects ClientTool usage and configures MemorySaver checkpointer when needed

Architecture

This implementation follows NeMo Agent Toolkit's standard plugin architecture:

  • Plugin Registration: Uses @register_function decorator with proper framework wrappers (LLMFrameworkEnum.LANGCHAIN)
  • Function Wrapper Pattern: AgentSpecWrapperFunction follows the same pattern as LanggraphWrapperFunction, handling input/output conversion and delegating execution to the underlying LangGraph
  • Separate Package: Implemented as nvidia-nat-agent-spec package with proper dependencies (nvidia-nat-core, pyagentspec[langgraph,langgraph_mcp])
  • Type Safety: Full Pydantic models for configuration (AgentSpecWrapperConfig) and input/output (AgentSpecWrapperInput, AgentSpecWrapperOutput)
  • Error Handling: Comprehensive error handling with clear error messages

Changes from Original PR #1432

This branch addresses several architectural concerns raised in the review:

  1. Enhanced E2E Test Validation: Added comprehensive validation in smoke tests (graph type, message types, content checks)
  2. Type Annotations: Added explicit return type annotation (-> Self) to _validate_sources validator
  3. Documentation: Documented component reuse limitations and best practices for embedded Agent-Spec configurations
  4. Package Structure: Maintains separate package structure (not an optional dependency group)
  5. Code Organization: Implementation details properly separated from configuration models

Testing

  • 55 unit tests covering:

    • Configuration validation (file, YAML, JSON sources)
    • Tool integration (tool_names, tool_registry)
    • Component registry functionality
    • Message trimming (max_history)
    • Error handling and edge cases
  • 2 end-to-end integration tests (smoke tests):

    • File-based Agent-Spec YAML execution
    • Inline YAML execution
    • Tests skip gracefully without NVIDIA_API_KEY

Usage Example

functions:
  agent_spec_workflow:
    _type: agent_spec_wrapper
    spec_file: path/to/agent_spec.yaml
    tool_names:
      - FunctionRef("wiki_search")
      - FunctionGroupRef("search_tools")
    max_history: 20
    description: "Agent-Spec workflow with NAT tool integration"
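
The max_history setting above amounts to a last-N trim of the message list, conceptually what langchain_core.messages.trim_messages does when counting messages rather than tokens. This sketch uses plain strings in place of message objects and omits system-message handling:

```python
def trim_history(messages: list, max_history: int) -> list:
    # Keep only the most recent `max_history` entries; shorter
    # histories pass through unchanged.
    if len(messages) <= max_history:
        return list(messages)
    return list(messages[-max_history:])
```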

Important: Dependency Conflicts

The agent-spec extra cannot be installed alongside langchain, most, or vanna extras due to incompatible langchain-core/langgraph version requirements:

  • pyagentspec requires: langchain-core<1.0.0, langgraph<1.0.0
  • nvidia-nat-langchain requires: langchain-core>=1.2.6, langgraph>=1.0.5

Users must choose one or the other, or use separate environments.

Lock File Updates

The uv.lock file has been updated to include dependencies for the new nvidia-nat-agent-spec package. This is expected and necessary when adding a new package.

@copy-pr-bot

copy-pr-bot bot commented Feb 4, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@coderabbitai

coderabbitai bot commented Feb 4, 2026

Important

Review skipped

Draft detected.


Implement nvidia-nat-agent-spec plugin that bridges Oracle Agent-Spec YAML
configurations with NeMo Agent Toolkit, enabling Agent-Spec agents to run as
NAT Functions with full access to NAT's evaluation, profiling, and observability.

Implementation:
- AgentSpecWrapperFunction: Wraps LangGraph CompiledStateGraph as NAT Function
- AgentSpecWrapperConfig: Configuration model supporting tool_registry,
  components_registry, and auto checkpointer detection
- Two-layer adapter pattern: Oracle's AgentSpecLoader (YAML→LangGraph) +
  NAT wrapper (LangGraph→NAT Function)
- Supports multiple input formats (strings, lists, message objects)
- Auto-detects ClientTool and creates MemorySaver checkpointer when needed

Features:
- Tool registry: Wire up custom functions/tools via config
- Components registry: Override LLMs/components (e.g., inject NIM LLMs)
- Multiple input formats: Flexible input handling via convert_to_messages
- Full NAT integration: Works with eval, profiler, observability, middleware

Test Coverage:
- Unit tests for AgentSpecWrapperFunction methods (_convert_input, _ainvoke,
  _astream, convert_to_str)
- Registration function tests (tool registry, checkpointer, components registry)
- Integration tests (tool usage, NIM LLM integration)
- Test fixtures: Minimal agent, NIM agent, weather agent with tools

Dependencies:
- Resolves langchain-core/langgraph version conflicts via override-dependencies
- Adds agent-spec extra to pyproject.toml
- Updates uv.lock with new dependencies

Signed-off-by: afourniernv <afournier@nvidia.com>
- Add nvidia-nat-agent-spec package to workspace with proper dependency management
- Configure plugin to use setuptools_dynamic_dependencies pattern (matching langchain plugin)
- Update plugin to depend on nvidia-nat-core instead of root nvidia-nat package
- Add conflict declarations in root pyproject.toml for agent-spec vs langchain/vanna/most extras
  (pyagentspec requires langchain-core<1.0.0 while nvidia-nat-langchain requires >=1.2.6)
- Add comprehensive README documenting installation, usage, known limitations, and testing

Signed-off-by: afourniernv <afournier@nvidia.com>
…rations

Add support for providing Agent-Spec configurations inline as YAML or JSON
strings in addition to file paths. This enables more flexible configuration
options for users who want to embed Agent-Spec configs directly in workflow
definitions.

Changes:
- Add spec_yaml and spec_json fields to AgentSpecWrapperConfig
- Make spec_file optional (FilePath | None)
- Add validator to ensure exactly one source (file/yaml/json) is provided
- Update register() to handle all three formats with proper format detection
- Support both .yaml and .json file extensions
- Maintain backward compatibility with existing spec_file usage

Tests:
- Add 8 new test cases covering inline YAML/JSON scenarios
- Test validator enforcement (exactly one source required)
- Test format detection for JSON files
- Test components_registry integration with inline formats
- Test ClientTool auto-detection in inline YAML/JSON

All 21 tests passing (13 existing + 8 new)

Signed-off-by: afourniernv <afournier@nvidia.com>
…istory trimming

- Add inline YAML/JSON support: Allow Agent-Spec configs via spec_yaml and spec_json fields
  - Make spec_file optional with validator ensuring exactly one source is provided
  - Support both YAML and JSON formats for inline content and file-based configs

- Add tool_names integration: Enable NAT tool discovery via FunctionRef/FunctionGroupRef
  - Integrate with builder.get_tools() to automatically resolve NAT-registered tools
  - Merge discovered tools with manual tool_registry (tool_registry takes precedence)
  - Support both individual function references and function group references

- Add max_history message trimming: Implement conversation history truncation
  - Add max_history field (default: 15) to AgentSpecWrapperConfig
  - Use langchain_core.messages.trim_messages to limit message count before execution
  - Preserve full input state while only trimming message history

- Consolidate tests: Use pytest.mark.parametrize for YAML/JSON test pairs
  - Reduce code duplication by parameterizing format-based tests
  - Maintain same test coverage with cleaner, more maintainable code

- Add end-to-end smoke tests: Verify full integration with real LLM calls
  - Test Agent-Spec YAML file loading and execution
  - Test inline YAML support end-to-end
  - Marked as integration tests that skip gracefully without NVIDIA_API_KEY

- Update minimal_agent.yaml fixture: Use OpenAiCompatibleConfig for NIM compatibility

Tests: All 29 unit tests + 2 end-to-end tests passing
Signed-off-by: afourniernv <afournier@nvidia.com>
- Enhanced E2E test validation (graph type, message types, content checks)
- Added explicit return type annotation (-> Self) to _validate_sources
- Documented component reuse limitation in class docstring and field description

Signed-off-by: afourniernv <afournier@nvidia.com>
@afourniernv afourniernv force-pushed the feature/oracle-agent-spec-integration branch from 10672ac to c4b8b71 Compare February 4, 2026 20:44
@willkill07 willkill07 added feature request New feature or request non-breaking Non-breaking change labels Feb 10, 2026
description="Path to the Agent-Spec YAML/JSON configuration file. "
"Recommended for NAT YAML workflow configurations."
)
spec_yaml: str | None = Field(


Here, the workflow config object contains only a stringified version of an Agent-Spec config. This means that when using the optimizer, it cannot navigate this config to discover parameters that can be optimized, nor the values/ranges they should be optimized over.

I see two ways to make this work.

Using a dynamic basemodel

This is what I'm currently experimenting with. I basically added two attributes to this workflow config: a "dynamic_optimizable_parameters" field that is empty initially, and an "optimizable_parameters_dict" dict.
It looks like this in the config:

workflow:
  _type: agent_spec_wrapper
  spec_file: stage_2.yaml

  # this is required for adding dynamically the optimizable parameters in the config object
  optimizable_params:
    - dynamic_optimizable_parameters

  # dict for optimize-able parameters with values
  optimizable_parameters_dict:
    temperature_ref:
      range:
        low: 0.0
        high: 1.0
        step: 0.1
    max_tokens_ref:
      range:
        low: 2048
        high: 8096
        step: 2048

Then, as a post-validation of the AgentSpecWrapperConfig object, I create a dynamic base-model:

@model_validator(mode="after")
def _build_prompts_opt(self) -> Self:
    fields: dict[str, tuple[type, Any]] = {}
    for var_name, var in self.optimizable_parameters_dict.items():
        if var.values is not None:
            fields[var_name] = (
                dict,
                OptimizableField(
                    default=var.values[0],
                    space=SearchSpace(values=var.values),
                ),
            )
        elif var.range is not None:
            fields[var_name] = (
                dict,
                OptimizableField(
                    default=var.range.low,
                    space=SearchSpace(low=var.range.low, high=var.range.high, step=var.range.step),
                ),
            )
        else:
            raise ValueError("Not supported")

    DynamicPrompts = create_model("DynamicPrompts", **fields)
    self.dynamic_optimizable = DynamicPrompts()
    return self

This allows the optimizer to still see all the parameters inside the config, even if these attributes are not decorated with OptimizableField.

Cleaner approach: patch our config objects with the "optimizable" annotation

We also represent config objects as base models, so one way would be to extend the loading of the workflow config with our own loading logic, and patch our base models with the optimizable field annotator, for example on temperature like in https://github.com/oracle/agent-spec/blob/main/pyagentspec/src/pyagentspec/llms/llmgenerationconfig.py#L23.

This would allow a more seamless and native use of the optimizer with Agent-Spec configs.

Do you see another potential solution, and what do you think about these?

"ClientTool in Agent-Spec YAML. If not provided and ClientTool is detected, "
"MemorySaver will be used automatically.",
)
max_history: int = Field(


We support this concept natively in Agent-Spec; I don't think we need it here (we can also remove the part about pruning the history before the invoke/stream calls).

logger = logging.getLogger(__name__)


class AgentSpecWrapperInput(BaseModel):


We would need to extend this to also accept a dict of inputs, since some Agent-Spec configs take named inputs rather than a list of messages.

messages: list[BaseMessage]


class AgentSpecWrapperOutput(BaseModel):


Similarly, we would need to support outputting a dict of outputs. Do you already support handling such input/output dictionaries, and how does it play with the evaluator? (The mappings question -> WrapperInput and WrapperOutput -> output are defined below in AgentSpecWrapperFunction; is that what we always want?)

Comment on lines +344 to +345
if config.tool_registry:
tool_registry.update(config.tool_registry)

We should raise an error on any intersection of tool names rather than blindly take precedence
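
A minimal sketch of such a collision check, assuming both registries are plain name-to-tool dicts (all names here are illustrative, not the actual implementation):

```python
def merge_tool_registries(resolved: dict, manual: dict) -> dict:
    # Fail loudly on duplicate tool names instead of letting one
    # registry silently shadow the other.
    overlap = sorted(resolved.keys() & manual.keys())
    if overlap:
        raise ValueError(f"Conflicting tool names: {overlap}")
    return {**resolved, **manual}
```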

Comment on lines +284 to +309
# Determine the format and read the Agent-Spec content
if config.spec_file:
# Read from file
spec_path = Path(config.spec_file)
if not spec_path.exists():
raise ValueError(f"Agent-Spec file '{spec_path}' does not exist.")

with open(spec_path, "r", encoding="utf-8") as f:
spec_content = f.read()

# Determine format from file extension
ext = spec_path.suffix.lower()
spec_format = "json" if ext == ".json" else "yaml"
source_description = f"file '{spec_path}'"
elif config.spec_yaml:
# Use inline YAML
assert config.spec_yaml is not None # Type narrowing: validator ensures this is set
spec_content = config.spec_yaml
spec_format = "yaml"
source_description = "inline YAML"
else:
# Use inline JSON (config.spec_json)
assert config.spec_json is not None # Type narrowing: validator ensures this is set
spec_content = config.spec_json
spec_format = "json"
source_description = "inline JSON"

Hoist as a _load_spec function?

Comment on lines +277 to +281
"Failed to import pyagentspec.adapters.langgraph. "
"Install pyagentspec with langgraph extras:\n"
" uv pip install \"pyagentspec[langgraph,langgraph_mcp]>=26.1.0\"\n\n"
"Note: The root pyproject.toml includes override-dependencies to resolve "
"langchain-core version conflicts between pyagentspec and nvidia-nat-langchain."

This should be a message to install nvidia-nat[agent_spec]

# version when adding a new package. If unsure, default to using `~=` instead of `==`. Does not apply to nvidia-nat packages.
# Keep sorted!!!
"nvidia-nat-core == {version}",
"pyagentspec[langgraph,langgraph_mcp]>=26.1.0",

Is unbounded upper-bound really safe? I'd rather see: ~=26.1 (>=26.1,<27.0)

Comment on lines +158 to +159
# These version ranges are incompatible and cannot coexist in the same workspace
# Users must choose one or the other, or install agent-spec in a separate environment

remove unnecessary wording

Suggested change
# These version ranges are incompatible and cannot coexist in the same workspace
# Users must choose one or the other, or install agent-spec in a separate environment

Comment on lines +37 to +40
@pytest.mark.skipif(
not os.getenv("NVIDIA_API_KEY"),
reason="Requires NVIDIA_API_KEY environment variable"
)

Prefer nvidia_api_key fixture as parameter

Comment on lines +119 to +122
@pytest.mark.skipif(
not os.getenv("NVIDIA_API_KEY"),
reason="Requires NVIDIA_API_KEY environment variable"
)

Prefer nvidia_api_key fixture as parameter

Comment on lines +74 to +77
@pytest.mark.skipif(
not os.getenv("NVIDIA_API_KEY"),
reason="Requires NVIDIA_API_KEY environment variable"
)

Prefer nvidia_api_key fixture as parameter

component_type: OpenAiCompatibleConfig
name: "NIM LLM"
model_id: "meta/llama-3.1-8b-instruct"
# NIM endpoint - will use NVIDIA_API_KEY from environment

Is this true? because it seems like you have to set OPENAI_API_KEY

name: "NIM LLM"
model_id: "meta/llama-3.1-8b-instruct"
# NIM endpoint - will use NVIDIA_API_KEY from environment
url: "https://integrate.api.nvidia.com/v1"
