feat(profiler): ATIF-native profiler by afourniernv · Pull Request #1751 · NVIDIA/NeMo-Agent-Toolkit

afourniernv · 2026-03-05T17:47:05Z

NAT-366: ATIF-native profiler

Summary

How it works

┌─────────────────────────────────────────────────────────────────────────────┐
│                         ProfilerRunner.run(all_steps)                         │
└─────────────────────────────────────────────────────────────────────────────┘
                                        │
                                        ▼
┌─────────────────────────────────────────────────────────────────────────────┐
│  INPUT (either format)                                                       │
│  ┌─────────────────────────────┐    ┌─────────────────────────────┐        │
│  │ list[list[IntermediateStep]] │    │     list[Trajectory]         │        │
│  │ (from nat eval workflow)     │    │     (ATIF, from file/API)    │        │
│  └─────────────────────────────┘    └─────────────────────────────┘        │
└─────────────────────────────────────────────────────────────────────────────┘
                                        │
                                        ▼
                    ┌───────────────────────────────────────┐
                    │  _normalize_to_trajectories()         │
                    │  IntermediateStep? → convert to ATIF  │
                    │  Trajectory? → pass through           │
                    └───────────────────────────────────────┘
                                        │
                                        ▼
                    ┌───────────────────────────────────────┐
                    │  list[Trajectory]  (unified format)   │
                    └───────────────────────────────────────┘
                                        │
                                        ▼
┌─────────────────────────────────────────────────────────────────────────────┐
│  create_dataframe_from_atif(trajectories)                                    │
│  ┌─────────────────────────────────────────────────────────────────────┐    │
│  │  For each Trajectory step:                                           │    │
│  │  • user → WORKFLOW_START row                                        │    │
│  │  • agent + tool_calls + observation → TOOL_END row per tool          │    │
│  │  • agent + metrics (nonzero tokens) → LLM_START + LLM_END rows       │    │
│  │  • agent + message, no (tool_calls and observation) → WORKFLOW_END   │    │
│  │  Uses step.extra for function_id, parent_function_id, etc.            │    │
│  └─────────────────────────────────────────────────────────────────────┘    │
└─────────────────────────────────────────────────────────────────────────────┘
                                        │
                                        ▼
                    ┌───────────────────────────────────────┐
                    │  pd.DataFrame  (profiler schema)      │
                    └───────────────────────────────────────┘
                                        │
            ┌───────────────────────────┼───────────────────────────┐
            ▼                           ▼                           ▼
┌───────────────────┐   ┌───────────────────────────┐   ┌───────────────────┐
│ dataframe_to_      │   │  Downstream modules        │   │  JSON / CSV        │
│ profiler_traces()  │   │  • LLMMetrics              │   │  output            │
│ → traces_dict      │   │  • bottleneck_analysis     │   │                    │
│ → traces_obj       │   │  • token_uniqueness       │   │  all_requests_     │
│ (JSON, Trie)       │   │  • workflow_runtimes       │   │  profiler_traces   │
└───────────────────┘   │  • prompt_caching          │   │  standardized_     │
                        │  • ModelTrainer (forecast) │   │  data_all.csv      │
                        └───────────────────────────┘   └───────────────────┘

Stage	Input	Output
Boundary	IntermediateStep or Trajectory	`list[Trajectory]`
Conversion	`list[Trajectory]`	`pd.DataFrame`
Downstream	`pd.DataFrame`	Metrics, JSON, CSV, forecasts

Implements NAT-366: the profiler now accepts ATIF Trajectories and uses ATIF internally. Both IntermediateStep (existing) and Trajectory (ATIF) inputs are supported at the boundary.

Requirements (per JIRA / Anuradha doc)

Input boundary: Profiler accepts list[list[IntermediateStep]] and list[Trajectory]. IntermediateStep input is converted to ATIF at the boundary when needed.
Internal format: Profiler uses ATIF internally. create_dataframe_from_atif(trajectories) produces the same DataFrame schema as create_standardized_dataframe. Downstream modules consume DataFrame.
step.extra: Uses step.extra for profiling metadata (function_id, parent_function_id, span_event_timestamp, nat_tool_ancestry for per-tool ancestry).
Backward compatibility: IntermediateStep path remains supported.

Changes

New

atif_dataframe.py: create_dataframe_from_atif(trajectories) — builds profiler DataFrame from ATIF trajectories. Maps user → WORKFLOW_START; agent with metrics → LLM_START/LLM_END; agent with tool_calls+observation → TOOL_END; agent message-only → WORKFLOW_END. Includes cached_tokens in total_tokens. dataframe_to_profiler_traces(df) for JSON output and PredictionTrieBuilder.

Modified

profile_runner.py: Uses create_dataframe_from_atif; _normalize_to_trajectories() converts IntermediateStep → ATIF when needed.
Downstream modules: Accept pd.DataFrame | list[list[IntermediateStep]]; use DataFrame path when available.
RandomForestModel / ModelTrainer: Accept DataFrame from create_dataframe_from_atif for training.
atif_converter.py: Minor updates for profiling metadata in step.extra.

Tests

6 new tests: TOOL_END path, cached_tokens, WORKFLOW_END message-only, dataframe_to_profiler_traces, empty DataFrame, ModelTrainer with DataFrame.
252 profiler tests pass.
Smoke-tested via nat eval with simple calculator example (profiler enabled).

Testing

Unit: uv run pytest packages/nvidia_nat_profiler/tests/ --ignore=.../test_profiler_e2e.py
Smoke: nat eval --config_file examples/evaluation_and_profiling/simple_calculator_eval/configs/config-atif-smoke.yml

By Submitting this PR I confirm:

I am familiar with the Contributing Guidelines.
We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
- Any contribution which contains commits that are not Signed-Off will not be accepted.
When the PR is ready for review, new or existing tests cover these changes.
When the PR is ready for review, the documentation is up to date with these changes.

copy-pr-bot · 2026-03-05T17:47:10Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

coderabbitai · 2026-03-05T17:47:27Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 946eb237-c4dd-4acf-9d8e-a62c64691a39

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Tip

Try Coding Plans. Let us write the prompt for your AI agent so you can ship faster (with fewer bugs).
Share your feedback on Discord.

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

…lerStep adapter Signed-off-by: afourniernv <afournier@nvidia.com>

AnuradhaKaruppiah · 2026-03-05T18:51:18Z

Thx @afourniernv. I like the guiding principles of this change.

Is there a way to look for drift in behavior before and after? For example for evaluators, I added a script to provide a summary of diff in scores when using IntermediateStep vs. AtifTrajectory - PR-1747

AnuradhaKaruppiah · 2026-03-05T20:36:09Z

packages/nvidia_nat_profiler/src/nat/plugins/profiler/profile_runner.py

+from nat.plugins.profiler.atif_dataframe import dataframe_to_profiler_traces
 from nat.plugins.profiler.forecasting.model_trainer import ModelTrainer
-from nat.plugins.profiler.utils import create_standardized_dataframe
+from nat.utils.atif_converter import IntermediateStepToATIFConverter


With the changes in this #1747 you should be able to always pass ATIF Trajectory list to ProfilerRunner.run in EvaluationRun.profile_workflow.

That way we can drop the IntermediateStepToATIFConverter and have nvidia-nat-profiler only deal with ATIF (and drop all IntermediateStep references)

feat(profiler): NAT-366 Option C - ATIF-native profiler, remove Profi…

b740e0b

…lerStep adapter Signed-off-by: afourniernv <afournier@nvidia.com>

afourniernv force-pushed the feature/profiler-atif branch from 778cf53 to b740e0b Compare March 5, 2026 17:49

afourniernv changed the title ~~feat(profiler): NAT-366 Option C - ATIF-native profiler, remove Profi…~~ feat(profiler): NAT-366 ATIF-native profiler Mar 5, 2026

willkill07 added feature request New feature or request non-breaking Non-breaking change labels Mar 5, 2026

willkill07 changed the title ~~feat(profiler): NAT-366 ATIF-native profiler~~ feat(profiler): ATIF-native profiler Mar 5, 2026

AnuradhaKaruppiah reviewed Mar 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(profiler): ATIF-native profiler#1751

feat(profiler): ATIF-native profiler#1751
afourniernv wants to merge 1 commit intoNVIDIA:developfrom
afourniernv:feature/profiler-atif

afourniernv commented Mar 5, 2026 •

edited

Loading

Uh oh!

copy-pr-bot bot commented Mar 5, 2026

Uh oh!

coderabbitai bot commented Mar 5, 2026 •

edited

Loading

Review skipped

Uh oh!

AnuradhaKaruppiah commented Mar 5, 2026

Uh oh!

AnuradhaKaruppiah Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

afourniernv commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

NAT-366: ATIF-native profiler

Summary

How it works

Requirements (per JIRA / Anuradha doc)

Changes

New

Modified

Tests

Testing

By Submitting this PR I confirm:

Uh oh!

copy-pr-bot bot commented Mar 5, 2026

Uh oh!

coderabbitai bot commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

AnuradhaKaruppiah commented Mar 5, 2026

Uh oh!

AnuradhaKaruppiah Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

afourniernv commented Mar 5, 2026 •

edited

Loading

coderabbitai bot commented Mar 5, 2026 •

edited

Loading