feat: add eval_config support to RL API client #271

JannikSt · 2025-12-30T17:36:11Z

Adds missing eval_config support to the RL API client.

Changes:

Add eval_config field to RLRun model
Add eval_config parameter to create_run()
Wire up payload["eval"] in request body
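
For readers without the diff open, the three changes above can be sketched roughly as follows. This is a minimal illustration, not the code in packages/prime/src/prime_cli/api/rl.py: the `RLRun` fields other than `eval_config`, the helper name `build_create_run_payload`, and the payload keys other than `"eval"` are assumptions made for the example. Omitting the key when no eval config is given is also an assumption.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class RLRun:
    """Hypothetical, trimmed-down stand-in for the RLRun model."""

    id: str
    model_name: str
    environments: List[Dict[str, Any]] = field(default_factory=list)
    # The field this PR adds: eval settings attached to the run, if any.
    eval_config: Optional[Dict[str, Any]] = None


def build_create_run_payload(
    model_name: str,
    environments: List[Dict[str, Any]],
    eval_config: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Build a request body; only the "eval" key is taken from the PR text."""
    payload: Dict[str, Any] = {
        "model": {"name": model_name},  # assumed shape
        "environments": environments,  # assumed shape
    }
    # Assumption: the key is left out entirely when no eval config is given.
    if eval_config:
        payload["eval"] = eval_config
    return payload
```

With this shape, a caller that passes no `eval_config` sends the same body as before the change, which keeps the new parameter backward compatible.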

Relates to #256

Note

Adds evaluation support to RL runs and a new logs experience.

Adds eval_config to RLRun and plumbs it through RLClient.create_run() as payload["eval"]
CLI: introduces [eval] config (envs, interval, num_examples, rollouts_per_example, base_model) and corresponding --eval-* flags; validates env/eval slugs; surfaces eval settings in run creation output
New prime rl logs command with --tail and --follow; cleans output by stripping ANSI and collapsing progress bars; handles rate limiting
API client: new get_logs(run_id, tail_lines) method used by CLI

Written by Cursor Bugbot for commit aa1b26d. This will update automatically on new commits.

packages/prime/src/prime_cli/api/rl.py

packages/prime/src/prime_cli/commands/rl.py

JannikSt · 2025-12-30T19:53:22Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

packages/prime/src/prime_cli/commands/rl.py

packages/prime/src/prime_cli/commands/rl.py

- Use BaseConfig.from_sources for eval config precedence instead of manual if-statements
- Require owner/name format for --eval-envs (same as training environments)
- Rename EvalConfig.eval_base_model to base_model for proper underscore mapping
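
The first two points can be pictured with a small stand-alone sketch. It does not reproduce BaseConfig.from_sources, whose real behavior is not shown in this thread. `merge_sources`, `validate_env_slug`, the precedence order (CLI over TOML over defaults) and the characters allowed in a slug are all assumptions for illustration.

```python
import re
from typing import Any, Dict

# Assumed slug grammar: two non-empty segments separated by a single slash.
SLUG_RE = re.compile(r"[A-Za-z0-9_.-]+/[A-Za-z0-9_.-]+")


def validate_env_slug(slug: str) -> str:
    """Accept only owner/name style slugs; raise ValueError otherwise."""
    if not SLUG_RE.fullmatch(slug):
        raise ValueError(f"expected owner/name format, got {slug!r}")
    return slug


def merge_sources(
    defaults: Dict[str, Any],
    toml_values: Dict[str, Any],
    cli_values: Dict[str, Any],
) -> Dict[str, Any]:
    """Later sources win; a value of None means "not provided"."""
    merged = dict(defaults)
    for source in (toml_values, cli_values):
        for key, value in source.items():
            if value is not None:
                merged[key] = value
    return merged
```

Treating `None` as "not provided" replaces a chain of per-option if-statements with a single loop, which is presumably the kind of simplification the first point is after.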

JannikSt · 2026-01-03T11:46:39Z

@codex review

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: aa1b26d886

packages/prime/src/prime_cli/commands/rl.py

* Implement commands for hosted RL
* Hosted RL
* Allow for user to use just

      Usage: prime rl [OPTIONS] ENVIRONMENTS... | COMMAND [ARGS]...

      Manage RL training runs.
      By default, 'prime rl <environments>' runs 'prime rl run <environments>'.

      Options:
        --help  -h  Show this message and exit.

      Commands:
        run     Create an RL training run with specified environments and model.
        models  List available models for RL training.
        runs    List your RL training runs.
        stop    Stop an RL training run.
        delete  Delete an RL training run.

  to start a run
* Support tomls on prime rl cmd
* Minor fix
* Cleanup references to RFT
* Minor improvements
* Fix ruff
* Match post rft run schema to new backend
* Refactor delete_run method to remove return value and simplify success handling in RLClient and related command.
* Fix/prime rl list (#267)
* quick fix for prime rl list when no name set
* remove truncation of id in prime rl list
* Add support for run_config
* feat: add eval_config support to RL API client (#271)
* feat: add eval_config support to RL API client
* Remove accidentally committed test files
* feat: add logs command for RL runs
* fix: move time import to top, add rl_config example
* feat: add --watch flag and improve log streaming
* fix: allow built-in envs like reverse-text, update example
* feat: add --eval-* options to rl run command
* fix: strip ANSI escape codes from logs output
* fix: increase poll interval to 5s, add rate limit handling
* fix: filter progress bars from logs output, remove redundant --watch flag
* fix: keep 100% progress bar completion lines in logs
* fix: address review comments - simplify log follow, warn on unused eval options
* fix: handle log rotation in follow mode when tail window is full
* fix: always use overlap detection for log follow to handle fast growth with rotation
* feat: add [eval] section support in TOML config files
* fix: improve progress bar filtering to remove empty lines
* fix: require owner/name format for environments, remove example config
* fix: use from_sources for eval config merging, require owner/name format
  - Use BaseConfig.from_sources for eval config precedence instead of manual if-statements
  - Require owner/name format for --eval-envs (same as training environments)
  - Rename EvalConfig.eval_base_model to base_model for proper underscore mapping
* prime registry support (#215)
* custom image registry for sandboxes
* prime images
* --image typo
* linux/amd64
* updated to not build locally
* full image path
* rm emojis
* remove inline
* image status
* full image path
* add cleanup
* adjust scope output
* bug bot stuff
* validate_output_format
* bug bot comment
* update prime images list
* limit platform
* bump timeout
* add closed beta info
* Chore/bump version 0.5.8 (#270)
* bump version to 0.5.8
* bump versions
* Fix: Update eval sample field (#265)
* Update eval sample field.
* Update docs.
* Fix: Remove trailing comma from API token URL (#273)

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
Co-authored-by: sami <sami@primeintellect.ai>

---------

Co-authored-by: Johannes Hagemann <johannes@primeintellect.ai>
Co-authored-by: JannikSt <JannikSt@users.noreply.github.com>
Co-authored-by: Jannik Straube <info@jannik-straube.de>

JannikSt added 2 commits December 30, 2025 18:34

feat: add eval_config support to RL API client

f967344

Remove accidentally committed test files

37379f7

cursor bot reviewed Dec 30, 2025

packages/prime/src/prime_cli/api/rl.py

JannikSt added 2 commits December 30, 2025 18:48

feat: add logs command for RL runs

06edd99

fix: move time import to top, add rl_config example

6d421de

cursor bot reviewed Dec 30, 2025

packages/prime/src/prime_cli/commands/rl.py (outdated)

JannikSt added 3 commits December 30, 2025 18:52

feat: add --watch flag and improve log streaming

3cad7f3

fix: allow built-in envs like reverse-text, update example

dc69731

feat: add --eval-* options to rl run command

18072d0

cursor bot reviewed Dec 30, 2025

packages/prime/src/prime_cli/commands/rl.py (outdated)

packages/prime/src/prime_cli/commands/rl.py (outdated)

JannikSt added 5 commits December 30, 2025 19:05

fix: strip ANSI escape codes from logs output

7dedc76

fix: increase poll interval to 5s, add rate limit handling

842e47e

fix: filter progress bars from logs output, remove redundant --watch flag

fcac3bd

fix: keep 100% progress bar completion lines in logs

9fbef56

fix: address review comments - simplify log follow, warn on unused eval options

309c6e0
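
The log-cleaning commits in this batch (strip ANSI codes, filter progress bars, keep the 100% line) could be approximated as below. This is a sketch under assumptions, not the code in commands/rl.py: the regular expressions, the tqdm-style `NN%|` progress format, and the function name `clean_logs` are chosen for illustration.

```python
import re
from typing import List

# CSI escape sequences such as "\x1b[32m" (colors) or "\x1b[2K" (erase line).
ANSI_RE = re.compile(r"\x1b\[[0-9;?]*[ -/]*[@-~]")
# Assumed progress-bar shape: a percentage followed by a bar, e.g. " 42%|####".
PROGRESS_RE = re.compile(r"(\d{1,3})%\|")


def clean_logs(raw: str) -> List[str]:
    """Strip ANSI codes, drop empty lines and all progress updates but 100%."""
    cleaned: List[str] = []
    for line in raw.split("\n"):
        line = ANSI_RE.sub("", line)
        # A carriage return redraws the line in place; keep the final state.
        line = line.rstrip("\r").split("\r")[-1].rstrip()
        if not line:
            continue
        match = PROGRESS_RE.search(line)
        if match and match.group(1) != "100":
            continue  # intermediate progress update
        cleaned.append(line)
    return cleaned
```

Keeping only the 100% line preserves the record that a step finished while removing the intermediate redraws that clutter a captured log.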

chatgpt-codex-connector bot reviewed Dec 30, 2025

packages/prime/src/prime_cli/commands/rl.py (outdated)

fix: handle log rotation in follow mode when tail window is full

37b5f64
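
One way to picture the follow-mode problem this commit addresses: each poll returns only the last N lines, so once the tail window is full the client cannot rely on line counts to tell what is new. A minimal sketch of overlap detection between two consecutive tail snapshots is shown below. The function name and approach are assumptions, not the implementation in this PR.

```python
from typing import List


def new_lines_since(previous: List[str], current: List[str]) -> List[str]:
    """Return the lines in `current` that follow its overlap with `previous`.

    The overlap is the longest suffix of `previous` that is also a prefix of
    `current`. If there is none, every line in `current` is treated as new.
    """
    max_overlap = min(len(previous), len(current))
    for size in range(max_overlap, 0, -1):
        if previous[-size:] == current[:size]:
            return current[size:]
    return list(current)
```

For example, with `previous = ["a", "b", "c"]` and `current = ["b", "c", "d", "e"]`, the overlap is `["b", "c"]` and the new lines are `["d", "e"]`. Runs of identical lines make the overlap ambiguous, so a heuristic like this can skip or repeat a line in that case.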

cursor bot reviewed Dec 30, 2025

packages/prime/src/prime_cli/commands/rl.py (outdated)

JannikSt added 4 commits December 30, 2025 21:06

fix: always use overlap detection for log follow to handle fast growth with rotation

8fa1d3d

feat: add [eval] section support in TOML config files

ba1b5aa

fix: improve progress bar filtering to remove empty lines

88647e0

fix: require owner/name format for environments, remove example config

be6ae70

manveerxyz reviewed Dec 31, 2025

packages/prime/src/prime_cli/commands/rl.py (outdated)

packages/prime/src/prime_cli/commands/rl.py (outdated)

chatgpt-codex-connector bot reviewed Jan 3, 2026

packages/prime/src/prime_cli/commands/rl.py

JannikSt merged commit f553271 into feature/rft Jan 3, 2026
1 check passed

JannikSt deleted the feature/rft-eval-config branch January 3, 2026 12:00

3 participants
