Add num_trials trace to BenchmarkResult by saitcakmak · Pull Request #4941 · facebook/Ax

saitcakmak · 2026-02-24T19:57:29Z

Summary:
The optimization_trace and other traces on BenchmarkResult are indexed by "completion event" rather than trial number. In the asynchronous case, multiple trials can complete at the same simulated time and get grouped into a single trace entry, so the trace length can be less than the number of trials. Previously there was no way to determine how many trials had completed at each trace entry.

This diff adds a num_trials field to BenchmarkResult (a list[int]) that records the cumulative number of completed or early-stopped trials at each completion event. In the synchronous case this is simply [1, 2, ..., n]. In the async case it can increase by more than 1 at a step, e.g. [2, 4, 5, ...]. The field is optional (None) for backwards compatibility with old stored results.

On AggregatedBenchmarkResult, the mean num_trials across replications is added as a new column on the optimization_trace and score_trace DataFrames (when available on all results).

Reviewed By: hvarfner

Differential Revision: D93751894

meta-codesync · 2026-02-24T19:57:37Z

@saitcakmak has exported this pull request. If you are a Meta employee, you can view the originating Diff in D93751894.

Summary: The optimization_trace and other traces on BenchmarkResult are indexed by "completion event" rather than trial number. In the asynchronous case, multiple trials can complete at the same simulated time and get grouped into a single trace entry, so the trace length can be less than the number of trials. Previously there was no way to determine how many trials had completed at each trace entry. This diff adds a `num_trials` field to `BenchmarkResult` (a `list[int]`) that records the cumulative number of completed or early-stopped trials at each completion event. In the synchronous case this is simply `[1, 2, ..., n]`. In the async case it can increase by more than 1 at a step, e.g. `[2, 4, 5, ...]`. The field is optional (`None`) for backwards compatibility with old stored results. On `AggregatedBenchmarkResult`, the mean `num_trials` across replications is added as a new column on the `optimization_trace` and `score_trace` DataFrames (when available on all results). Reviewed By: hvarfner Differential Revision: D93751894

Summary: Pull Request resolved: facebook#4941 The optimization_trace and other traces on BenchmarkResult are indexed by "completion event" rather than trial number. In the asynchronous case, multiple trials can complete at the same simulated time and get grouped into a single trace entry, so the trace length can be less than the number of trials. Previously there was no way to determine how many trials had completed at each trace entry. This diff adds a `num_trials` field to `BenchmarkResult` (a `list[int]`) that records the cumulative number of completed or early-stopped trials at each completion event. In the synchronous case this is simply `[1, 2, ..., n]`. In the async case it can increase by more than 1 at a step, e.g. `[2, 4, 5, ...]`. The field is optional (`None`) for backwards compatibility with old stored results. On `AggregatedBenchmarkResult`, the mean `num_trials` across replications is added as a new column on the `optimization_trace` and `score_trace` DataFrames (when available on all results). Reviewed By: hvarfner Differential Revision: D93751894

codecov-commenter · 2026-02-24T23:09:35Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 96.80%. Comparing base (052a554) to head (975f98f).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #4941   +/-   ##
=======================================
  Coverage   96.80%   96.80%           
=======================================
  Files         595      595           
  Lines       63193    63213   +20     
=======================================
+ Hits        61173    61194   +21     
+ Misses       2020     2019    -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

Summary: The optimization_trace and other traces on BenchmarkResult are indexed by "completion event" rather than trial number. In the asynchronous case, multiple trials can complete at the same simulated time and get grouped into a single trace entry, so the trace length can be less than the number of trials. Previously there was no way to determine how many trials had completed at each trace entry. This diff adds a `num_trials` field to `BenchmarkResult` (a `list[int]`) that records the cumulative number of completed or early-stopped trials at each completion event. In the synchronous case this is simply `[1, 2, ..., n]`. In the async case it can increase by more than 1 at a step, e.g. `[2, 4, 5, ...]`. The field is optional (`None`) for backwards compatibility with old stored results. On `AggregatedBenchmarkResult`, the mean `num_trials` across replications is added as a new column on the `optimization_trace` and `score_trace` DataFrames (when available on all results). Reviewed By: hvarfner Differential Revision: D93751894

Summary: Pull Request resolved: facebook#4941 The optimization_trace and other traces on BenchmarkResult are indexed by "completion event" rather than trial number. In the asynchronous case, multiple trials can complete at the same simulated time and get grouped into a single trace entry, so the trace length can be less than the number of trials. Previously there was no way to determine how many trials had completed at each trace entry. This diff adds a `num_trials` field to `BenchmarkResult` (a `list[int]`) that records the cumulative number of completed or early-stopped trials at each completion event. In the synchronous case this is simply `[1, 2, ..., n]`. In the async case it can increase by more than 1 at a step, e.g. `[2, 4, 5, ...]`. The field is optional (`None`) for backwards compatibility with old stored results. On `AggregatedBenchmarkResult`, the mean `num_trials` across replications is added as a new column on the `optimization_trace` and `score_trace` DataFrames (when available on all results). Reviewed By: hvarfner Differential Revision: D93751894

Summary: The optimization_trace and other traces on BenchmarkResult are indexed by "completion event" rather than trial number. In the asynchronous case, multiple trials can complete at the same simulated time and get grouped into a single trace entry, so the trace length can be less than the number of trials. Previously there was no way to determine how many trials had completed at each trace entry. This diff adds a `num_trials` field to `BenchmarkResult` (a `list[int]`) that records the cumulative number of completed or early-stopped trials at each completion event. In the synchronous case this is simply `[1, 2, ..., n]`. In the async case it can increase by more than 1 at a step, e.g. `[2, 4, 5, ...]`. The field is optional (`None`) for backwards compatibility with old stored results. On `AggregatedBenchmarkResult`, the mean `num_trials` across replications is added as a new column on the `optimization_trace` and `score_trace` DataFrames (when available on all results). Reviewed By: hvarfner Differential Revision: D93751894

meta-cla bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Feb 24, 2026

meta-codesync bot added fb-exported meta-exported labels Feb 24, 2026

saitcakmak force-pushed the export-D93751894 branch from 6d47967 to f4cb6c1 Compare February 24, 2026 22:13

saitcakmak force-pushed the export-D93751894 branch from f4cb6c1 to 41b1cdf Compare February 24, 2026 22:14

saitcakmak force-pushed the export-D93751894 branch from 41b1cdf to 5914a65 Compare February 24, 2026 22:21

saitcakmak force-pushed the export-D93751894 branch from 5914a65 to a7f5d20 Compare February 24, 2026 23:33

saitcakmak force-pushed the export-D93751894 branch from a7f5d20 to ff000f4 Compare February 24, 2026 23:36

saitcakmak force-pushed the export-D93751894 branch from ff000f4 to 975f98f Compare February 25, 2026 04:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Add num_trials trace to BenchmarkResult#4941

Add num_trials trace to BenchmarkResult#4941
saitcakmak wants to merge 1 commit intofacebook:mainfrom
saitcakmak:export-D93751894

saitcakmak commented Feb 24, 2026

Uh oh!

meta-codesync bot commented Feb 24, 2026

Uh oh!

codecov-commenter commented Feb 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

saitcakmak commented Feb 24, 2026

Uh oh!

meta-codesync bot commented Feb 24, 2026

Uh oh!

codecov-commenter commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov-commenter commented Feb 24, 2026 •

edited

Loading