Add partner_blindness_prob (blind-agent robustness knob) by eugenevinitsky · Pull Request #414 · Emerge-Lab/PufferDrive

eugenevinitsky · 2026-05-03T20:47:22Z

Summary

Ports the blind-agent robustness feature from vcha/turbostream. Each agent has a per-episode probability partner_blindness_prob of being "blind" — its partner observations are zeroed for the whole episode, making it an unpredictable hazard for surrounding traffic.
Blind agents are masked out of the PPO rollout buffer (GIGAFLOW Appendix B.4) so their transitions don't pollute the gradient.
Default 0.0 (off), so behavior is unchanged unless you opt in.
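
As a rough illustration of the per-episode draw described above, here is a minimal sketch. It assumes one independent Bernoulli draw per agent at reset. `SketchAgent`, `sample_blindness`, and the xorshift generator are hypothetical stand-ins; the simulator's actual RNG and reset code are not shown in this description.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for the Agent struct in sim/datatypes.h. */
typedef struct {
    int is_blind_partner; /* 1 = partner obs zeroed for this episode */
} SketchAgent;

/* Small xorshift32 generator so the sketch is deterministic.
 * Assumption: the real simulator uses its own RNG. */
static uint32_t xorshift32(uint32_t *state) {
    uint32_t x = *state;
    x ^= x << 13;
    x ^= x >> 17;
    x ^= x << 5;
    *state = x;
    return x;
}

/* Uniform float in [0, 1), built from the top 24 bits of the generator. */
static float uniform01(uint32_t *state) {
    return (float)(xorshift32(state) >> 8) / 16777216.0f; /* 2^24 */
}

/* Draw one Bernoulli(prob) flag per agent at episode reset.
 * Returns the number of agents marked blind. */
static int sample_blindness(SketchAgent *agents, int n, float prob, uint32_t *rng) {
    assert(prob >= 0.0f && prob <= 1.0f);
    assert(*rng != 0); /* xorshift state must be nonzero */
    int blind = 0;
    for (int i = 0; i < n; i++) {
        agents[i].is_blind_partner = uniform01(rng) < prob;
        blind += agents[i].is_blind_partner;
    }
    return blind;
}
```

With `prob = 0.0f` the comparison `uniform01(rng) < prob` is never true, so no agent is flagged. That matches the "behavior is unchanged unless you opt in" default.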

Changes

config/drive.ini: new partner_blindness_prob = 0.0 knob under a [Robustness features] block.
sim/env_fields.h: wires the kwarg through ENV_FIELDS.
sim/drive.h: adds partner_blindness_prob to the Drive struct; per-episode sampling in c_reset; early-return in write_partner_obs for blind egos; mask out blind agents in c_step.
sim/datatypes.h: adds is_blind_partner flag to Agent.
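
The observation and mask changes listed for sim/drive.h can be sketched roughly as follows. This is not the code from the diff: `PARTNER_OBS_DIM`, the `_sketch` function names, the `unsigned char` mask type, and the clear-then-return ordering are assumptions made for illustration.

```c
#include <assert.h>
#include <string.h>

#define PARTNER_OBS_DIM 4 /* hypothetical size of one agent's partner-obs slice */

/* Hypothetical stand-in for the Agent struct in sim/datatypes.h. */
typedef struct {
    int is_blind_partner;
} SketchAgent;

/* Fill one agent's partner-observation slice. The slice is cleared first,
 * so the early return for a blind ego leaves it all zeros. */
static void write_partner_obs_sketch(const SketchAgent *ego,
                                     const float *partner_feats,
                                     float *obs_out) {
    memset(obs_out, 0, PARTNER_OBS_DIM * sizeof(float));
    if (ego->is_blind_partner) {
        return; /* blind ego: keep the zeroed slice */
    }
    memcpy(obs_out, partner_feats, PARTNER_OBS_DIM * sizeof(float));
}

/* Per-step mask: 0 drops the agent's transition from the rollout buffer,
 * 1 keeps it. Assumption: masks are stored as one byte per agent. */
static void write_masks_sketch(const SketchAgent *agents, int n,
                               unsigned char *masks) {
    assert(n >= 0);
    for (int i = 0; i < n; i++) {
        masks[i] = agents[i].is_blind_partner ? 0 : 1;
    }
}
```

Blind agents still act in the simulation, so other agents still see and react to them. Only their own transitions are excluded from the PPO update.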

Test plan

./build.sh --fast builds clean
./build.sh builds clean (torch backend)
With partner_blindness_prob = 0.0: training metrics unchanged vs. base
With partner_blindness_prob = 0.05: ~5% of agents per episode see zeroed partner obs and contribute mask=0 entries; check masks distribution in a short rollout
Sanity: collision rate doesn't spike with blind agents off
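
One possible way to run the masks-distribution check from the test plan is sketched below. The helper names and the `unsigned char` mask type are hypothetical. The tolerance assumes independent per-agent draws, so `n` should count agent-episodes rather than individual steps, since a blind agent stays masked for its whole episode. It also assumes blindness is the only source of mask=0 entries.

```c
#include <assert.h>

/* Fraction of entries in a mask array that are 0 (excluded from PPO). */
static double masked_fraction(const unsigned char *masks, int n) {
    assert(n > 0);
    int zeros = 0;
    for (int i = 0; i < n; i++) {
        zeros += (masks[i] == 0);
    }
    return (double)zeros / n;
}

/* Returns 1 if `observed` lies within k binomial standard deviations of
 * `prob` for n independent draws. The inequality is compared in squared
 * form, (observed - prob)^2 * n <= k^2 * prob * (1 - prob), to avoid sqrt. */
static int fraction_is_plausible(double observed, double prob, int n, double k) {
    assert(n > 0 && k >= 0.0);
    double diff = observed - prob;
    return diff * diff * n <= k * k * prob * (1.0 - prob);
}
```

For example, with `partner_blindness_prob = 0.05` and 1000 agent-episodes, three standard deviations is about 0.02, so an observed fraction between roughly 0.03 and 0.07 would pass.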

🤖 Generated with Claude Code

Ports the blind-agent feature from vcha/turbostream. Per-episode probability that an agent sees zeroed partner observations for the whole episode, making it an unpredictable hazard for the rest of traffic. Blind agents are masked out of the PPO rollout buffer (GIGAFLOW Appendix B.4) so they don't pollute the gradient. Default 0.0 (off).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Adds an opt-in “blind partner observations” robustness feature to the drive simulator, allowing a per-episode fraction of agents to have their partner (other-agent) observations zeroed while excluding their transitions from PPO rollouts to avoid gradient contamination.

Changes:

Introduces partner_blindness_prob as a new [env] configuration knob (default 0.0).
Wires the new kwarg through ENV_FIELDS into the Drive struct.
Implements per-episode sampling of Agent.is_blind_partner, zeros partner observations for blind agents, and sets masks[i]=0 for blind agents in c_step.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated no comments.

File	Description
config/drive.ini	Adds the new robustness configuration knob and documentation comments.
sim/env_fields.h	Exposes `partner_blindness_prob` via the centralized env-kwarg field list.
sim/datatypes.h	Adds an episode-level `Agent.is_blind_partner` flag.
sim/drive.h	Samples blindness per episode, skips writing partner observations for blind agents, and masks blind agents out of PPO rollouts.


Copilot AI review requested due to automatic review settings May 3, 2026 20:47

eugenevinitsky wants to merge 1 commit into puffer-4 from ev/blind-partners
