Tianchong Jiang1, Jingtian Ji1, Xiangshan Tan1, Jiading Fang2,*, Anand Bhattad3, Vitor Guizilini4,†, Matthew R. Walter1,†
1TTIC 2Waymo 3Johns Hopkins University 4Toyota Research Institute
Accepted to ICRA 2026 (Vienna, June 2026)
First, clone the repo and cd into it.
git clone https://github.com/ripl/CamPoseOpensource
cd CamPoseOpensource
Then, run the setup script. It will set up the conda environment and download the data.
bash setup.sh
Activate the conda environment with
conda activate know_your_camera
If you only need ManiSkill or robosuite, comment out the lines that install the other one.
You can run training in robosuite with
python policy_robosuite/train.py
or in ManiSkill with
python policy_maniskill/train.py
Every experiment in the paper is specified in reproduce/paper_runs.yaml, keyed by figure (e.g. fig6) and entry. To launch one run, pass the figure, entry, and seed:
python reproduce/reproduce.py --paper_item fig6 --exp lift_randomized_with_conditioning --seed 0
This invokes the matching train.py with the exact overrides and seed used for the paper.
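If you want to script a sweep yourself (e.g., all seeds of one entry), you can loop over the same CLI. A minimal sketch, with an illustrative seed count of three:

import subprocess

# Launch each seed of one paper entry sequentially; on a cluster you would
# typically submit these as separate jobs instead.
for seed in range(3):
    subprocess.run(
        [
            "python", "reproduce/reproduce.py",
            "--paper_item", "fig6",
            "--exp", "lift_randomized_with_conditioning",
            "--seed", str(seed),
        ],
        check=True,
    )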
If you use a coding agent (Cursor, Claude Code, Codex, etc.), you can point it at reproduce/SKILL.md and just say, e.g., "reproduce fig 6 lift randomized with conditioning"; it will ask about your scheduler and draft a job script. I honestly don't know how well this works in practice yet.
Results will not be bitwise identical across machines — this is not guaranteed on modern GPUs (see this blog for background) — but numbers should match the paper in expectation. If something looks off, or you hit any other issue, I'd really appreciate hearing about it — please open an issue or email tianchongj [at] ttic [dot] edu.
Training runs are long (typically hours to a day per seed on one GPU), so in practice you'll want a cluster (SLURM or similar).
To add camera conditioning to your policy, you can use the following minimal snippet to compute a Plücker raymap from camera intrinsics and extrinsics. (It assumes the OpenCV convention, i.e., image origin at the top-left and +z pointing forward.)
import torch

def get_plucker_raymap(K, c2w, height, width):
    """intrinsics K (3,3), cam2world c2w (4,4), height int, width int.
    Returns an (H, W, 6) map of Plücker coordinates (direction, moment)."""
    # Pixel-center coordinates on the image grid.
    vv, uu = torch.meshgrid(
        torch.arange(height, device=K.device, dtype=K.dtype) + 0.5,
        torch.arange(width, device=K.device, dtype=K.dtype) + 0.5,
        indexing="ij",
    )
    # Homogeneous pixel coordinates [u, v, 1].
    rays = torch.stack([uu, vv, torch.ones_like(uu)], dim=-1)
    # Unproject with K^-1, rotate into world space, and normalize.
    d_world = torch.nn.functional.normalize(
        (rays @ torch.linalg.inv(K).T) @ c2w[:3, :3].T,
        dim=-1,
        eps=1e-9,
    )
    # Plücker moment m = o × d, with o the camera center in world coordinates.
    # Note: torch.linalg.cross broadcasts over (1, 1, 3); torch.cross does not.
    o = c2w[:3, 3].view(1, 1, 3)
    m = torch.linalg.cross(o, d_world, dim=-1)
    return torch.cat([d_world, m], dim=-1)
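For example, with made-up intrinsics for a 640×480 camera and a hypothetical camera translation, plus a sanity check of the Plücker identity (any point p on a ray satisfies p × d = m):

K = torch.tensor([[500.0, 0.0, 320.0],
                  [0.0, 500.0, 240.0],
                  [0.0, 0.0, 1.0]])
c2w = torch.eye(4)
c2w[:3, 3] = torch.tensor([0.1, -0.2, 0.3])  # hypothetical camera center

raymap = get_plucker_raymap(K, c2w, height=480, width=640)
print(raymap.shape)  # torch.Size([480, 640, 6])

# Plücker sanity check: for any point p = o + t*d on a ray, p × d == m.
d, m = raymap[..., :3], raymap[..., 3:]
p = c2w[:3, 3] + 2.5 * d  # any t works
assert torch.allclose(torch.linalg.cross(p, d), m, atol=1e-5)

One common choice is to concatenate the 6-channel raymap with the RGB observation along the channel dimension before the vision encoder, but the right integration point depends on your architecture.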
If you find this work useful, please cite:
@article{jiang2025knowyourcamera,
  title   = {Do You Know Where Your Camera Is? {V}iew-Invariant Policy Learning with Camera Conditioning},
  author  = {Tianchong Jiang and Jingtian Ji and Xiangshan Tan and Jiading Fang and Anand Bhattad and Vitor Guizilini and Matthew R. Walter},
  journal = {arXiv preprint arXiv:2510.02268},
  year    = {2025},
}