SIR: Structured Image Representations for Explainable Robot Learning

Paper, Project Page, CVPR 2026

Paul Mattes¹, Jan Schwab, Jens Bosch, Maximilian Li, Nils Blank, Minh-Trung Tang, Moritz Haberland and Rudolf Lioutikov¹

¹Intuitive Robots Lab, Karlsruhe Institute of Technology

This is the official code repository for the paper SIR: Structured Image Representations for Explainable Robot Learning.

Installation

Start the installation by running install.sh:

cd sir
sh install.sh

The following instructions are taken from the RoboCasa repository: https://github.com/robocasa/robocasa

Do NOT install robosuite or RoboCasa inside the SIR folder. The SIR, robosuite, and RoboCasa repositories should sit side by side in the same parent folder.
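Once the clone steps in this section are done, the folder layout can be checked with a short script. This is only a sketch: the folder names `sir`, `robosuite`, and `robocasa` are assumed from the commands in this README, and `missing_repos` is a hypothetical helper, not part of the SIR code base.

```python
from pathlib import Path

# Assumed folder names, taken from the cd/clone commands in this README.
EXPECTED_REPOS = ("sir", "robosuite", "robocasa")


def missing_repos(workspace, expected=EXPECTED_REPOS):
    """Return the expected repository folders that are not direct children of workspace."""
    root = Path(workspace)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    # Run from inside the SIR folder: its parent directory is the shared workspace.
    missing = missing_repos(Path.cwd().parent)
    print("missing:", missing if missing else "none")
```

An empty result means all three repositories are siblings in the same parent folder.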

Clone the robosuite repository and install it:

cd ..
git clone -b robocasa_v0.1 https://github.com/ARISE-Initiative/robosuite
cd robosuite
pip install -e .
python robosuite/scripts/setup_macros.py

For Windows users, see: https://robosuite.ai/docs/installation.html

Clone the robocasa repository and install it. Afterwards, download the kitchen assets and set up the macros:

cd ..
git clone https://github.com/robocasa/robocasa
cd robocasa
git reset --hard 370f986aa3934be6c134ecb978952423df9a1ed0
pip install -e .
python robocasa/scripts/download_kitchen_assets.py
python robocasa/scripts/setup_macros.py
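After both editable installs, a quick check that the packages are visible to the active Python environment can save debugging time later. The sketch below uses only the standard library; `find_missing_packages` is a hypothetical helper name, and the check confirms only that the packages can be located, not that the assets or macros are set up correctly.

```python
import importlib.util


def find_missing_packages(names=("robosuite", "robocasa")):
    """Return the package names that Python cannot locate in the active environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing_packages()
    if missing:
        print("Not importable:", ", ".join(missing))
    else:
        print("robosuite and robocasa are importable.")
```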

File Changes

The following files need to be changed in the robosuite and robocasa repositories.

Robosuite

In robosuite/macros_private.py, change IMAGE_CONVENTION from opengl to opencv.
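The change can also be scripted instead of edited by hand. The following is a minimal sketch, assuming the macros file contains an assignment of the form `IMAGE_CONVENTION = "opengl"`; the helper names `set_opencv_convention` and `patch_macros_file` are hypothetical and not part of robosuite.

```python
import re
from pathlib import Path

# Matches an assignment such as: IMAGE_CONVENTION = "opengl"
# (assumed form of the line in the macros file).
_PATTERN = re.compile(r'^(\s*IMAGE_CONVENTION\s*=\s*)(["\'])opengl\2', re.MULTILINE)


def set_opencv_convention(text):
    """Return text with the IMAGE_CONVENTION assignment switched from opengl to opencv."""
    return _PATTERN.sub(r'\1"opencv"', text)


def patch_macros_file(path):
    """Rewrite the macros file in place; return True if the file changed."""
    path = Path(path)
    original = path.read_text()
    updated = set_opencv_convention(original)
    if updated != original:
        path.write_text(updated)
    return updated != original
```

Calling `patch_macros_file("robosuite/macros_private.py")` with the path adjusted to your checkout applies the edit. Only the assignment is touched, so a trailing comment that lists the available options stays unchanged.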

Also add the following code to robosuite/robosuite/models/tasks/task.py, and define self.count = 0 in the __init__ method.

Add after line 112:

if cls == "MJCFObject":
    cls = model.name
self.count += 1
if self.count > 3:
    for geom in model.contact_geoms:
        if geom not in sim.model.geom_names:
            print("removed: ", geom)
            geom_name = geom.split("_")[-1]
            model._contact_geoms.remove(geom_name)
    for geom in model.visual_geoms:
        if geom not in sim.model.geom_names:
            print("removed: ", geom)
            geom_name = geom.split("_")[-1]
            model._visual_geoms.remove(geom_name)

RoboCasa

robocasa/environments/kitchen/kitchen.py

All line numbers refer to the original, unmodified file, not to the file after the changes described here.

Add below line 230 (right at the start of the __init__ function):

np.random.seed(seed)
random.seed(seed)

Add to the end of the super().__init__() call, after line 313:

camera_segmentations="class",

Additionally, add the following line below every self.model.merge_objects([model]) call starting from line 478 (3 times: after lines 507, 483, and 474):

self.model.mujoco_objects.append(model)
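Because the same line has to be added in several places, the edit can be sketched as a small text transformation. The helper below is hypothetical (`add_append_calls` is not part of RoboCasa); it works on the file contents as a string, copies the indentation of each matching call, and skips calls that already have the extra line. The `start_line` value is left to the caller.

```python
TARGET = "self.model.merge_objects([model])"
ADDITION = "self.model.mujoco_objects.append(model)"


def add_append_calls(source, start_line=1):
    """Insert ADDITION after every TARGET line at or beyond start_line (1-indexed)."""
    lines = source.splitlines()
    out = []
    for index, line in enumerate(lines):
        out.append(line)
        if index + 1 < start_line or line.strip() != TARGET:
            continue
        following = lines[index + 1].strip() if index + 1 < len(lines) else ""
        if following != ADDITION:  # skip calls that were already patched
            indent = line[: len(line) - len(line.lstrip())]
            out.append(indent + ADDITION)
    return "\n".join(out) + "\n"
```

To apply it, read kitchen.py into a string, pass it through `add_append_calls`, and write the result back. Reviewing the resulting diff before running any experiments is advisable.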

Datasets (TBD)

The graph datasets for RoboCasa will be uploaded to Hugging Face in the coming days: https://huggingface.co/datasets/MrLayen/SIR_robocasa
