-
Notifications
You must be signed in to change notification settings - Fork 35
Open
Description
Hi! I find a code bug in inference.py:
It seems that context length is 16 frame_actions (16*347) , so we can set args.demo_num > 1 and set args.frams to 16-args.demo_num, and generate should work well.
However, when I set args.demo_num to some number > 1, the image collapse quickly.
possible reason:
I find that this is because When training(and inference), condition is in [image, action,image,action ... ] format. However, in inference code, condition (variable image_input) contains only images, and I guess this is OOD and caused collapse.
my solution: add actions between images, and it works well!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels