Skip to content

Significant bottleneck with text encoder outputs and latent caching to disk with .npz vs .safetensors #2266

@gazingstars123

Description

@gazingstars123

Hello, I wanted to ask if there are plans to officially support .safetensors for latent and text encoder caching (as discussed in #1750) or if it's already implemented that I'm not aware of. However, skimming through the newer Musubi Tuner code, it seems like that isn't the case.

I've been testing on very large datasets (hundreds of thousands of images or millions of images) and found that the current .npz implementation is a significant bottleneck with huge additional file sizes and with both read and write operations, in many cases up to 40-50 times slower reading cache from disk compared to .safetensors.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions