FSDP now has a dedicated auto-wrap policy for Transformer models. It lets FSDP build a model-aware sharding plan for how the model is split across GPUs, which can yield significant improvements in training time.
Here's the video: https://www.loom.com/share/2cc2633fa69940789f7f886cbe1fef79
The accompanying notebook is included in this dir - transformer wrapper tutorial.ipynb
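For reference, here is a minimal sketch of what the video and notebook cover, using PyTorch's `transformer_auto_wrap_policy` (available in `torch.distributed.fsdp.wrap` since PyTorch 1.12). The `Block` class below is a stand-in for your model's real transformer layer class (e.g. `GPT2Block`, `T5Block`, `BertLayer`):

```python
import functools

import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy


class Block(nn.Module):
    """Stand-in for a real transformer layer class."""

    def __init__(self):
        super().__init__()
        self.attn = nn.Linear(64, 64)
        self.mlp = nn.Linear(64, 64)

    def forward(self, x):
        return self.mlp(self.attn(x))


model = nn.Sequential(*(Block() for _ in range(4)))

# Tell FSDP which module class marks one shardable "unit" of the model;
# this is what makes the sharding plan model-aware.
auto_wrap_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={Block},
)

# With a process group initialized (e.g. via torchrun / init_process_group),
# FSDP then wraps each Block as its own shard:
# sharded_model = FSDP(model, auto_wrap_policy=auto_wrap_policy)
```

The commented-out `FSDP(...)` call is left out of the runnable part because it requires an initialized distributed process group; the policy itself is just a `functools.partial` that FSDP calls while traversing the module tree.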