This repo gives the code and models of 'InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision'.
2025/12/01: The technical report of InternVideo-Next and the pretrained models are released. See Huggingface Collection for more details.
If this work is helpful for your research, please consider citing InternVideo.
@article{wang2025internvideonext,
title={InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision},
author={Chenting Wang and Yuhan Zhu and Yicheng Xu and Jiange Yang and Ziang Yan and Yali Wang and Yi Wang and Limin Wang},
year={2025},
journal={arXiv preprint arXiv:2512.01342},
}