Vjepa2
Visit Toolvjepa2 is an Open Source & Models tool that provides PyTorch code and models for self-supervised learning from video. It enables understanding, prediction, and planning using advanced video models.
At a glance
Trending
vjepa2 is an Open Source & Models tool that provides PyTorch code and models for self-supervised learning from video. It enables understanding, prediction, and planning using advanced video models.
Trending
About
vjepa2 is an open-source project from Facebook AI Research (FAIR) providing PyTorch code and models for V-JEPA 2 and V-JEPA 2.1, self-supervised learning approaches for video. These models are pre-trained on internet-scale video data to achieve state-of-the-art performance in motion understanding and human action anticipation tasks. V-JEPA 2.1 further refines the training recipe to learn high-quality and temporally consistent dense features, leveraging dense predictive loss, deep self-supervision, and multi-modal tokenizers. The project also includes V-JEPA 2-AC, a latent action-conditioned world model for robot manipulation tasks, demonstrating capabilities like reaching, grasping, and pick-and-place without extensive environment-specific data. It offers pretrained checkpoints and easy integration via PyTorch Hub and HuggingFace.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending