High-Fidelity and Controllable Generation of Textured 3D Assets
code for Mesh R-CNN, ICCV 2019
State-of-the-art 2D and 3D Face Analysis Project
Models for object and human mesh reconstruction
3D reconstruction software
A Unified Framework for Text-to-3D and Image-to-3D Generation
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Generating Immersive, Explorable, and Interactive 3D Worlds
PyTorch3D is FAIR's library of reusable components for deep learning
RGBD video generation model conditioned on camera input
Implementation of DeepLabCut
Official implementation of DreamCraft3D
Implementation of Video Diffusion Models
Simple and easily configurable grid world environments
DeepMind model for tracking arbitrary points across videos & robotics
Framework for building neural networks
Implementation of Make-A-Video, new SOTA text to video generator
Lightning fast C++/CUDA neural network framework
An Open Source package that allows video game creators
This repo contains the code for 1D tokenizer and generator
A text-to-speech, speech-to-text and speech-to-speech library
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
SAPIEN Manipulation Skill Framework
A Systematic Framework for Interactive World Modeling