DrivingDiffusion
Visit ToolDrivingDiffusion is an open-source video generation tool that creates multi-view driving scenarios. It uses a latent diffusion model guided by 3D layout to generate realistic and consistent videos.
At a glance
Trending
DrivingDiffusion is an open-source video generation tool that creates multi-view driving scenarios. It uses a latent diffusion model guided by 3D layout to generate realistic and consistent videos.
Trending
About
DrivingDiffusion is an open-source project that provides an official implementation of the paper "DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model." This tool is designed to address the challenge of generating high-quality, large-scale multi-view video data with accurate annotations for autonomous driving research. It tackles cross-view and cross-frame consistency, as well as the quality of generated instances, through a cascaded approach involving multi-view single-frame image generation, single-view video generation, and post-processing for long video generation. DrivingDiffusion also incorporates local prompts to enhance the quality of generated instances and can extend video length using a temporal sliding window algorithm. It is built upon the stable-diffusion-v1-4 initial weights and base structure.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending