[R] Dynin-Omni: MaskedĀ Diffusion-BasedĀ OmnimodalĀ FoundationĀ Model
Visit ToolDynin-Omni is an omnimodal foundation model that unifies text, image, video, and speech understanding and generation. It uses a masked diffusion architecture for scalable cross-modal generation.
At a glance
Trending