LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) for efficient image, high-resolution image, and video understanding. It handles all of these visual inputs with a single vision token, which makes it a good fit for researchers who need efficient multimodal understanding.
At a glance