AViD
Visit ToolAViD is an open-source framework that enables fine-tuning of vision-language grounding models on custom datasets. It extends Grounding DINO with parameter-efficient adaptation, LoRA support, and EMA stabilization.
At a glance
Trending