▲ LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token at github.com▼1 up and 1 down, posted by Vily 9 days ago 2 comments