HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text • 2B • Updated • 24.5k • 588
State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm
Answer questions about images with AI chat