SmolLM 🤏 SmolLM models, datasets and demos
HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 180k • 910
HuggingFaceTB/SmolLM2-1.7B-Instruct Text Generation • Updated Apr 21, 2025 • 118k • 723
HuggingFaceTB/SmolVLM-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 32.8k • 580
HuggingFaceTB/SmolLM2-360M-Instruct Text Generation • Updated Sep 22, 2025 • 186k • 182
📚 Filtering the web with LLMs
HuggingFaceFW/fineweb-edu Viewer • Updated Jul 11, 2025 • 3.5B • 228k • 995
HuggingFaceFW/fineweb-edu-classifier Text Classification • 0.1B • Updated Nov 17, 2024 • 10.1k • 209
HuggingFaceFW/ablation-model-fineweb-edu Text Generation • 2B • Updated Jun 11, 2024 • 416 • 20
math-ai/AutoMathText Viewer • Updated Jul 16, 2025 • 7.89M • 17.1k • 184
✨ Code Generation Code generation models and datasets!
bigcode/starcoder2-15b Text Generation • Updated Jun 5, 2024 • 21.8k • 661
bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 13.9k • 967
bigcode/starcoder2-3b Text Generation • 3B • Updated Mar 4, 2024 • 109k • 216
bigcode/starcoder Text Generation • 16B • Updated Oct 8, 2024 • 7.81k • 2.93k
Instruct datasets
QuixiAI/SystemChat-2.0 Viewer • Updated Jun 15, 2025 • 141k • 396 • 74
arcee-ai/infini-instruct-top-500k Viewer • Updated Jun 30, 2024 • 500k • 53 • 6
arcee-ai/The-Tome Viewer • Updated Aug 15, 2024 • 1.75M • 133 • 104
teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 10.4k • 803
🌌 Synthetic textbooks Synthetically generated textbooks
HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 14.8k • 676
Locutusque/UltraTextbooks Viewer • Updated Feb 2, 2024 • 5.52M • 662 • 198
microsoft/phi-2 Text Generation • 3B • Updated Dec 8, 2025 • 1.71M • 3.43k
HuggingFaceTB/cosmo-1b Text Generation • 2B • Updated Jul 8, 2024 • 218 • 133