deepseek-ai/DeepSeek-R1-Distill-Qwen-32B • Text Generation • 33B • Updated Feb 24, 2025 • 1.73M • 1.51k
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct • Text Generation • 8B • Updated Oct 31, 2024 • 7.92k • 13
Model Memory Utility 🚀 • Running on CPU Upgrade • Featured • 1k • Calculate VRAM needed for training and inference of HF models