Running 113 Unlocking On-Policy Distillation for Any Model Family 📝 113 Explore on-policy distillation visualization for any model
Running Agents 7 Dataset Length Profiler 👁 7 Estimate optimal max_length for SFT training with token analysis
Running 3.88k The Ultra-Scale Playbook 🌌 3.88k The ultimate guide to training LLM on large GPU Clusters
Running Agents 88 Large Reasoning Models Leaderboard 🐳 88 A leaderboard to rank large reasoning models
Running 600 Scaling test-time compute 📈 600 Boost LLM answers with flexible test‑time search strategies
Running Agents 431 Reward Bench Leaderboard 📐 431 Explore and compare model scores on RewardBench benchmarks
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots