view article Article Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture Jan 5 • 38
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 7 days ago • 57
view article Article Rigth and left alignment on Large Language Models and its variants 11 days ago • 1
Pi05 Knolwedge Insulation Collection Models that I train for that matter • 6 items • Updated 11 days ago • 1
view article Article Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models 18 days ago • 10
GPU Acceleration and Portability of the TRIMEG Code for Gyrokinetic Plasma Simulations using OpenMP Paper • 2601.14301 • Published 24 days ago • 1
Physical AI Collection VLM and models used for Physical AI, LeRobot, Nvidia, etc. Handy • 4 items • Updated 16 days ago • 1
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 149
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published Oct 22, 2025 • 115
view article Article Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models Oct 20, 2025 • 20
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 180
view article Article Nano Banana (Gemini 2.5 Flash Image) Full Tutorial - 27 Unique Cases vs Qwen Image Edit - Free 2 Use Aug 27, 2025 • 2
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 510