From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 18 days ago • 256
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 9 days ago • 199
It seems that there is an issue with sfr-llama-3.1-70b-judge: Failed to parse Salesforce response format: Error with Salesforce model sfr-llama-3.1-70b-judge: 422 Client Error: Unprocessable Entity for url: https://gateway.salesforceresearch.ai/sfr-judge/process
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10 • 660
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22 • 160
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 259
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper • 2504.08791 • Published Apr 7 • 137
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 303
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 301
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Paper • 2503.18878 • Published Mar 24 • 119
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Paper • 2503.11647 • Published Mar 14 • 145
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3 • 89
Today is a big day for the Arabic language. We have https://huggingface.co/spaces/Navid-AI/The-Arabic-Rag-Leaderboard, an update for OALL/Open-Arabic-LLM-Leaderboard, and the release of atlasia/darija-chatbot-arena. All of these announcements came within 12 hours 🤯