1 40

Tom LUCAS

C0casio45

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

commented on an article 3 months ago

Judge Arena: Benchmarking LLMs as Evaluators

View all activity

Organizations

upvoted 2 papers 3 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 18 days ago • 256

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 9 days ago • 199

commented on Judge Arena: Benchmarking LLMs as Evaluators 3 months ago

seems that there is an issue with sfr-llama-3.1-70b-judge

Failed to parse Salesforce response format: Error with Salesforce model sfr-llama-3.1-70b-judge: 422 Client Error: Unprocessable Entity for url: https://gateway.salesforceresearch.ai/sfr-judge/process

upvoted a paper 3 months ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 660

upvoted a paper 4 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 160

upvoted 2 papers 5 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 259

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 131

upvoted a paper 7 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

upvoted 4 papers 8 months ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 137

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 301

upvoted 5 papers 9 months ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 119

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 89

upvoted a paper 10 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

reacted to MohamedRashad's post with 👍 10 months ago

Post

3350

Today is a big day for the Arabic Language,

We have https://huggingface.co/spaces/Navid-AI/The-Arabic-Rag-Leaderboard,
an Update for OALL/Open-Arabic-LLM-Leaderboard
and the release of atlasia/darija-chatbot-arena

All of this announcements was under 12 hours of time 🤯

upvoted a paper 10 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 124

Tom LUCAS

AI & ML interests

Recent Activity

Organizations

C0casio45's activity