view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 9 days ago • 63
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published 9 days ago • 35
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents Paper • 2604.18543 • Published 5 days ago • 26
GLiNER-PII Collection PII detection models developed in collaboration with Wordcab • 5 items • Updated Jan 29 • 23
jenerallee78/gemma-4-26B-A4B-it-ara-abliterated Image-Text-to-Text • 26B • Updated 17 days ago • 20.8k • 11
Toward Autonomous Long-Horizon Engineering for ML Research Paper • 2604.13018 • Published 11 days ago • 34
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 11 days ago • 85
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 11 days ago • 98