Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 10 days ago • 819
Article TRL v1.0: Post-Training Library Built to Move with the Field +2 12 days ago • 47
Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 188
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 23 items • Updated 3 days ago • 50
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published 22 days ago • 37
CodeScout Collection RL-trained code search agents (1.7B, 4B, 14B) that outperform 2–18× larger models using only a Unix terminal. arxiv.org/abs/2603.17829 • 12 items • Updated 24 days ago • 7
PyLate Collection State-of-the-art late interaction models trained using PyLate • 5 items • Updated 5 days ago • 4
ColBERT-Zero Collection First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 5 days ago • 20
LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 30 items • Updated 3 days ago • 124
LateOn-Code Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 5 days ago • 17
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 53
Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** Feb 19 • 19
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published Feb 2 • 60