Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper • 2504.17025 • Published Apr 23 • 17
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Paper • 2504.14738 • Published Apr 20 • 5
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Paper • 2504.15266 • Published Apr 21 • 6
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging Paper • 2504.10642 • Published Apr 14 • 2
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Paper • 2504.15133 • Published Apr 21 • 26
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation Paper • 2504.14538 • Published Apr 20 • 30
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper • 2504.17192 • Published Apr 24 • 120
Qwen2.5 Collection Qwen2.5 language models, both pretrained and instruction-tuned, in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 670
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 177
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers • 4 items • Updated Apr 16 • 6
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data Paper • 2309.11235 • Published Sep 20, 2023 • 15
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created, sorted by creation date in descending order so the most recently created repos appear at the top. • 121 items • Updated Jan 31, 2024 • 567