InCoder-32B-Thinking: Industrial Code World Model for Thinking • Paper • 2604.03144 • Published 11 days ago • 228
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU • Paper • 2604.05091 • Published 8 days ago • 43
EXAONE 4.5 • Collection • LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 4 items • Updated 1 day ago • 36
DFlash • Collection • Block Diffusion for Flash Speculative Decoding • 13 items • Updated 9 days ago • 54
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? • Paper • 2603.24472 • Published 19 days ago • 53
MolmoWeb • Collection • This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 8 items • Updated about 9 hours ago • 24