TianlaiChen
's Collections
papers
updated
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through
Two-Stage Rule-Based RL
Paper
•
2503.07536
•
Published
•
88
Seedream 2.0: A Native Chinese-English Bilingual Image Generation
Foundation Model
Paper
•
2503.07703
•
Published
•
37
Gemini Embedding: Generalizable Embeddings from Gemini
Paper
•
2503.07891
•
Published
•
45
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Paper
•
2503.07572
•
Published
•
47
Implicit Reasoning in Transformers is Reasoning through Shortcuts
Paper
•
2503.07604
•
Published
•
23
Beyond Decoder-only: Large Language Models Can be Good Encoders for
Machine Translation
Paper
•
2503.06594
•
Published
•
6
A Survey of Efficient Reasoning for Large Reasoning Models: Language,
Multimodality, and Beyond
Paper
•
2503.21614
•
Published
•
42
Exploring Data Scaling Trends and Effects in Reinforcement Learning from
Human Feedback
Paper
•
2503.22230
•
Published
•
45
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion
Transformers
Paper
•
2504.10483
•
Published
•
21
Efficient Reasoning Models: A Survey
Paper
•
2504.10903
•
Published
•
21
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Paper
•
2504.11393
•
Published
•
18
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation
through Pretraining, SFT, and RL
Paper
•
2504.11455
•
Published
•
14
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Paper
•
2504.10479
•
Published
•
306
Scaling Data-Constrained Language Models
Paper
•
2305.16264
•
Published
•
16