Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

demo_lei_wang
lei-wang-0805831a2

AI & ML interests

LLMs

Recent Activity

upvoted a paper about 4 hours ago

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

upvoted a paper about 4 hours ago

Memento-Skills: Let Agents Design Agents

upvoted a paper about 4 hours ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Organizations

Collections 3

Papers 14

models 6

datasets 0

None public yet

Collections 3

Language Modeling Is Compression

SlimPajama-DC: Understanding Data Combinations for LLM Training

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Contrastive Decoding Improves Reasoning in Large Language Models

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

models 6

demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40

demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240

demolei/Qwen2.5-1.5B-Open-R1-Distill

demolei/Qwen-2.5-7B-Simple-RL

demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

demolei/sft_openassistant-guanaco
