TongZheng's picture

5 44 4

TongZheng PRO

TongZheng1999

·

https://kidzheng.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 5 days ago

Training Data Efficiency in Multimodal Process Reward Models

liked a Space 6 days ago

EfficientReasoning/efficient_reasoning_online_judgement

upvoted a paper 6 days ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

View all activity

Organizations

submitted a paper to Daily Papers 6 days ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published 6 days ago • 24

authored a paper 4 months ago

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Paper • 2510.01444 • Published Oct 1, 2025 • 20

authored 2 papers 5 months ago

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Paper • 2509.09675 • Published Sep 11, 2025 • 28

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9, 2025 • 103

authored a paper 7 months ago

R1-RE: Cross-Domain Relationship Extraction with RLVR

Paper • 2507.04642 • Published Jul 7, 2025 • 7

authored a paper 9 months ago

Learning to Reason via Mixture-of-Thought for Logical Reasoning

Paper • 2505.15817 • Published May 21, 2025 • 18

authored a paper 11 months ago

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Paper • 2503.06594 • Published Mar 9, 2025 • 6

authored 4 papers 12 months ago

EIT: Enhanced Interactive Transformer

Paper • 2212.10197 • Published Dec 20, 2022 • 1

Towards Optimal Multi-draft Speculative Decoding

Paper • 2502.18779 • Published Feb 26, 2025 • 5

Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation

Paper • 2502.11223 • Published Feb 16, 2025 • 1

PartialFormer: Modeling Part Instead of Whole

Paper • 2310.14921 • Published Oct 23, 2023 • 1