Young-Jun Lee's picture

Young-Jun Lee PRO

passing2961

·

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper 2 days ago

Scaling Test-Time Compute for Agentic Coding

upvoted a paper 2 days ago

Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL

upvoted a paper 5 days ago

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

View all activity

Organizations

upvoted 2 papers 2 days ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published 10 days ago • 9

Abstain-R1: Calibrated Abstention and Post-Refusal Clarification via Verifiable RL

Paper • 2604.17073 • Published 8 days ago • 8

upvoted 6 papers 5 days ago

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

Paper • 2604.18584 • Published 6 days ago • 14

SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published 7 days ago • 22

When Can LLMs Learn to Reason with Weak Supervision?

Paper • 2604.18574 • Published 6 days ago • 24

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 6 days ago • 78

PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research

Paper • 2604.15411 • Published 10 days ago • 4

The Amazing Agent Race: Strong Tool Users, Weak Navigators

Paper • 2604.10261 • Published 9 days ago • 7

upvoted a paper 11 days ago

Toward Autonomous Long-Horizon Engineering for ML Research

Paper • 2604.13018 • Published 12 days ago • 34

upvoted 2 papers 12 days ago

QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation

Paper • 2604.08570 • Published Mar 25 • 124

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published 17 days ago • 66