sprocket-lab

university

https://sprocketlab.github.io/

AI & ML interests

None defined yet.

Recent Activity

zihengh1 updated a dataset 5 days ago

sprocket-lab/PAJAMA

abeQ213 new activity 11 days ago

sprocket-lab/PAJAMA:Add PAJAMA validation and test splits

abeQ213 new activity 11 days ago

sprocket-lab/PAJAMA:Add PAJAMA validation and test splits

View all activity

updated a dataset 5 days ago

sprocket-lab/PAJAMA

Viewer • Updated 5 days ago • 19.8k • 95

in sprocket-lab/PAJAMA 11 days ago

Add PAJAMA validation and test splits

#6 opened 11 days ago by

Add PAJAMA validation and test splits

#5 opened 11 days ago by

Add PAJAMA validation and test splits

#4 opened 11 days ago by

Add PAJAMA validation and test splits

#3 opened 11 days ago by

published a dataset 11 days ago

sprocket-lab/PAJAMA

Viewer • Updated 5 days ago • 19.8k • 95

in sprocket-lab/PAJAMA 11 days ago

Add PAJAMA validation and test splits

#2 opened 11 days ago by

in sprocket-lab/PAJAMA 11 days ago

Add PAJAMA validation and test splits

#2 opened 11 days ago by

Smoke test via PR

#1 opened 11 days ago by

updated a dataset 29 days ago

ZhiqiGao/Text2Opt-Bench

Updated 9 days ago • 3.95k

published a dataset 29 days ago

ZhiqiGao/Text2Opt-Bench

Updated 9 days ago • 3.95k

submitted a paper to Daily Papers about 2 months ago

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published Apr 1 • 28

authored a paper 2 months ago

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published Mar 25 • 30

submitted a paper to Daily Papers 3 months ago

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning

Paper • 2603.09160 • Published Mar 10 • 17

authored 6 papers 11 months ago

Geometry-Aware Adaptation for Pretrained Models

Paper • 2307.12226 • Published Jul 23, 2023

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

Paper • 2505.00358 • Published May 1, 2025 • 26

Shrinking the Generation-Verification Gap with Weak Verifiers

Paper • 2506.18203 • Published Jun 22, 2025 • 2

Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation

Paper • 2506.10403 • Published Jun 12, 2025 • 2

The ALCHEmist: Automated Labeling 500x CHEaper Than LLM Data Annotators

Paper • 2407.11004 • Published Jun 25, 2024

ScriptoriumWS: A Code Generation Assistant for Weak Supervision

Paper • 2502.12366 • Published Feb 17, 2025