StepWise
AI & ML interests
Natural Language Processing at Yale
Papers
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
References Improve LLM Alignment in Non-Verifiable Domains
Models (94)
yale-nlp/AgentTrek-1.0-32B_webarena-verified_milestone-bert
0.1B • Updated • 9
yale-nlp/gpt-oss-20b_webarena-verified_stuck-bert
0.1B • Updated • 11
yale-nlp/AgentTrek-1.0-32B_webarena-verified_stuck-bert
0.1B • Updated • 13
yale-nlp/gpt-oss-20b_webarena-verified_milestone-bert
0.1B • Updated • 13
yale-nlp/modernbert-evocua-milestone-detector
0.1B • Updated • 13
yale-nlp/modernbert-evocua-stuck-detector
0.1B • Updated • 12
yale-nlp/modernbert-qwen-milestone-detector
0.1B • Updated • 14
yale-nlp/modernbert-qwen-stuck-detector
0.1B • Updated • 13
yale-nlp/Qwen3-VL-8B-Anchor-Windows
770k • Updated • 2
yale-nlp/Qwen2.5-VL-7B-Anchor-Windows
849k • Updated
Datasets (28)
yale-nlp/Anchor
Viewer • Updated • 30.6k • 45
yale-nlp/MedTutor
Updated • 302 • 2
yale-nlp/SciArena
Viewer • Updated • 13.2k • 61 • 25
yale-nlp/SciReas-Pro
Viewer • Updated • 1.36k • 14 • 1
yale-nlp/MSRS
Viewer • Updated • 2.44k • 75 • 2
yale-nlp/SciArena-Eval
Viewer • Updated • 2k • 6
yale-nlp/SciArena-with-paperbank
Viewer • Updated • 15.2k • 12
yale-nlp/SciDQA
Viewer • Updated • 2.94k • 120 • 2
yale-nlp/AbGen
Viewer • Updated • 3.3k • 25 • 3
yale-nlp/LimitGen
Updated • 70