4 5 5

Botao Yu

btyu

https://btyu.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

upvoted a paper 3 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

liked a dataset 5 months ago

osunlp/Mind2Web-2

View all activity

Organizations

upvoted 2 papers 3 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 84

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 124

liked a dataset 5 months ago

osunlp/Mind2Web-2

Viewer • Updated Oct 27 • 130 • 228 • 15

upvoted a paper 5 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

New activity in osunlp/SMolInstruct 6 months ago

Inquiry on Data Correction Methodology in SMolInstruct

#2 opened 10 months ago by

moshouxiaomu

updated a collection 11 months ago

LlaSMol

Collection

LLMs tuned on the SMolInstruct dataset for chemistry tasks. • 6 items • Updated Feb 4 • 2

liked a dataset about 1 year ago

osunlp/ScienceAgentBench

Viewer • Updated Oct 28, 2024 • 102 • 592 • 16

upvoted 2 papers about 1 year ago

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Paper • 2410.05080 • Published Oct 7, 2024 • 21

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 20

New activity in osunlp/SMolInstruct about 1 year ago

Can 't load and download

#1 opened about 1 year ago by

suyuanonly

updated a dataset about 1 year ago

osunlp/SMolInstruct

Updated Sep 18, 2024 • 2.17k • 47

liked a dataset over 1 year ago

MMMU/MMMU_Pro

Viewer • Updated Mar 8 • 5.19k • 6.86k • 41

authored a paper over 1 year ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 31

updated 2 models over 1 year ago

osunlp/LlaSMol-Mistral-7B

Updated May 6, 2024 • 17

osunlp/LlaSMol-Llama2-7B

Updated May 6, 2024 • 1