1 7 2

Shuyao Xu

Tim-Xu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

upvoted a paper 2 months ago

Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference

authored a paper 5 months ago

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

View all activity

Organizations

upvoted a paper about 1 month ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 53

upvoted a paper 2 months ago

Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference

Paper • 2508.19559 • Published Aug 27 • 6

authored a paper 5 months ago

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

Paper • 2505.24850 • Published May 30 • 8

commented a paper 5 months ago

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

Paper • 2505.24850 • Published May 30 • 8 •

upvoted 3 papers 5 months ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published May 29 • 55

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning

Paper • 2505.24850 • Published May 30 • 8

Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression

Paper • 2505.19602 • Published May 26 • 13

published a model 7 months ago

infly/INFLogic-Qwen2.5-32B-RL-Preview

Text Generation • 33B • Updated Apr 25 • 67 • 4

updated a model 7 months ago

infly/INFLogic-Qwen2.5-32B-RL-Preview

Text Generation • 33B • Updated Apr 25 • 67 • 4

published a model 8 months ago

Tim-Xu/Qwen2.5-7B-kk-GRPO-s380

Updated Mar 18

liked a model 10 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 412k • • 12.8k

upvoted a paper 10 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 84

liked a dataset about 1 year ago

neuralblog/llama3-8b-mlp-neurons

Updated Jun 8, 2024 • 11 • 3

upvoted a collection over 1 year ago

lshort-transformers

Collection

Papers useful when writing the paper: "The Not So Short Transfromers" • 10 items • Updated May 24, 2024 • 1

Shuyao Xu

AI & ML interests

Recent Activity

Organizations

Tim-Xu's activity