Wei He's picture

8 7 4

Wei He

hewei2001

·

https://hwcoder.top/about

hewei2001

AI & ML interests

LLM

Organizations

authored 6 papers 3 months ago

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 26

Better Process Supervision with Bi-directional Rewarding Signals

Paper • 2503.04618 • Published Mar 6, 2025

LongCat-Flash Technical Report

Paper • 2509.01322 • Published Sep 1, 2025 • 6

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

LongCat-Flash-Thinking Technical Report

Paper • 2509.18883 • Published Sep 23, 2025 • 4

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30, 2025 • 19

authored 4 papers 4 months ago

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Paper • 2411.16579 • Published Nov 25, 2024 • 3

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Paper • 2411.00750 • Published Nov 1, 2024 • 1

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation

Paper • 2503.12854 • Published Mar 17, 2025

authored 7 papers about 1 year ago

Towards Understanding the Capability of Large Language Models on Code Clone Detection: A Survey

Paper • 2308.01191 • Published Aug 2, 2023 • 1

The Rise and Potential of Large Language Model Based Agents: A Survey

Paper • 2309.07864 • Published Sep 14, 2023 • 7

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

Paper • 2402.10685 • Published Feb 16, 2024 • 1

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

Paper • 2402.11550 • Published Feb 18, 2024 • 18

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 24

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

Paper • 2404.00884 • Published Apr 1, 2024

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Paper • 2410.18798 • Published Oct 24, 2024 • 21