YANG SHU's picture

7 22

YANG SHU

babytreecc

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

authored a paper about 2 months ago

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

upvoted a paper 2 months ago

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

View all activity

Organizations

upvoted a paper 6 days ago

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

Paper • 2510.04081 • Published Oct 5 • 23

upvoted a paper 2 months ago

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

Paper • 2509.00544 • Published Aug 30 • 11

upvoted a collection 3 months ago

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 145

upvoted a paper 9 months ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

upvoted an article 9 months ago

Article

Open R1: Update #2

Feb 10

•

218

upvoted a collection 11 months ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116

upvoted an article over 1 year ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

Jun 20, 2024

•

26