
seongyun_lee

Seongyun

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

Efficient Long Context Language Model Retrieval with Compression

authored a paper 4 days ago

Lost in the Noise: How Reasoning Models Fail with Contextual Distractors

upvoted a paper 5 days ago

Lost in the Noise: How Reasoning Models Fail with Contextual Distractors


Organizations

upvoted a paper 5 days ago

Lost in the Noise: How Reasoning Models Fail with Contextual Distractors

Paper • 2601.07226 • Published 6 days ago • 27

upvoted 2 papers 6 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8, 2025 • 75

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5, 2025 • 51

upvoted a paper 7 months ago

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11, 2025 • 101

upvoted a paper 8 months ago

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15, 2025 • 26

upvoted a paper 9 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24, 2025 • 121

upvoted a paper 12 months ago

How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

Paper • 2410.07571 • Published Oct 10, 2024 • 2

upvoted an article 12 months ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4, 2025 • 1.32k

upvoted a collection about 1 year ago

Reasoning Datasets

Collection • Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3, 2025 • 26

upvoted a paper about 1 year ago

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10, 2025 • 75

upvoted a collection about 1 year ago

🤖 Agents

Collection • 21 items • Updated Dec 31, 2024 • 173

upvoted 2 papers about 1 year ago

OpenAI o1 System Card

Paper • 2412.16720 • Published Dec 21, 2024 • 36

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 13

upvoted a paper over 1 year ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126

upvoted an article over 1 year ago

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Article • Jul 27, 2024

upvoted 3 papers over 1 year ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 42

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3, 2024 • 103

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 46

upvoted an article over 1 year ago

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Article • Apr 15, 2024 • 191

upvoted a collection over 1 year ago

System Message Generalization

Collection • 11 items • Updated Jun 7, 2024 • 4
