Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted a paper about 6 hours ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

liked a model 1 day ago

open-thoughts/OpenThinker-Agent-v1

liked a model 2 days ago

EssentialAI/rnj-1-instruct

View all activity

Organizations

upvoted a paper about 6 hours ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published Aug 13 • 15

liked a model 1 day ago

open-thoughts/OpenThinker-Agent-v1

Text Generation • 8B • Updated 2 days ago • 113 • 39

liked a model 2 days ago

EssentialAI/rnj-1-instruct

8B • Updated 3 days ago • 441k • 124

upvoted 2 articles 3 days ago

Article

Yay! Organizations can now publish blog Articles

Jan 20

•

53

Article

We Got Claude to Fine-Tune an Open Source LLM

5 days ago

•

329

liked a dataset 4 days ago

Anthropic/AnthropicInterviewer

Viewer • Updated 4 days ago • 1.25k • 3.52k • 114

liked a Space 5 days ago

Evaluation Guidebook

Display evaluation metrics for LLM benchmarks

liked a model 6 days ago

mistralai/Ministral-3-3B-Instruct-2512

4B • Updated 3 days ago • 53.5k • 108

upvoted 3 papers 6 days ago

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28 • 8

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 23

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published 28 days ago • 13

liked 4 models 7 days ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated 7 days ago • 28.8k • • 801

deepseek-ai/DeepSeek-V3.2-Speciale

Text Generation • 685B • Updated 7 days ago • 8.02k • 544

arcee-ai/Trinity-Nano-Preview

Text Generation • 6B • Updated 7 days ago • 1.02k • 48

arcee-ai/Trinity-Mini

Text Generation • 26B • Updated 7 days ago • 1.24k • 118

upvoted an article 7 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

8 days ago

•

224

liked a model 9 days ago

nvidia/Nemotron-Orchestrator-8B

Text Generation • 8B • Updated 6 days ago • 3.57k • 374

liked a model 10 days ago

deepseek-ai/DeepSeek-Math-V2

Text Generation • 685B • Updated 11 days ago • 9.5k • 642

liked a model 11 days ago

PrimeIntellect/INTELLECT-3

Text Generation • 107B • Updated 11 days ago • 14.8k • 179

upvoted a paper 11 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 137