Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

The Massive Legal Embedding Benchmark (MLEB)

upvoted an article 8 days ago

Australian-made LLM beats OpenAI and Google at legal retrieval

upvoted an article about 1 month ago

There is no such thing as a tokenizer-free lunch

View all activity

Organizations

None yet

upvoted a paper 8 days ago

The Massive Legal Embedding Benchmark (MLEB)

Paper • 2510.19365 • Published 8 days ago • 17

upvoted an article 8 days ago

Article

Australian-made LLM beats OpenAI and Google at legal retrieval

and 2 others •

8 days ago

• 25

upvoted an article about 1 month ago

Article

There is no such thing as a tokenizer-free lunch

•

Sep 25

• 84

upvoted 2 papers about 2 months ago

Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12 • 26

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8 • 16

upvoted 2 papers 5 months ago

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30 • 14

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Paper • 2505.11711 • Published May 16 • 11

upvoted an article 5 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

• 225

upvoted an article 6 months ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

Apr 25

• 300

upvoted a paper 6 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 135

upvoted an article 6 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 44

upvoted a collection 8 months ago

Gemma 3

Collection

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 55 items • Updated about 6 hours ago • 89

upvoted 2 articles 9 months ago

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 884

upvoted a collection 9 months ago

EvaByte

Collection

3 items • Updated Jan 21 • 4

upvoted an article 10 months ago

Article

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 103

upvoted a paper 10 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted a paper about 1 year ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

upvoted 2 articles about 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 271

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 78

Stephen Oates PRO

AI & ML interests

Recent Activity

Organizations

soates's activity

Australian-made LLM beats OpenAI and Google at legal retrieval

There is no such thing as a tokenizer-free lunch

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Tiny Agents: a MCP-powered agent in 50 lines of code

Gotchas in Tokenizer Behavior Every Developer Should Know

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Mastering Tensor Dimensions in Transformers

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging