Revs64 (Revs)

upvoted a paper 2 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 94

upvoted a paper 4 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

upvoted a paper 5 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

upvoted a paper 6 months ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

upvoted an article 8 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 308

upvoted 2 articles 9 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.31k

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 882

upvoted 2 papers 10 months ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 65

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147

upvoted 2 papers 11 months ago

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 118

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 70

upvoted 4 papers about 1 year ago

upvoted an article about 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 270

upvoted 4 papers about 1 year ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56

Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Paper • 2408.08072 • Published Aug 15, 2024 • 34

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 126

Revs

AI & ML interests

Organizations

SSRL: Self-Search Reinforcement Learning

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

SmolVLM2: Bringing Video Understanding to Every Device

Open-source DeepResearch – Freeing our search agents

Open-R1: a fully open reproduction of DeepSeek-R1

YuLan-Mini: An Open Data-efficient Language Model

Apollo: An Exploration of Video Understanding in Large Multimodal Models

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

WonderWorld: Interactive 3D Scene Generation from a Single Image

Imagine yourself: Tuning-Free Personalized Image Generation

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Controllable Text Generation for Large Language Models: A Survey

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Revs

AI & ML interests

Organizations

Revs64's activity

SmolVLM2: Bringing Video Understanding to Every Device

Open-source DeepResearch – Freeing our search agents

Open-R1: a fully open reproduction of DeepSeek-R1

Fine-tuning LLMs to 1.58bit: extreme quantization made easy