Open to Collab

Richard Gurtsiev

r1char9

vilovnok

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning

upvoted a paper 17 days ago

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

upvoted a paper 17 days ago

Adaptive Multi-Agent Response Refinement in Conversational Systems


Organizations

None yet

upvoted 4 papers 17 days ago

Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning

Paper • 2511.07061 • Published 20 days ago • 3

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Paper • 2511.06411 • Published 21 days ago • 16

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published 19 days ago • 40

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published 21 days ago • 123

upvoted a paper 25 days ago

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published May 28 • 36

upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024 • 725

upvoted a paper 6 months ago

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Paper • 2506.06395 • Published Jun 5 • 132

upvoted an article 7 months ago

Article

⚗️ 🔥 Building High-Quality Datasets with distilabel and Prometheus 2

Jun 3, 2024

upvoted a paper about 1 year ago

The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

Paper • 2408.12503 • Published Aug 22, 2024 • 27
