Yan Varakin

ZDPLI

https://www.researchgate.net/profile/Yan-Varakin

AI & ML interests

All areas of NLP, computational mathematics, reinforcement learning, robotics.

Organizations

upvoted an article 3 months ago

Article

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Feb 2

• 3

upvoted an article 4 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

Jun 26

• 118

upvoted 2 articles 6 months ago

Article

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Apr 5, 2023

• 45

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 64

upvoted a paper 6 months ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 52

upvoted an article 6 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

• 238

upvoted 5 papers 6 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 36

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers

Paper • 2504.20752 • Published Apr 29 • 92

upvoted 3 papers 9 months ago

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Paper • 2501.11858 • Published Jan 21 • 7

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 76

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28

upvoted 2 papers 10 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 95

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7 • 27

upvoted 3 papers 11 months ago

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Paper • 2412.01822 • Published Dec 2, 2024 • 15

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

Paper • 2411.15139 • Published Nov 22, 2024 • 15

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

upvoted an article 11 months ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Nov 19, 2024

• 12
