Nikita Kezins

entfane

AI & ML interests

LLM post-training, adversarial training, safety, knowledge transfer

Recent Activity

updated a model 6 days ago

entfane/coder-reasoner-7Bv8

published a model 6 days ago

entfane/coder-reasoner-7Bv8

updated a model 6 days ago

entfane/coder-reasoner-7Bv7

upvoted an article about 1 month ago

Article

Self-Hosting LLaMA 3.1 70B (or any ~70B LLM) Affordably

Aug 20, 2024 • 26

upvoted 2 collections about 1 month ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 4 days ago • 46

RLCR

Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6 • 7

upvoted a collection about 2 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249

upvoted 2 collections 3 months ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jul 21 • 88

Qwen3Guard

7 items • Updated Sep 30 • 58

upvoted an article 3 months ago

Article

Everything You Need to Know about Knowledge Distillation

Mar 6 • 61

upvoted a collection 3 months ago

[NeurIPS 2025] RPC Resources

Sampled Reasoning Paths for NeurIPS 2025 Paper: A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning • 6 items • Updated Oct 23 • 8

upvoted a collection 4 months ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated May 5 • 25

upvoted 3 articles 5 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26 • 177

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022 • 385

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30 • 202

upvoted an article 7 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024 • 745

upvoted an article 9 months ago

Article

PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face

Nov 11, 2024 • 20