Pavlo Molchanov's picture

Pavlo Molchanov PRO

pmolchanov

·

https://www.pmolchanov.com

AI & ML interests

Efficiency in Multi-Modal LLMs

Recent Activity

upvoted a paper 11 days ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

upvoted a paper 11 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

upvoted a paper about 1 month ago

Set Block Decoding is a Language Model Inference Accelerator

View all activity

Organizations

upvoted 2 papers 11 days ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published 14 days ago • 15

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published 13 days ago • 85

upvoted a paper about 1 month ago

Set Block Decoding is a Language Model Inference Accelerator

Paper • 2509.04185 • Published Sep 4 • 52

upvoted a collection 2 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 2 days ago • 73

upvoted a paper 2 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 36

upvoted a paper 4 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 157

upvoted a collection 4 months ago

Nemotron-H

Mamba-Transformer hybrid models • 10 items • Updated 9 days ago • 30

upvoted an article 5 months ago

Article

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

By

and 3 others •

Jun 10

• 7

upvoted a paper 5 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 34

upvoted a collection 5 months ago

Llama Nemotron

Open, Production-ready Enterprise Models • 11 items • Updated 9 days ago • 71

upvoted 6 papers 7 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 93

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Paper • 2504.11409 • Published Apr 15 • 9

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15 • 20

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 15

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 41

upvoted 2 articles 9 months ago

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

Dec 18, 2024

• 60

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 705

upvoted a paper 9 months ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30 • 22

upvoted a collection 10 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated 9 days ago • 298