pavan kumar avn

pk3388

AI & ML interests

None yet

Recent Activity

Organizations

published a model about 1 month ago

pk3388/medgemma-4b-medical-qa-finetuned

Updated Oct 29

upvoted a paper 3 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 72

liked a model 3 months ago

FreedomIntelligence/HuatuoGPT-o1-8B

Text Generation • 8B • Updated Dec 30, 2024 • 487 • 54

upvoted 3 papers 3 months ago

Lost in Embeddings: Information Loss in Vision-Language Models

Paper • 2509.11986 • Published Sep 15 • 28

HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering

Paper • 2509.09713 • Published Sep 8 • 24

Inpainting-Guided Policy Optimization for Diffusion Large Language Models

Paper • 2509.10396 • Published Sep 12 • 15

upvoted an article 3 months ago

Jupyter Agents: training LLMs to reason with notebooks

Article • Sep 10

upvoted 4 papers 3 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28 • 110

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

upvoted a paper 4 months ago

Autoregressive Universal Video Segmentation Model

Paper • 2508.19242 • Published Aug 26 • 28

updated a dataset 4 months ago

pk3388/legos

Viewer • Updated Aug 27 • 36 • 61

published a dataset 4 months ago

pk3388/legos

Viewer • Updated Aug 27 • 36 • 61

upvoted 6 papers 4 months ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6 • 58

Adapting Vision-Language Models Without Labels: A Comprehensive Survey

Paper • 2508.05547 • Published Aug 7 • 11
