2 464 2

Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

demo_lei_wang
lei-wang-0805831a2

AI & ML interests

LLMs

Recent Activity

upvoted a paper 1 day ago

DeepSeek-OCR: Contexts Optical Compression

upvoted a paper 1 day ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

upvoted a paper 1 day ago

Attention Sinks in Diffusion Language Models

View all activity

Organizations

Collections 3

View 3 collections

Papers 13

models 6

datasets 0

None public yet

Lei Wang

AI & ML interests

Recent Activity

Organizations

Collections 3

Language Modeling Is Compression

SlimPajama-DC: Understanding Data Combinations for LLM Training

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Contrastive Decoding Improves Reasoning in Large Language Models

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Language Modeling Is Compression

SlimPajama-DC: Understanding Data Combinations for LLM Training

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Contrastive Decoding Improves Reasoning in Large Language Models

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Papers 13

models 6

demolei/qwen2_5_vl_7b_grpo_chartqa_filtered_40

demolei/Qwen2.5-VL-7B-Instruct-chartqa_filtered_240

demolei/Qwen2.5-1.5B-Open-R1-Distill

demolei/Qwen-2.5-7B-Simple-RL

demolei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

demolei/sft_openassistant-guanaco

datasets 0

Lei Wang

AI & ML interests

Recent Activity

Organizations

Collections 3

Papers 13

models 6 Sort: Recently updated

datasets 0

models 6