Xiaochuan Li PRO

lixiaochuan2020

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
published a dataset 8 days ago
lixiaochuan2020/octothinker_decay_stage2_1Btokens
updated a dataset 9 days ago
lixiaochuan2020/octothinker_decay_stage2_1Btokens

Organizations

  • Tsinghua University
  • XLang NLP Lab
  • Hugging Face Discord Community
  • XLANG-inner

Collections 1

Papers
  • Defeating the Training-Inference Mismatch via FP16

    Paper • 2510.26788 • Published Oct 30 • 29
  • Kimi Linear: An Expressive, Efficient Attention Architecture

    Paper • 2510.26692 • Published Oct 30 • 116

Papers 4

arxiv:2505.13227
arxiv:2410.14208
arxiv:2404.07972
arxiv:2310.05177

Models 1

lixiaochuan2020/octothinker_reproduce_llama3.2_1b_stable_stage_10B_decay_stage_short_1B

Text Generation • 1B • Updated Nov 7 • 286

Datasets 1

lixiaochuan2020/octothinker_decay_stage2_1Btokens

Updated 9 days ago • 185