Defa Zhu

mathfinder
https://zhudefa.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 15 days ago
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
authored a paper 15 days ago
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
authored a paper 15 days ago
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning

Organizations

None yet

commented 3 papers 9 months ago

Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts

Paper • 2503.16057 • Published Mar 20 • 14 • 2

Frac-Connections: Fractional Extension of Hyper-Connections

Paper • 2503.14125 • Published Mar 18 • 22 • 5
