Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xzhren's picture
2 4 2

xzhren

xingzhang
21world's profile picture Cay0516's profile picture 0xSojalSec's profile picture
·

AI & ML interests

None yet

Organizations

Qwen's profile picture

upvoted a paper 9 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 66
upvoted a paper 10 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376
upvoted a collection about 1 year ago

Qwen2.5-Math

Collection
Math-specific model series based on Qwen2.5 • 11 items • Updated Jul 21 • 87
upvoted a collection over 1 year ago

Qwen2

Collection
Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 371
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs