Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
junan zhang's picture
4 6 9

junan zhang

viewfinder-annn
Hecheng0625's profile picture CHIRUCS's profile picture HarryHe's profile picture
·

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago
laion/LAION-DISCO-12M
liked a Space 17 days ago
mimbres/YourMT3
upvoted a paper about 1 month ago
OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation
View all activity

Organizations

Amphion's profile picture

authored 6 papers 3 months ago

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published Oct 13, 2024 • 54

Overview of the Amphion Toolkit (v0.2)

Paper • 2501.15442 • Published Jan 26 • 3

Metis: A Foundation Speech Generation Model with Masked Generative Pre-training

Paper • 2502.03128 • Published Feb 5 • 2

AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement

Paper • 2501.15417 • Published Jan 26 • 1

TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling

Paper • 2508.16790 • Published Aug 22 • 10

Vevo2: Bridging Controllable Speech and Singing Voice Generation via Unified Prosody Learning

Paper • 2508.16332 • Published Aug 22
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs