MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Paper • 2502.04235 • Published Feb 6 • 23
Memory Retrieval and Consolidation in Large Language Models through Function Tokens Paper • 2510.08203 • Published Oct 9 • 9
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published Oct 8 • 30
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published Oct 2 • 95
Generative Universal Verifier as Multimodal Meta-Reasoner Paper • 2510.13804 • Published Oct 15 • 25
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures Paper • 2510.14616 • Published Oct 16 • 11
Parallel Loop Transformer for Efficient Test-Time Computation Scaling Paper • 2510.24824 • Published Oct 28 • 16
MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity Paper • 2511.03146 • Published Nov 5 • 7
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Paper • 2511.02779 • Published Nov 4 • 57
Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space Paper • 2510.04476 • Published Oct 6 • 16
DINOv3 Collection • DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 400
Deep Ignorance Collection • This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai • 43 items • Updated about 1 month ago • 6
H-Net Collection • The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11 • 20
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale Paper • 2505.03005 • Published May 5 • 36
Train 400x faster Static Embedding Models with Sentence Transformers Article • Published Jan 15 • 220