Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Blog, Articles, and discussions
New Article
community
guide
open source collab
partnerships
research
NLP
Audio
CV
RL
ethics
Diffusion
Game Development
RLHF
Leaderboard
Case Studies
LeRobot
Inference Providers
Community Articles
view all
We’re open-sourcing our text-to-image model and the process behind it
8 days ago
•
65
Text-to-image Architectural Experiments
7 days ago
•
31
Introducing Cogito v2.1
about 23 hours ago
•
16
Projected Abliteration
26 days ago
•
26
AI Model Optimization More Flexible Than Ever
3 days ago
•
12
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
15 days ago
•
49
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
5 days ago
•
11
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
175
Norm-Preserving Biprojected Abliteration
14 days ago
•
14
Uncensor any LLM with abliteration
Jun 13, 2024
•
721
Why Did MiniMax M2 End Up as a Full Attention Model?
21 days ago
•
65
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
17 days ago
•
42
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
2 days ago
•
8
Granite 4.0 Nano: Just how small can you go?
23 days ago
•
119
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
1 day ago
•
7
Visualizing How VLMs Work
Oct 7
•
45
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
1 day ago
•
6
Join the AMD Open Robotics Hackathon
7 days ago
•
6
To Think or Not to Think: A Router for Hybrid LLMs
4 days ago
•
6
PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs
Jan 24
•
49
guide
expert-acceleration-program
case-studies
Accelerating Document AI
78
November 21, 2022
expert-acceleration-program
case-study
case-studies
How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap
1
May 19, 2022
Previous
1
2
Next
Community Articles
Sort: Trending
We’re open-sourcing our text-to-image model and the process behind it
8 days ago
•
65
Text-to-image Architectural Experiments
7 days ago
•
31
Introducing Cogito v2.1
about 23 hours ago
•
16
Projected Abliteration
26 days ago
•
26
AI Model Optimization More Flexible Than Ever
3 days ago
•
12
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
15 days ago
•
49
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
5 days ago
•
11
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
175
Norm-Preserving Biprojected Abliteration
14 days ago
•
14
Uncensor any LLM with abliteration
Jun 13, 2024
•
721
Why Did MiniMax M2 End Up as a Full Attention Model?
21 days ago
•
65
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
17 days ago
•
42
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
2 days ago
•
8
Granite 4.0 Nano: Just how small can you go?
23 days ago
•
119
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
1 day ago
•
7
Visualizing How VLMs Work
Oct 7
•
45
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
1 day ago
•
6
Join the AMD Open Robotics Hackathon
7 days ago
•
6
To Think or Not to Think: A Router for Hybrid LLMs
4 days ago
•
6
PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs
Jan 24
•
49
View all articles