Community Blog & Articles

huggingface_hub, python, announcement

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

October 27, 2025

lerobot, robotics

LeRobot v0.4.0: Supercharging OSS Robot Learning

October 24, 2025

announcement, open-source, community

Building the Open Agent Ecosystem Together: Introducing OpenEnv

October 23, 2025

hub, partnerships, security

Hugging Face and VirusTotal collaborate to strengthen AI security

October 22, 2025

announcement, nlp, open-source

Sentence Transformers is joining Hugging Face!

October 22, 2025

ocr, vision, multimodal

Supercharge your OCR Pipelines with Open Models

October 21, 2025

datasets, open-source, vision

Unlock the power of images with AI Sheets

October 21, 2025

AI for Food Allergies

October 16, 2025

intel, cpu, gpt-oss

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

October 16, 2025

intel, optimum, quantization

Get your VLM running in 3 simple steps on Intel CPUs

October 15, 2025

Nemotron-Personas-India: Synthesized Data for Sovereign AI

October 13, 2025

Arm will be @ PyTorch Conference, Join Us!

October 10, 2025

BigCodeArena: Judging code generations end to end with code executions

October 7, 2025

SOTA OCR with Core ML and dots.ocr

October 2, 2025

Community Articles

New: Articles from Team or Enterprise organizations will get promoted to the main section.

The Optimal Architecture for Small Language Models

Deriving the PPO Loss from First Principles

Continuity as a First-Class System Property in Artificial Intelligence

KV Caching Explained: Optimizing Transformer Inference Efficiency

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

Uncensor any LLM with abliteration

Deriving the DPO Loss from First Principles

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

LLM based Audio models

Small Language Models (SLM): A Comprehensive Overview

Mastering Tensor Dimensions in Transformers

What makes good reasoning data

Why You Should Care About Partial Differential Equations (PDEs)

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Encoding the World's Medical Knowledge into 970K

Code a simple RAG from scratch

Everything You Need to Know about Knowledge Distillation

Diffusion Language Models: The New Paradigm

We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️