Blog, Articles, and discussions

Community Articles

We’re open-sourcing our text-to-image model and the process behind it

Text-to-image Architectural Experiments

Introducing Cogito v2.1

about 23 hours ago

Projected Abliteration

AI Model Optimization More Flexible Than Ever

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

KV Caching Explained: Optimizing Transformer Inference Efficiency

Norm-Preserving Biprojected Abliteration

Uncensor any LLM with abliteration

Why Did MiniMax M2 End Up as a Full Attention Model?

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling

Granite 4.0 Nano: Just how small can you go?

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Visualizing How VLMs Work

🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset

Join the AMD Open Robotics Hackathon

To Think or Not to Think: A Router for Hybrid LLMs

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

guideexpert-acceleration-programcase-studies

Accelerating Document AI

November 21, 2022

expert-acceleration-programcase-studycase-studies

How Sempre Health is leveraging the Expert Acceleration Program to accelerate their ML roadmap

Community Articles

We’re open-sourcing our text-to-image model and the process behind it

Text-to-image Architectural Experiments

Introducing Cogito v2.1

about 23 hours ago

Projected Abliteration

AI Model Optimization More Flexible Than Ever

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

KV Caching Explained: Optimizing Transformer Inference Efficiency

Norm-Preserving Biprojected Abliteration

Uncensor any LLM with abliteration

Why Did MiniMax M2 End Up as a Full Attention Model?

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling

Granite 4.0 Nano: Just how small can you go?

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Visualizing How VLMs Work

🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset

Join the AMD Open Robotics Hackathon

To Think or Not to Think: A Router for Hybrid LLMs

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

View all articles