20 20 305

Kurian Benoy PRO

kurianbenoy

https://kurianbenoy.com

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

ai4bharat/MANGO

liked a dataset 10 days ago

nvidia/Nemotron-Personas-India

upvoted a collection 13 days ago

Speech Evals

View all activity

Organizations

liked a dataset 1 day ago

ai4bharat/MANGO

Viewer • Updated May 13 • 51k • 328 • 5

liked a dataset 10 days ago

nvidia/Nemotron-Personas-India

Viewer • Updated 10 days ago • 3M • 3.15k • 27

upvoted a collection 13 days ago

Speech Evals

Collection

Synthesized speech evals generated by MistralAI from popular text evaluation datasets to evaluate spoken-language reasoning capabilities of Audio LLMs • 3 items • Updated Jul 18 • 9

liked a model 14 days ago

ai4bharat/Cadence

Token Classification • 1.0B • Updated Jun 18 • 7.64k • 13

liked a Space 16 days ago

Maintain the unmaintainable

📚

Visualize connections between transformer models

reacted to Molbap's post with 🔥 16 days ago

Post

2921

🚀 New blog: Maintain the unmaintainable – 1M+ Python LOC, 400+ models

How do you stop a million-line library built by thousands of contributors from collapsing under its own weight?
At 🤗 Transformers, we do it with explicit software-engineering tenets, principles that make the codebase hackable at scale.

🔍 Inside the post:
– One Model, One File: readability first — you can still open a modeling file and see the full logic, top to bottom.
– Modular Transformers: visible inheritance that cuts maintenance cost by ~15× while keeping models readable.
– Config-Driven Performance: FlashAttention, tensor parallelism, and attention scheduling are config-level features, not rewrites.

Written with @lysandre ,@pcuenq and @yonigozlan , this is a deep dive into how Transformers stays fast, open, and maintainable.

Read it here → transformers-community/Transformers-tenets