ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases By QuentinJG and 4 others • 3 days ago • 34
Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness By Steveeeeeeen • 3 days ago • 10
⚡ Power, Heat, and Intelligence ☁️ - AI Data Centers Explained 🏭 By sasha and 1 other • 3 days ago • 8
compar:IA Leaderboard: From User Votes to a Participatory Model Ranking By comparIA • 5 days ago • 6
Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation By exploding-gradients • Sep 16 • 11
Budget Alignment: Making Models Reason in the User’s Language By shanchen and 2 others • 4 days ago • 5