🎚️ Batch Normalization — Quand ton réseau a besoin de chill pills ! 😤➡️😌 By RDTvlokip • 38 minutes ago • 1
🎚️ Batch Normalization — When your neural network needs anger management! 😤➡️😌 By RDTvlokip • about 1 hour ago • 1
Australian-made LLM beats OpenAI and Google at legal retrieval By isaacus and 2 others • about 9 hours ago • 9
Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes By nvidia and 1 other • about 13 hours ago • 4
Promoter-GPT: Writing DNA Instructions with Language Models By hugging-science • about 20 hours ago • 8
TIL: How a Harmless Refactor Exposed a Hidden CUDA Bug in Vision-Language Models By albertvillanova • 1 day ago
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard By nvidia and 4 others • 1 day ago • 6
🔄 Transfer Learning — Quand l'IA apprend de l'expérience comme toi ! 🎓🚀 By RDTvlokip • 2 days ago • 1
Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models By nvidia and 3 others • 3 days ago • 11
Art of Focus: Page-Aware Sparse Attention and Ling 2.0’s Quest for Efficient Context Length Scaling By RichardBian and 19 others • 3 days ago • 14
Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text By isaacchung and 2 others • 3 days ago • 27
GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms By otellm and 15 others • 3 days ago • 12