Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated 5 days ago • 146
view article Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve May 20 • 50
⚔️ BigCodeArena Collection Unveiling More Reliable Human Preferences in Code Generation via Execution • 8 items • Updated Oct 13 • 6
view article Article BigCodeArena: Judging code generations end to end with code executions Oct 7 • 17
miniCTX Collection miniCTX: Neural Theorem Proving with (Long-)Contexts (ICLR 2025 Oral) • 8 items • Updated Mar 19 • 2
L1 Collection L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13 • 8
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22, 2024 • 42
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7 • 64
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 88
TxGemma Release Collection Collection of open models to accelerate the development of therapeutics. • 5 items • Updated Jul 10 • 66
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 45
view article Article Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model Aug 22, 2023 • 37
Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Sep 18 • 95