aquif-4 Collection aquif-4-Exp is the first hybrid attention model from aquif, built on a strong architecture with 256 experts. • 2 items • Updated 4 days ago • 3
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Paper • 2510.05069 • Published 19 days ago • 12
view article Article Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B By nvidia and 9 others • Aug 18 • 30
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation Paper • 2412.11919 • Published Dec 16, 2024 • 36
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models Paper • 2410.10139 • Published Oct 14, 2024 • 52
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated Jul 21 • 345
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference Paper • 2407.14057 • Published Jul 19, 2024 • 46
Personal Favorites Collection Recommended models I use often or like for any reason. I recommend reading their cards for more details. • 10 items • Updated Dec 24, 2024 • 91
Quyen Collection State-of-the-arts General LLMs - based on Qwen1.5 • 26 items • Updated Feb 13, 2024 • 12
PolyLM: An Open Source Polyglot Large Language Model Paper • 2307.06018 • Published Jul 12, 2023 • 26