Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 28 days ago • 233
Power-LM Collection Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17, 2024 • 16
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging Aug 19, 2024 • 79