DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 25 days ago • 234
Running on CPU Upgrade Featured 2.69k The Smol Training Playbook 📚 2.69k The secrets to building world-class LLMs
VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator Paper • 2510.13454 • Published Oct 15 • 8
Learning an Image Editing Model without Image Editing Pairs Paper • 2510.14978 • Published Oct 16 • 8
SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation Paper • 2505.19151 • Published May 25 • 2
Running 3.6k The Ultra-Scale Playbook 🌌 3.6k The ultimate guide to training LLM on large GPU Clusters
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Oct 30 • 77