OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview Image-Text-to-Text • 0.4B • Updated Aug 29 • 45.8k • 82
MemAscend: System Memory Optimization for SSD-Offloaded LLM Fine-Tuning Paper • 2505.23254 • Published May 29
Analysis and Optimized CXL-Attached Memory Allocation for Long-Context LLM Fine-Tuning Paper • 2507.03305 • Published Jul 4
Running 3.6k The Ultra-Scale Playbook 🌌 3.6k The ultimate guide to training LLM on large GPU Clusters
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚 Aug 26, 2024 • 82
view article Article An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct Jun 11, 2024 • 66