view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement 15 days ago • 4
Open-Endedness is Essential for Artificial Superhuman Intelligence Paper • 2406.04268 • Published Jun 6, 2024 • 14
view article Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive Apr 9, 2024 • 30