1 15 6

Abdulhakeem Adefioye

kokolamba

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations

upvoted a paper 6 days ago

Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

updated a model 8 days ago

kokolamba/moe-mha

View all activity

Organizations

upvoted 2 papers 6 days ago

ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations

Paper • 2505.02819 • Published May 5 • 26

Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

Paper • 2508.04581 • Published Aug 6 • 5

updated a model 8 days ago

kokolamba/moe-mha

Updated 8 days ago • 17

published a model 8 days ago

kokolamba/moe-mha

Updated 8 days ago • 17

updated a model 8 days ago

kokolamba/moe-kv-128

Updated 8 days ago • 17

published a model 8 days ago

kokolamba/moe-kv-128

Updated 8 days ago • 17

updated a model 8 days ago

kokolamba/moe-o-192

Updated 8 days ago • 11

published a model 8 days ago

kokolamba/moe-o-192

Updated 8 days ago • 11

upvoted an article 22 days ago

Article

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

•

Mar 18, 2024

• 13

updated 8 models 27 days ago

published 3 models 27 days ago

kokolamba/SubspaceDecoder_mla192-0-0

Updated 27 days ago • 28

kokolamba/SubspaceDecoder_mla0-0-0

Updated 27 days ago

kokolamba/SubspaceDecoder_mla0-0-192

Updated 27 days ago • 33

Abdulhakeem Adefioye

AI & ML interests

Recent Activity

Organizations

kokolamba's activity

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity