Article: mem-agent: Equipping LLM Agents with Memory Using RL • By driaforall and 1 other • 15 days ago • 30
Unsloth 4-bit Dynamic Quants Collection Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using <10% more VRAM than BnB 4-bit • 28 items • Updated 22 days ago • 87
llama.vim Collection Recommended models for the llama.vim and llama.vscode plugins • 10 items • Updated Aug 20 • 51
Gemma 3 QAT Collection Quantization-Aware Trained (QAT) Gemma 3 checkpoints. These models preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10 • 209
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 46 items • Updated Sep 10 • 131
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 48
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 releases • 15 items • Updated Dec 6, 2024 • 638
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated Apr 18 • 239
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27, 2024 • 154
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 859