12 15 93

Anshuman Suri

iamgroot42

https://anshumansuri.com/

AI & ML interests

Privacy, Distribution Inference, Membership Inference

Recent Activity

liked a dataset 4 days ago

allenai/Dolci-Think-SFT-7B

liked a dataset 4 days ago

liweijiang/infinite-chats-human-absolute

liked a model 11 days ago

Salesforce/xRouter

View all activity

Organizations

upvoted 2 papers about 1 month ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 60

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Paper • 2309.15223 • Published Sep 26, 2023 • 22

upvoted a paper about 2 months ago

Simple Projection Variants Improve ColBERT Performance

Paper • 2510.12327 • Published Oct 14 • 5

upvoted 2 collections about 2 months ago

Chart-RVR

Collection

Models trained using GRPO for enhanced Chart Reasoning • 3 items • Updated Aug 24 • 1

Steering the CensorShip

Collection

3 items • Updated Sep 28 • 1

upvoted an article 3 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11

•

166

upvoted a paper 4 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 65

upvoted a paper 5 months ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17 • 13

upvoted 2 articles 5 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11

•

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

734

upvoted a paper 7 months ago

Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control

Paper • 2504.17130 • Published Apr 23 • 1

upvoted a paper 9 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

upvoted 2 papers 10 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 250

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 182

upvoted a paper about 1 year ago

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

Paper • 2402.09391 • Published Feb 14, 2024 • 2

Anshuman Suri

AI & ML interests

Recent Activity

Organizations

iamgroot42's activity

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

SmolLM3: smol, multilingual, long-context reasoner