KOTA SASATANI's picture

KOTA SASATANI

sasa2000

·

AI & ML interests

None yet

Recent Activity

liked a model about 2 hours ago

slseanwu/MIDI-LLM_Llama-3.2-1B

upvoted a paper about 2 hours ago

Virtual Width Networks

liked a model 2 days ago

OpenMOSE/Qwen3-VL-REAP-145B-A22B

View all activity

Organizations

None yet

upvoted a paper about 2 hours ago

Virtual Width Networks

Paper • 2511.11238 • Published 11 days ago • 35

upvoted a paper 11 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 14 days ago • 180

upvoted a collection 14 days ago

Pre-training Dataset Samples

A collection of pre-training datasets samples of sizes 10M, 100M and 1B tokens. Ideal for use in quick experimentation and ablations. • 19 items • Updated 14 days ago • 14

upvoted a paper 15 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 19 days ago • 201

upvoted a collection 17 days ago

Apriel-H1

Introducing Apriel-H1 hybrids each blending Attention and Mamba State Space layers in varying proportions. • 8 items • Updated 20 days ago • 7

upvoted a paper 18 days ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published 26 days ago • 80

upvoted a collection 28 days ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 17 items • Updated 8 days ago • 45

upvoted a paper 2 months ago

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29 • 94

upvoted a collection 2 months ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 103

upvoted a collection 3 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 174

upvoted an article 3 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

723

upvoted a collection 5 months ago

GLM-4.1V-Thinking

5 items • Updated Jul 2 • 57