Chao Zhou

ASHIDAKA

AI & ML interests

Object Detection, Transformer

Recent Activity

liked a model 2 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

upvoted a paper 24 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

Organizations

None yet

liked a model 2 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated 3 days ago • 352k • 207

upvoted a paper 24 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 25 days ago • 234

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.69k

The secrets to building world-class LLMs

upvoted 2 papers 2 months ago

VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator

Paper • 2510.13454 • Published Oct 15 • 8

Learning an Image Editing Model without Image Editing Pairs

Paper • 2510.14978 • Published Oct 16 • 8

liked a model 3 months ago

lightx2v/Wan2.1-T2V-14B-StepDistill-CfgDistill

Text-to-Video • Updated Oct 17 • 128

upvoted a paper 3 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24 • 98

liked 3 models 4 months ago

upvoted a paper 5 months ago

SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation

Paper • 2505.19151 • Published May 25 • 2

liked a model 5 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26 • 3.81M • 4.28k

upvoted a paper 5 months ago

RecGPT Technical Report

Paper • 2507.22879 • Published Jul 30 • 37

New activity in Kratos-AI/KAI_handwriting-ocr 5 months ago

Botted likes

👍 2

#1 opened 5 months ago by Delta-Vector

liked a model 5 months ago

Wan-AI/Wan2.2-I2V-A14B

Image-to-Video • Updated Aug 7 • 10.4k • 546

liked a model 6 months ago

ibm-ai-platform/llama3-8b-accelerator

3B • Updated May 15, 2024 • 196 • 18

liked a Space 8 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 9 months ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Oct 30 • 77

liked a dataset 9 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 4.78k • 618

upvoted an article 10 months ago

Open R1: Update #3

Article • Mar 11 • 296