Hugging Face
Ik-hwan Kim
12kimih
0 followers · 11 following
https://github.com/12kimih
12kimih
ik-hwan-kim-083419330
AI & ML interests
Large Language Models, Reinforcement Learning, Multimodal AI, AI Agents, Mechanistic Interpretability
Recent Activity
updated a dataset 4 days ago: 12kimih/r1qa-revised-rollouts
updated a model 6 days ago: 12kimih/Qwen3-4B-r1qa-v1
published a model 6 days ago: 12kimih/Qwen3-4B-r1qa-v1
Organizations
None yet
models (4)
12kimih/Qwen3-4B-r1qa-v1 • Text Generation • 4B • Updated 6 days ago • 27
12kimih/Qwen3-0.6B-r1qa-grpo-v0 • Text Generation • 0.6B • Updated 23 days ago • 29
12kimih/Qwen3-0.6B-r1qa-gpt-oss-v0 • Text Generation • 0.6B • Updated 23 days ago • 15
12kimih/Llama-3.2-3B-HiCUPID • Updated Jun 3
datasets (7)
12kimih/r1qa-revised-rollouts • Viewer • Updated 4 days ago • 99.7k • 30
12kimih/r1qa-raw-rollouts • Viewer • Updated 8 days ago • 99.7k • 103
12kimih/r1qa-guided-rollouts • Viewer • Updated 20 days ago • 1.08M • 217
12kimih/r1qa-benchmarks • Viewer • Updated Oct 14 • 300k • 87
12kimih/r1qa-clip-and-guide-using-Qwen3-8B • Viewer • Updated Sep 12 • 2.97k • 21
12kimih/r1qa-clip-with-perplexity • Viewer • Updated Sep 9 • 2.97k • 34
12kimih/HiCUPID • Viewer • Updated Jun 3 • 918k • 134