Hugging Face
AI2 Adapt Dev
community
Followers: 68
AI & ML interests
Open science can (maybe) save the world
Recent Activity
DongfuJiang authored a paper • 21 days ago • Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
DongfuJiang authored a paper • 21 days ago • VideoScore2: Think before You Score in Generative Video Evaluation
DongfuJiang authored a paper • about 2 months ago • VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
Team members: 40
ai2-adapt-dev's models (13)
ai2-adapt-dev/qwen_2_math_sft_4k • Updated Mar 4 • 1
ai2-adapt-dev/s1k_seq_orig_hyper__42__1740446762 • Updated Mar 3 • 1
ai2-adapt-dev/qwen-2-math-sft-long • Updated Mar 3 • 3
ai2-adapt-dev/tulu_3_long_finetune_qwen_7b_reg • Updated Feb 24 • 46
ai2-adapt-dev/s1_1k_finetune_qwen_7b_reg_with_tulu3 • Updated Feb 24 • 2
ai2-adapt-dev/qwen2_7b_s1_1k • Updated Feb 24 • 1
ai2-adapt-dev/tulu-3-multipref-dpo-gpt4 • Updated Feb 6 • 1
ai2-adapt-dev/tulu-3-multipref-dpo-human • Updated Feb 6 • 1
ai2-adapt-dev/llama-3.1-8b-resized • Text Generation • 8B • Updated Oct 29, 2024 • 2
ai2-adapt-dev/instruction-tagger-llama3-8b • Updated Sep 15, 2024
ai2-adapt-dev/olmo-7b-peteish7-step928646 • Text Generation • 7B • Updated Aug 22, 2024 • 1
ai2-adapt-dev/peteish7-step928646 • Text Generation • 7B • Updated Aug 22, 2024
ai2-adapt-dev/peteish7-stepstep928646 • Updated Aug 22, 2024