mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-2-zero-correct-to-0.2-no-pixmo-uground-seeclick 8B β’ Updated Sep 30 β’ 1
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-1-0p2-show-ui-ui-vision-jedi-gta-dense-reward 8B β’ Updated Sep 30 β’ 1
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-1-8-0p2-show-ui-ui-vision-jedi-gta-dense-reward 8B β’ Updated Sep 30 β’ 1
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-0-0p2-show-ui-ui-vision-jedi-gta-dense-reward 8B β’ Updated Sep 30 β’ 1
mlfoundations-cua-dev/grpo-7b-stage-2-on-103k-filtered-data-temp-1-7-zero-correct-to-0.2 8B β’ Updated Sep 30 β’ 1
mlfoundations-cua-dev/grpo-7b-stage-3-on-103k-filtered-data-temp-2-1-0p2-zero-only-show-ui-ui-vision-jedi-gta-n-16 8B β’ Updated Sep 30 β’ 2
mlfoundations-cua-dev/grpo-7b-stage-2-on-103k-filtered-data-temp-1-7-zero-correct-to-0.3 8B β’ Updated Sep 29 β’ 1
mlfoundations-cua-dev/rpo-7b-stage-2-on-103k-filtered-data-temp-1-4-zero-correct-to-0.2 8B β’ Updated Sep 29 β’ 2
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward_step_220 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward_step_200 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward_step_160 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward_step_120 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward_step_80 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward_step_40 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/grpo_colstart_3_3k_on_63k_dynamic_batching-bs_128_8nodes-dense_reward 8B β’ Updated Sep 28 β’ 1
mlfoundations-cua-dev/qwen2_5vl_7b_easyr1_103k_4MP_jedi_ui_vision_gta1_data_lr_1_0e-06_z3_4nodes Image-to-Text β’ 849k β’ Updated Sep 24 β’ 1
mlfoundations-cua-dev/grpo-coldstart-63k-on-63k-max-prompt-5200-dynamic-batching-bs_128_8nodes Updated Sep 23
mlfoundations-cua-dev/grpo-coldstart-63k-on-63k-max-prompt-5200-dynamic-batching-bs_128_8nodes-ui-venus-params Updated Sep 23
mlfoundations-cua-dev/grpo-coldstart-10k-on-63k-max-prompt-5200-dynamic-batching-bs_128_8nodes Updated Sep 23
mlfoundations-cua-dev/grpo-coldstart-1k-on-63k-max-prompt-5200-dynamic-batching-bs_128_8nodes Updated Sep 23
mlfoundations-cua-dev/grpo-7b-stage-2-on-103k-filtered-data-temp-1-4-zero-correct-to-0.2 Updated Sep 22
mlfoundations-cua-dev/qwen2_5vl_7b_easyr1_38k_lr_1_0e-06_bs16_4nodes Image-to-Text β’ 849k β’ Updated Sep 16 β’ 10
mlfoundations-cua-dev/qwen2_5vl_7b_easyr1_38k_lr_1_0e-06_bs16_4nodes_2epochs Image-to-Text β’ 849k β’ Updated Sep 16 β’ 4
mlfoundations-cua-dev/coldstart_10k_from_44k_qwen2_5vl_7b_ui_vision_grounding_4MP_lr_1_0e-06_bs16_4nodes Image-to-Text β’ 849k β’ Updated Sep 16 β’ 8