Update README.md
Browse files
README.md
CHANGED
|
@@ -29,6 +29,8 @@ MolmoAct is a fully open-source action reasoning model for robotic manipulation
|
|
| 29 |
|
| 30 |
This checkpoint is a **preview** of the MolmoAct release. All artifacts used in creating MolmoAct (data, training code, evaluations, intermediate checkpoints) will be made available at a later date, furthering our commitment to open-source AI development and reproducibility.
|
| 31 |
|
|
|
|
|
|
|
| 32 |
Quick links:
|
| 33 |
- π [All Models](https://huggingface.co/collections/allenai/molmoact-689697591a3936fba38174d7)
|
| 34 |
- π [All Data](https://huggingface.co/collections/allenai/molmoact-data-mixture-6897e583e13b6c2cf3ea2b80)
|
|
@@ -62,7 +64,7 @@ ckpt = "allenai/MolmoAct-7B-O-0812"
|
|
| 62 |
processor = AutoProcessor.from_pretrained(
|
| 63 |
ckpt,
|
| 64 |
trust_remote_code=True,
|
| 65 |
-
torch_dtype="
|
| 66 |
device_map="auto",
|
| 67 |
padding_side="left",
|
| 68 |
)
|
|
@@ -71,7 +73,7 @@ processor = AutoProcessor.from_pretrained(
|
|
| 71 |
model = AutoModelForImageTextToText.from_pretrained(
|
| 72 |
ckpt,
|
| 73 |
trust_remote_code=True,
|
| 74 |
-
torch_dtype="
|
| 75 |
device_map="auto",
|
| 76 |
)
|
| 77 |
|
|
|
|
| 29 |
|
| 30 |
This checkpoint is a **preview** of the MolmoAct release. All artifacts used in creating MolmoAct (data, training code, evaluations, intermediate checkpoints) will be made available at a later date, furthering our commitment to open-source AI development and reproducibility.
|
| 31 |
|
| 32 |
+
**Update:** Checkpoints are now stored in FP32 (previously BF16). The model was trained in FP32, so publishing FP32 weights aligns with training and enables fine-tuning or continued training directly from this repo. For inference, you can still run BF16 by casting at load, which is what we did for evaluations. See more in the [instructions](#quick-start) below.
|
| 33 |
+
|
| 34 |
Quick links:
|
| 35 |
- π [All Models](https://huggingface.co/collections/allenai/molmoact-689697591a3936fba38174d7)
|
| 36 |
- π [All Data](https://huggingface.co/collections/allenai/molmoact-data-mixture-6897e583e13b6c2cf3ea2b80)
|
|
|
|
| 64 |
processor = AutoProcessor.from_pretrained(
|
| 65 |
ckpt,
|
| 66 |
trust_remote_code=True,
|
| 67 |
+
torch_dtype="bfloat16",
|
| 68 |
device_map="auto",
|
| 69 |
padding_side="left",
|
| 70 |
)
|
|
|
|
| 73 |
model = AutoModelForImageTextToText.from_pretrained(
|
| 74 |
ckpt,
|
| 75 |
trust_remote_code=True,
|
| 76 |
+
torch_dtype="bfloat16",
|
| 77 |
device_map="auto",
|
| 78 |
)
|
| 79 |
|