Update README.md

README.md CHANGED

@@ -106,7 +106,15 @@ The Pile was deduplicated before being used to train Pile-T5.
 #### Training procedure
 
 Pile-T5 was trained with a batch size of approximately 1M tokens
-(2048 sequences of 512 tokens each), for a total of 2,000,000 steps.
+(2048 sequences of 512 tokens each), for a total of 2,000,000 steps. Pile-T5 was trained
+with the span-corruption objective.
+
+#### Training checkpoints
+
+Intermediate checkpoints for Pile-T5 are accessible within this repository.
+There are 200 checkpoints in total, spaced 10,000 steps apart. For T5x-native
+checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main).
+
 
 ### Evaluations
 
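The hunk above states that Pile-T5 was trained with the span-corruption objective at roughly 1M tokens per batch (2048 sequences × 512 tokens = 1,048,576 tokens). As a rough illustration of what span corruption does to a training example, here is a minimal sketch; the 15% noise density, mean span length of 3, and `<extra_id_N>` sentinel naming follow the original T5 defaults and are assumptions here, not values confirmed by this commit.

```python
import random

def span_corrupt(tokens, noise_density=0.15, mean_span_len=3):
    """Toy span corruption: hide random spans behind sentinel tokens.

    The encoder sees the corrupted input; the decoder is trained to
    emit the dropped spans, each prefixed by its sentinel. Defaults
    follow T5 (15% noise, mean span 3) -- assumed, not confirmed here.
    """
    n_masked = max(1, round(len(tokens) * noise_density))
    n_spans = max(1, round(n_masked / mean_span_len))
    starts = sorted(random.sample(range(len(tokens)), n_spans))
    inputs, targets, cursor, sid = [], [], 0, 0
    for s in starts:
        if s < cursor:                  # skip spans overlapping the previous one
            continue
        inputs.extend(tokens[cursor:s])          # keep tokens up to the span
        sentinel = f"<extra_id_{sid}>"
        inputs.append(sentinel)                  # sentinel replaces the span
        targets.append(sentinel)                 # target: sentinel + hidden span
        targets.extend(tokens[s:s + mean_span_len])
        cursor = s + mean_span_len
        sid += 1
    inputs.extend(tokens[cursor:])
    targets.append(f"<extra_id_{sid}>")          # closing sentinel, as in T5
    return inputs, targets

example = "the quick brown fox jumps over the lazy dog".split()
inp, tgt = span_corrupt(example)
print("encoder input :", " ".join(inp))
print("decoder target:", " ".join(tgt))
```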
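The new "Training checkpoints" section says the repository hosts 200 intermediate checkpoints at 10,000-step intervals. A minimal sketch of pulling one such checkpoint with the `transformers` library follows; the repository id and the `step{N}` revision naming are assumptions for illustration (check the repository's branch list for the actual scheme). The T5x-native checkpoints linked above are a separate format intended for finetuning with the T5x library itself.

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Hypothetical repo id and revision name -- the actual repository and
# branch naming for the 200 intermediate checkpoints may differ.
REPO = "EleutherAI/pile-t5-base"
STEP = 10_000

tokenizer = AutoTokenizer.from_pretrained(REPO)
model = AutoModelForSeq2SeqLM.from_pretrained(REPO, revision=f"step{STEP}")
print(f"loaded checkpoint at step {STEP}: {model.config.model_type}")
```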