Hello everyone,
I downloaded a few datasets and quickly ran out of disk space…
I was wondering whether I still need those downloaded files for later use of the datasets, or whether I can delete their contents to free up some space.
My datasets directory layout on disk is as follows:

I am asking specifically about the ./downloads directory, which consists of files such as:
Edit:
I’m mostly running the run_mlm.py script using those downloaded datasets.
If I use the flag --dataset_cache_directory and point it at the dataset's directory (which is located under my .cache directory), do I still need the contents of the ./downloads directory?
I would appreciate any help on the topic.
