PRO plan for running huge models on the free Inference API?

Hi,

I was curious about whether the pro plan would enable me to do the following:

curl /static-proxy?url=https%3A%2F%2Fapi-inference.huggingface.co%2Fmodels%2FHuggingFaceH4%2Fstarchat-alpha%3C%2Fa%3E
-X POST
-d ‘{“inputs”: "Can you please let us know more details about your "}’
-H “Authorization: Bearer xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx”

As I am currently getting:

```
{"error":"The model HuggingFaceH4/starchat-alpha is too large to be loaded automatically (31GB > 10GB)…}
```

Best,
Jean Elbers

Hi @jelber2,

This is a really large model, so you may need dedicated hardware. I recommend looking at our Inference Endpoints - Hugging Face service and reaching out if you need help. Thanks!
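For reference, a deployed Inference Endpoint is queried with the same request shape as the hosted API above. A minimal Python sketch, assuming a hypothetical endpoint URL and token (replace both with your own values):

```python
import json

# Hypothetical values -- substitute your deployed endpoint URL and HF token.
API_URL = "https://YOUR-ENDPOINT.endpoints.huggingface.cloud"
API_TOKEN = "hf_xxx"


def build_request(prompt: str):
    """Build the headers and JSON body for a text-generation request."""
    headers = {
        "Authorization": f"Bearer {API_TOKEN}",
        "Content-Type": "application/json",
    }
    body = json.dumps({"inputs": prompt})
    return headers, body


headers, body = build_request("Can you please let us know more details about your ")
# To actually send the request:
#   import requests
#   response = requests.post(API_URL, headers=headers, data=body)
#   print(response.json())
```

The payload format (`{"inputs": ...}`) matches the curl call in the original question; only the URL and authentication target change when moving from the free API to a dedicated endpoint.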