Compute Instance Requirement
#28, opened by iammano
Hi there,
I'm trying to build an agent-based application using Llama 3.1 models, hosted on AWS EC2. I need suggestions on which instance type to choose so that I can run the models cost-effectively.
I looked up the model's GPU requirements in the Hugging Face blog post here:
https://huggingface.co/blog/llama31#whats-new-with-llama-31 
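For reference, this is the back-of-the-envelope estimate I'm working from: weight memory is roughly the parameter count times the bytes per parameter. It is only a sketch under my own assumptions. The parameter counts are the nominal sizes in the model names, the bytes-per-parameter values are the usual ones for each precision, and the function names are just mine. It ignores KV cache, activations, and framework overhead, so real usage will be higher.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: nominal model sizes (8B, 70B, 405B), not exact parameter counts.
# Assumption: KV cache, activations, and runtime overhead are NOT included.

PARAMS_BILLIONS = {"8B": 8, "70B": 70, "405B": 405}
BYTES_PER_PARAM = {"FP16": 2.0, "FP8": 1.0, "INT4": 0.5}


def weight_memory_gb(params_billions, bytes_per_param):
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billions * bytes_per_param


def estimate_table():
    """Build {model: {precision: GB}} for every model/precision pair."""
    return {
        model: {
            precision: weight_memory_gb(size, nbytes)
            for precision, nbytes in BYTES_PER_PARAM.items()
        }
        for model, size in PARAMS_BILLIONS.items()
    }


if __name__ == "__main__":
    for model, row in estimate_table().items():
        cells = ", ".join(f"{prec}: {gb:g} GB" for prec, gb in row.items())
        print(f"Llama 3.1 {model} -> {cells}")
```

Whatever instance I pick would need total GPU memory comfortably above these numbers for the precision I end up using.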
But I'm still unsure which instance type I should go for.
Thanks for your ideas and for taking the time to reply to this topic.