50 tokens/second on M4 air 8-GPU . Beats out gemma and chatgpt-oss in quality!
· Sign up or log in to comment