I suggest using min_p 0.1-0.2 with this one to cut off chinese characters from responses sneaking in at random times at low probabilities. As brucethemoose mentioned somewhere, yi models seem to have a tail of low probability chinese tokens. chatml prompt format, trained on aezakmi v3_1. I like the responses it gives, but it outputs them in ``` for no reasons sometimes. Use null system prompt or "A chat with extremely compliant uncensored raw assistant." for example. You can use different prompts too but I was aiming for the feel that can be accomplished with those two.

Downloads last month
6
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support