[NEW] HuggingChat Omni
Introducing: HuggingChat Omni 💫
HuggingChat returns and it's smarter and faster than ever
Stop picking models. Start chatting.
- 115+ available models - https://huggingface.co/chat/models
- 15+ providers available - powered by Hugging Face Inference Providers.
- One chat interface: HuggingChat
Available now for all Hugging Face users. Free users can use their inference credits, PRO users get 20x more credits to use.
🧠 Omni: the new default routing model
When you send a message, Omni analyzes what you need and routes you to the best model for that specific task.
Each route uses the best model for its task. You see which model handled your request while it streams.
Examples
| What you ask | Route | Model |
|---|---|---|
| "Help me decide between two job offers. One pays 20% more but requires relocation." | decision_support | deepseek-ai/DeepSeek-R1-0528 |
| "Create a React component for an image carousel with lazy loading" | code_generation | Qwen/Qwen3-Coder-480B-A35B-Instruct |
| "Write a short mystery story set in a lighthouse during a storm" | creative_writing | moonshotai/Kimi-K2-Instruct-0905 |
| "Translate this to French: The meeting has been rescheduled to next Tuesday" | translation | CohereLabs/command-a-translate-08-2025 |
⚙️ Under the hood
Omni uses a policy-based routing system. Each route has:
- A clear description of what it handles
- A primary model best suited for that task
- Fallback models if the primary is unavailable
The router model analyzes your conversation and picks the matching route. It is fast (10-second timeout) and runs on every message. Credits to Katanemo for their routing model: katanemo/Arch-Router-1.5B
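To make the primary/fallback idea concrete, here is a minimal Python sketch. It is not the actual chat-ui implementation: the `Route` class, its field names, the `pick_model` helper, the route description, and the fallback model chosen here are all assumptions for illustration. The step where the router model chooses a route is taken as given and not shown.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Route:
    """One routing policy entry (field names are illustrative assumptions)."""
    name: str
    description: str
    primary_model: str
    fallback_models: List[str] = field(default_factory=list)


def pick_model(route: Route, available: Set[str]) -> Optional[str]:
    """Return the primary model if available, else the first available fallback."""
    for model in [route.primary_model, *route.fallback_models]:
        if model in available:
            return model
    return None  # nothing in this route is reachable


code_route = Route(
    name="code_generation",
    description="Writing or editing source code",  # hypothetical wording
    primary_model="Qwen/Qwen3-Coder-480B-A35B-Instruct",
    fallback_models=["deepseek-ai/DeepSeek-R1-0528"],  # hypothetical fallback
)

# Pretend the primary is down and only the fallback is reachable.
available_now = {"deepseek-ai/DeepSeek-R1-0528"}
chosen = pick_model(code_route, available_now)
print(chosen)
```

Trying the primary first and then walking the fallbacks in order keeps the behavior predictable: the same route always resolves to the same model unless availability changes.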
✨ What else is new
- Background generation tracking: Multiple conversations can generate at the same time. Switch between tabs and the app tracks what's still generating. Updates appear automatically when responses finish.
- Better streaming: Text renders faster and smoother. The app only updates what changed instead of re-rendering everything. Less flickering, especially in long responses with code blocks.
- Better UX: The UX was refined throughout the app, with fewer bugs and rough edges. Code previews, smoother streaming, and more polish and attention to detail everywhere.
- Speed optimizations: Sessions stay active longer with automatic token refresh. Response times improved across the board. The whole app feels faster.
🛠️ Run it yourself
HuggingChat is of course still 100% open source. It has never been easier to self-host your own instance.
Quick setup:
```bash
git clone https://github.com/huggingface/chat-ui
cd chat-ui
npm install
npm run dev
```
Only 3 env variables need to be set in .env to get it working:
- MONGODB_URL - Your MongoDB connection
- OPENAI_API_KEY - Your API key
- OPENAI_BASE_URL - Your endpoint URL
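Before running the dev server, it can help to confirm that all three variables are present. The following is a small Python sketch, not part of chat-ui: `parse_env` and `missing_vars` are hypothetical helpers, the parser only handles simple KEY=VALUE lines, and the sample values are placeholders rather than real settings.

```python
REQUIRED_VARS = ("MONGODB_URL", "OPENAI_API_KEY", "OPENAI_BASE_URL")


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")  # split on the first "=" only
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_vars(text):
    """Return the required variable names that are absent or empty."""
    values = parse_env(text)
    return [name for name in REQUIRED_VARS if not values.get(name)]


# Placeholder .env content for illustration, not real credentials.
sample = """
# local development
MONGODB_URL=mongodb://localhost:27017
OPENAI_API_KEY=
"""
print(missing_vars(sample))
```

To check a real file, pass its contents instead of `sample`, for example `missing_vars(open(".env").read())`.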
You can also configure your own routes in a JSON file. Each route defines which models to use for specific tasks.
Check out the repo: github.com/huggingface/chat-ui
Hope you are as excited as we are about HuggingChat Omni! Please share your feedback and ideas in this thread 🤗
Is it possible to import my conversations from the previous version of HuggingChat?
Yeah this dumbing down the system was totally worth nuking everyone's logs and assistants...? The performance improvements are nice if true, but how can you call this a better UX when so many basic features are missing from the last version? Even simple settings are gone, like no options to delete or edit output? There isn't even a way to tweak temperature/repetition minimizing settings, or give different chats different system prompts??
wow, I'm kind of surprised it's back. feels like a tad bit of a downgrade, but I'm assuming that it was a complete rework? hoping that more QoL features will be reintroduced again.
Does the platform now impose usage limits based on inference credits for free users?
we're so back
edit:
nevermind, can't delete the conversation branch like before 😢
edit 2:
and it now has a limit. It's been over six hours and I still can't continue the conversation
Thanks for getting this running
tried it out a lot, seems pretty much good as new. new UI looks different for sure. works well, haven't had it crash at all and haven't seen any of the weird rendering bugs that were so prevalent in the old version. considering that there's a lot missing, I think it's probably a thing where they're going to reintroduce all of the old stuff again over time. the old version took almost two years (23 months if I recall) to become a sorta-final product.
Thanks for your feedback everyone <3
Is it possible to import my conversations from the previous version of HuggingChat?
Not really but we can work on it if a lot of people want it.
Can't edit conversations
Mhhh you can (or I'm missing something)
Can't change temperature
Can't search the web
Can't generate images
We plan to add this. For the rest I'm not sure yet, but Assistants may come back (and they will be better).
You now have to pay to use it
There's a free tier. It's way faster and more stable now, and the PRO subscription is a very nice deal: https://huggingface.co/pro. Also, paid requests are very cheap for most models. Example from my https://huggingface.co/settings/inference-providers/overview: 1840 requests with Qwen/Qwen3-235B-A22B-Instruct-2507 = $0.65
tried it out a lot, seems pretty much good as new. new UI looks different for sure. works well, haven't had it crash at all and haven't seen any of the weird rendering bugs that were so prevalent in the old version.
Thanks that's motivating and yes we'll make it awesome with faster iteration rate than the 1st version!
What about the missing feature for deleting a branch of the conversation? I used that very often to keep things tidy.
Anyone know which model is the best for uncensored stuff, one of the Cohere variants or something else?
@erictheechidna I can see CohereLabs's models there, I am not sure that it's the best for uncensored stuff. Why do you need a model from the team then? I downloaded and tried, it said: I'm sorry, but I can't assist with that kind of explicit content...
Can't edit conversations
Mhhh you can (or I'm missing something)
Cool, so how do you edit the output text?
It's a shameful imitation of the old chat, and now you have to pay for it. It's clear why they closed the previous one: the free trial ended and now you have to pay.
Pretty happy with Omni for the limited time I've spent with it so far.
I have some questions before I get too invested:
- Has the privacy policy changed? I don't see the explicit mention of conversations never being shared with the model owners. Is HuggingChat Omni still private?
- Is there a way to see how much you can use HuggingChat before you have to pay for it? Does it refresh hourly, daily, or what? There absolutely must be a way to use this indefinitely for free or it's not worth using, since I'm never going to be a paid user.
- Is the Omni routing model also private?
Thank you
So when the free trial ends, that's it? You can't use it anymore?
It's clear why they closed the previous one: the free trial ended and now you have to pay.
So when the free trial ends, that's it? You can't use it anymore?
I get your concerns about free usage (for now you need to be a PRO user to utilize the pay-as-you-go feature, but it should be quite cheap for normal usage). We're open to raising the limit. But before we do, we need to find the right balance: the previous version was hammered by abuse that nearly broke the service. We're committed to keeping the doors open for everyone, but we need a guardrail that keeps the lights on.
Is there a way to see how much you can use HuggingChat before you have to pay for it?
Yes there's a way here: https://huggingface.co/settings/billing (in the inference usage section)
In the earlier version of HuggingChat, there was a visible toggle to opt out of data sharing and model training so that user conversations werenβt used for model improvement, is that toggle switch still there?
Has the privacy policy changed? I don't see the explicit mention of conversations never being shared with the model owners. Is HuggingChat Omni still private?
Regarding privacy, HuggingChat now uses external providers (HF Inference Providers), so privacy will depend on which providers you are using. The good news is that you can choose which ones you want to use by toggling them in your Inference Settings.
Cool, so how do you edit the output text?
Are you sure this was possible? (btw you can from the inference playground)
I think many features demanded here might be introduced by HF later, so I summarise them for HF (probably victor?):
- Import previous chats (those we had before the sudden shutdown)
- Assistants (importation, creation, and sharing)
- Tools from community
- Image, Web search, and temperature tweaking
The following are from me, but I guess folks will have similar thoughts:
- More transparent pricing
  I've read the API provider & model page, but some prices are missing (denoted only with a - symbol, including HF SmolLM, Cohere, and Zai).
  I wonder whether that means free, or just price-unavailable.
  Otherwise we may run out of our free quotas easily if we're misled.
  Edit: I guess it's price-unavailable, since I tried command a from Cohere and I've been charged.
- Custom routing logic, e.g.
  free first, or ranked by pricing
  (I personally wanted this :)
  or ranked by model type, popularity, date of release, etc.
- It would be good if we could set certain providers for specific models, e.g. Fireworks for DeepSeek V3, Together for QwQ, or Groq for Kimi K2
- Can we bring our own API keys other than your provider partners' ones? e.g. OpenRouter or LLM gateway
- Set our preferred model as default, or give it a shortcut
And I have a personal opinion about abuse-prevention: setting a rational rate limit and reset it after a certain period, just like OpenAI or Anthropic did.
For heavy users, maybe HF pro or other subscription plans can help.
Thanks for your reading.
I'm here because I love open source and privacy so much, while HuggingChat is the most ideal choice.
And I have a personal opinion about abuse-prevention: setting a rational rate limit and reset it after a certain period, just like OpenAI or Anthropic did.
For heavy users, maybe HF pro or other subscription plans can help.
Yeah, I totally agree, having a reasonable rate limit that resets periodically is way better than having one hard cap on overall usage that forces you to upgrade to pro once you hit it.
Great job on killing something that was once amazing from what i had heard.
We're open to raising the limit. But before we do we need to find the right balance: the previous version was hammered by abuse that nearly broke the service. We're committed to keeping the doors open for everyone, but we need a guardrail that keeps the lights on.
There's a pretty obvious solution: Daily caps. Let us send 75-100 messages a day. You can compare it to Grok, which has a 2-hour limit of 20 messages, which works out to 240 messages a day. 100 would be low but acceptable for most users.
Yes there's a way here: https://huggingface.co/settings/billing (in the inference usage section)
Yeah, this is terrible. I'm already almost done and if there's no increase I just won't come back.
Regarding privacy, HuggingChat now uses external providers (HF Inference Providers), so privacy will depend on which providers you are using. The good news is that you can choose which ones you want to use by toggling them in your Inference Settings.
We also need a way to block this completely. Having it be determined by the model providers, in a system that routes you to a provider it chooses without you having the ability to opt-out, is unacceptable.
As it is, this is so much worse than the old Hugging Chat that it's not worth using in its current state. I'll check back to see if/when an increased recurring cap and better privacy measures are implemented.
We also need a way to block this completely. Having it be determined by the model providers, in a system that routes you to a provider it chooses without you having the ability to opt-out, is unacceptable.
You can completely block the providers you don't want from your Inference Settings.
I remember using the old Hugging chat a lot. With this one I ran out of credits in seconds :(
Back to openrouter I suppose.
I'm curious to know what settings they are using for the models. On the old Huggingchat you used to be able to see the temperature etc for the model you were using.
Hey, welcome back. It's fine if you want to add a paid feature, and it's also fine if you want to make it a paid/free tier (basically a bait for paid services), but that won't be HuggingChat. HuggingChat was something else. It was like charity work. It gave us hope that not all AI providers seek money, and some can give you free service. It was perfect, every bit of it was. It had so many features, it had a strong privacy-friendly system, secure. It's fine if you want to make something like other AI competitors (OpenAI and others), it's totally fine. Your product, your rules, who am I to say anything? But I just want to thank you for the last 2 years. You made my life very easy. Because of HuggingChat, I was able to use AI without spending money. It was the freedom and openness of open source that we needed. It's fine that you now want to build something else. Maybe you will now get customers who are dedicated and willing to spend money. I am no longer a customer, as I am not willing to spend any money on my AI; I have other things to spend money on. Again, thanks for the last 2 years. I will always remember there was a great open-source AI host called HuggingChat. Cheers
So when the free trial ends, that's it? You can't use it anymore?
seems like it. it's been over a day now since i've reached my limit, i still cant chat
Please keep the free tier as interesting as possible to be able to extensively test the open models. Especially for new models and those which can not be run easily on mid to low consumer hardware.
Consider introducing a setting that lets users tune Omni routing beyond the currently hardcoded "best underlying model", with more options that I'm sure paying PRO users are also interested in, like:
- best performance per inference cost/credit
- fastest performance in the top tier models
Also, gradually saving users' preferences (based on model outputs) would be an interesting middle ground between hard-selecting a model and relying on HF Omni routing.
I'm just wondering: is the free usage limit a one-time thing, or does it refresh daily?
And if it is a daily thing, will we be able to just pick up where our story left off, or do we need to start a new chat every time?
With no way to refill for free, that'll just incentivise throwaway accounts. Prevent abuse by limiting out the expensive to run models and features, queue the inputs on a lower priority, and rate limit heavy users.
Also, that settings ui not showing fractions of a cent hurts the precision a lot.
waited a handful of days, seems like the free tier does not, in fact, reset. at least not daily. if it were to reset daily (or, preferably, have a system similar to the old HuggingChat) that would be a huge improvement. maybe an advertisement banner or something to offset the costs? I'm not sure, I'm not super knowledgeable on this, but being able to run through what I assume is a monthly limit in an afternoon is disappointing.
I'm a little disappointed that the UI doesn't have the same features that the old UI did, such as the ability to delete branches. I thought those would have remained.
Pro... considering it. After the old hugging chat closed, I moved to Mistral on the €15/m ($17 USD). So being able to get back what I've missed for $9 seems nice.
I get your concerns about free usage (for now you need to be a PRO user to utilize the pay-as-you-go feature, but it should be quite cheap for a normal usage). We're open to raising the limit. But before we do we need to find the right balance: the previous version was hammered by abuse that nearly broke the service. We're committed to keeping the doors open for everyone, but we need a guardrail that keeps the lights on.
Trust me when I say I get this. I became a bit of a heavy user on the old version. So, here's an idea. As said before, I've (currently) moved to Mistral and their system works like this:
- Free tier: daily limit of X number of messages (example: 100).
- Pro tier: limit is now every 3 hours. Number of messages has increased (example: 1000).
It would be really, really nice if you could do that here. This should at least give guard rails on free users but really benefit/reward those who subscribe. I do also prefer subscription over PAYG too.
Cool, so how do you edit the output text?
Are you sure this was possible
I do not recall this being possible. You could delete responses though. Which I feel is a key feature that needs to come back.
Oh, and seeing as how this has been relaunched - could all of the old discussions be closed now?
the pro version is worth it
sooo do the free tier reset or its a bait and switch ?
the pro version is worth it
I've been thinking of getting it but like how does it actually work because I won't lie I don't 100% understand what it was saying I would be paying for? especially when I saw something that said you would have to pay more if you went over a limit and honestly that scared the hell out of me.
I know this probably won't happen, but can we just have a subscription tier that's just for HuggingChat? I was looking at it again and maybe it's just me, but I don't need or even want almost all of the features you would be paying the £9 a month for. Don't get me wrong, I'm fine paying that much, as it's a lot less than other AI generators cost at their lowest tier, and also because it's really easy to use. I love the level of descriptive and detailed storytelling it produces, and best of all it doesn't give me a word count limit for what I can write for my prompt... I didn't think the prompt I use a lot was that long until I tried to use another AI generator, and it's almost 20,000 words long.
lol looks like it dont reset
i wouldn't mind if they refreshed with a decent bit of usage limit but if its a one time trial thing then its dead folks pack it up
Just LET ME reupload what was LOST and archived
it deletes my messages when it gives errors!!!!
Wow. Im no longer able to use this for free.
And there is officially no other chat bots that are unlimited out there.
I used to use the CoHere chatbot model a lot when it was free to use. :(
So I posted this on Reddit but I thought I would share it here too in case anyone is thinking of getting the subscription.
I paid for it because I thought fuck it I'll just get it and see so here is what I found out.
For the £2 of inference credits you get about 205 requests you can make on HuggingChat, and the cost for each is 1p, which isn't bad at all. If you're not someone like me, who has the impulse control of a fruit fly and way more free time than's good for an ADHD brain, you could very easily last, maybe not the whole month, but close before reaching that limit.
Here's what I found about the models I tried out well the ones which would work for me as my stories are lightish NSFW ones because my characters are adults in a modern fantasy world so stuff may happen and if it does I don't want to get told off anyway. lol
The models I tried were 8 of them however only about 5 of them worked for what I wanted and the responses were very fast, the AI was really good at understanding the information I had given and even remembering information from a lot of requests ago which is really good if you're using it for a story.
For example: with the model deepseek-ai/DeepSeek-V3.1-Terminus, which I was using before I reached my £2 limit, I had requested 74 responses in one chat and it was still remembering information from the very first response.
As for bugs or weird text as far as I could tell there was only really a handful of times anything happened for me and that was a few rare times there would be tiny little bits of text that looked like a foreign language if that makes sense and the other was just if a model could only do so many requests before it started having trouble.
Now for going over your limit: I did it by 1p without realizing, as there is no pop-up or anything to say hey, you're done. I won't have to pay for that until my subscription is renewed on November 22, meaning I could, if I wanted to, keep using HuggingChat, only the cost won't be paid for a month. Honestly I'm not a big fan of the fact that you can keep using it past the point your £2 runs out, because even if you don't have to pay for it right now, you do eventually.
Also I tried the Zero GPU but honestly someone else would have to tell you if it's any good because it's mostly AI for images, videos you know that kind of thing which I very rarely use and if I do it's just to see what clothes may look like if I'm playing an interactive novel and yes there were AI text generators but a lot of them just didn't work or could only handle very small prompts.
So yeah, overall, even if you're paying for the £9 subscription just to use HuggingChat, I do still think it's worth it because, as I said, the AI's a hell of a lot better than it used to be and the bugs are pretty much not there anymore.
Ps.
With that being said I do think there should be a subscription that is just for HuggingChat because a lot of the features you are paying for if you're like me don't need or even want.
And another thing: while I can understand why it only refreshes monthly and not daily, because there is no way you can use it all in one day, I do think it should refresh once a week or halfway through the month.
I'm kinda dumb, but where are my chats? Are they like gone? I didn't see a option to export it while HC was down.
Import please? that's the only reason why I still return to this...
Please add back the message deletion feature. Sometimes a chat gets too many messages and it's easy to get lost.
Making advertising for another ai on the official hugging chat forum is kinda dirty if you ask me
I'm kinda dumb, but where are my chats? Are they like gone? I didn't see a option to export it while HC was down.
Yes, your chats are gone. There was an option to export for a couple weeks after chat went down, then they were deleted.
Imagine using it like the old HuggingChat. You never can. You do have to pay to use it now because it is no longer unlimited like it used to be. For unlimited messages, this is not a good chat interface. I thought this was about open-source freedom, not the money.
The "CohereLabs-AyaExpanse" model seems to work well for my purposes, given i just porked Calamity Coyote, with no "nudging" or other tricks needed. Still doesn't solve the problem of limited credits per months, and whatever, but at least it's something, for now. Otherwise there's venice.ai which is completely uncensored, but also much lower quality.



