[NEW] HuggingChat Omni

#764
by victor - opened
Hugging Chat org
β€’
edited 18 days ago

Introducing: HuggingChat Omni πŸ’«

HuggingChat

HuggingChat returns and it's smarter and faster than ever πŸš€

Stop picking models. Start chatting.

Available now for all Hugging Face users. Free users can use their inference credits, PRO users get 20x more credits to use.

🧭 Omni: the new default routing model

When you send a message, Omni analyzes what you need and routes you to the best model for that specific task.
Each route uses the best model for its task. You see which model handled your request while it streams.

πŸ“Š Examples

What you ask Route Model
"Help me decide between two job offers. One pays 20% more but requires relocation." decision_support deepseek-ai/DeepSeek-R1-0528
"Create a React component for an image carousel with lazy loading" code_generation Qwen/Qwen3-Coder-480B-A35B-Instruct
"Write a short mystery story set in a lighthouse during a storm" creative_writing moonshotai/Kimi-K2-Instruct-0905
"Translate this to French: The meeting has been rescheduled to next Tuesday" translation CohereLabs/command-a-translate-08-2025

βš™οΈ Under the hood

Omni uses a policy-based routing system. Each route has:

  • A clear description of what it handles
  • A primary model best suited for that task
  • Fallback models if the primary is unavailable

The router model analyzes your conversation and picks the matching route. Fast (10 second timeout) and runs on every message. Credits to Katanemo for their routing model: katanemo/Arch-Router-1.5B

✨ What else is new

  • Background generation tracking: Multiple conversations can generate at the same time. Switch between tabs and the app tracks what's still generating. Updates appear automatically when responses finish.
  • Better streaming: Text renders faster and smoother. The app only updates what changed instead of re-rendering everything. Less flickering, especially in long responses with code blocks.
  • Better UX: UX was refined throughout the app. Fewer bugs and rough edges. Preview for code, beautiful streaming and more polish and attention to detail everywhere.
  • Speed optimizations: Sessions stay active longer with automatic token refresh. Response times improved across the board. The whole app feels faster.

πŸ› οΈ Run it yourself

HuggingChat is of course still 100% open source. It has never been easier to self-host your own instance.

Quick setup:

git clone https://github.com/huggingface/chat-ui
cd chat-ui
npm install
npm run dev

Only 3 env variables to set to get it working in .env:

  • MONGODB_URL - Your MongoDB connection
  • OPENAI_API_KEY - Your API key
  • OPENAI_BASE_URL - Your endpoint URL

You can also configure your own routes in a JSON file. Each route defines which models to use for specific tasks.

Check out the repo: github.com/huggingface/chat-ui

Hope you are as excited as we are about HuggingChat Omni! Please share your feedback and ideas in this thread πŸ€—

victor pinned discussion

Is it possible to import my conversations from the previous version of HuggingChat?

Yeah this dumbing down the system was totally worth nuking everyone's logs and assistants...? The performance improvements are nice if true, but how can you call this a better UX when so many basic features are missing from the last version? Even simple settings are gone, like no options to delete or edit output? There isn't even a way to tweak temperature/repetition minimizing settings, or give different chats different system prompts??

wow, I'm kind of surprised it's back. feels like a tad bit of a downgrade, but I'm assuming that it was a complete rework? hoping that more QoL features will be reintroduced again.

Does the platform now impose usage limits based on inference credits for free users?

we're so back

edit:
nevermind, cant delete the conversation branch like before😒

edit 2:
and it now has a limit. Its been over six hours and i still cant continue the conversation 😭

Thanks for getting this running

❌ Can't use assistants
❌ Can't generate images
❌ Can't edit conversations
❌ Can't search the web
❌ Can't change temperature
❌ Can't import your old conversations
βœ… You now have to pay to use it πŸ˜‚

So I posted this on Reddit but I thought I would share it here too in case anyone is thinking of getting the subscription.

I paid for it because I thought fuck it I'll just get it and see so here is what I found out.

For the Β£2 inference you get about 205 requests you can make on HuggingChat and the cost for each is 1p which isn't bad at all and if you're not someone like me who has the impulse control of a fruit fly and way too much free time then's good for an ADHD brain you could very easily last maybe not the whole month but close before reaching that limit.

Here's what I found about the models I tried out well the ones which would work for me as my stories are lightish NSFW ones because my characters are adults in a modern fantasy world so stuff may happen and if it does I don't want to get told off anyway. lol

The models I tried were 8 of them however only about 5 of them worked for what I wanted and the responses were very fast, the AI was really good at understanding the information I had given and even remembering information from a lot of requests ago which is really good if you're using it for a story.

For example: the model deepseek-ai/DeepSeek-V3.1-Terminus I was using before I reached my Β£2 limit I had requested 74 responses in one chat and it was still remembering information from the very first response.

As for bugs or weird text as far as I could tell there was only really a handful of times anything happened for me and that was a few rare times there would be tiny little bits of text that looked like a foreign language if that makes sense and the other was just if a model could only do so many requests before it started having trouble.

Now for going over your limit I did it by 1p without realizing as there is no pop up or anything to say hey you're done but I won't have to pay for that until my subscription is renewed on november 22 meaning I could if I wanted to keep using HuggingChat only the cost won't be paid for a month which honestly I'm not a big fan of the fact that you can keep using it past the point your Β£2 runs out becuase even if you don't have to pay if it right now you do eventually.

Also I tried the Zero GPU but honestly someone else would have to tell you if it's any good because it's mostly AI for images, videos you know that kind of thing which I very rarely use and if I do it's just to see what clothes may look like if I'm playing an interactive novel and yes there were AI text generators but a lot of them just didn't work or could only handle very small prompts.

So yeah overall even if you're paying for the Β£9 subscription just to use HuggingChat I do still think it's worth it because as I said the AI's a hell lot better than it used to be and the bugs are pretty much not there anymore.

Ps.

With that being said I do think there should be a subscription that is just for HuggingChat because a lot of the features you are paying for if you're like me don't need or even want.

And another thing while I can understand why it only refreshes monthly not daily because there is no why you can use it all in one day, I do think it should refresh once a week or halfway through the month.

I'm kinda dumb, but where are my chats? Are they like gone? I didn't see a option to export it while HC was down.

Import please? that's the only reason why I still return to this...

Please add back the message deletion feature, sometimes it feels gets too much messages and gets lost easily

seems like it. it's been over a day now since i've reached my limit, i still cant chat

Please consider upgrading to PRO!

no thanks, i'll stick with Shapes Inc for now

This comment has been hidden (marked as Spam)

Making advertising for another ai on the official hugging chat forum is kinda dirty if you ask me

I'm kinda dumb, but where are my chats? Are they like gone? I didn't see a option to export it while HC was down.

Yes, your chats are gone. There was an option to export for a couple weeks after chat went down, then they were deleted.

Sign up or log in to comment