Well, I tried the new DevQuasar Kwaipilot.KAT-Dev.Q8_0.gguf - LM Studio and Jan.ai error out

#9
by akierum - opened

Well, I tried the new DevQuasar Kwaipilot.KAT-Dev.Q8_0.gguf.
Now there are no more errors in Cline. Let's see if the output is worth it, as the previous version was way worse than Qwen3 Coder 30B.

Update:

LM Studio still errors out:
2025-10-08 23:50:30 [INFO] [LM STUDIO SERVER] Running chat completion on conversation with 30 messages.
2025-10-08 23:50:30 [INFO] [LM STUDIO SERVER] Streaming response...
2025-10-08 23:51:06 [ERROR] The model has crashed without additional information. (Exit code: 18446744072635812000). Error Data: n/a, Additional Data: n/a
2025-10-08 23:51:07 [INFO] [JIT] Requested model (kwaipilot.kat-dev) is not loaded. Loading "DevQuasar/Kwaipilot.KAT-Dev-GGUF/Kwaipilot.KAT-Dev.Q8_0.gguf" now...
2025-10-08 23:52:38 [INFO] [LM STUDIO SERVER] Running chat completion on conversation with 30 messages.
2025-10-08 23:52:38 [INFO] [LM STUDIO SERVER] Streaming response...
2025-10-08 23:56:07 [ERROR] The model has crashed without additional information. (Exit code: 18446744072635812000). Error Data: n/a, Additional Data: n/a
2025-10-08 23:56:09 [INFO] [JIT] Requested model (kwaipilot.kat-dev) is not loaded. Loading "DevQuasar/Kwaipilot.KAT-Dev-GGUF/Kwaipilot.KAT-Dev.Q8_0.gguf" now...
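
The exit code in the log above looks like a negative 32-bit Windows status code that was sign-extended to 64 bits and then rounded when it was printed. Here is a minimal sketch for narrowing it down. It assumes (this is not confirmed by LM Studio) that the logger held the code in an IEEE-754 double before printing it; the helper name `candidate_status_range` is made up for illustration.

```python
import math


def candidate_status_range(logged: int) -> tuple[int, int]:
    """Return the range of 32-bit status codes that could print as `logged`.

    Assumption: the value passed through a 64-bit float, so every integer
    within half a ULP of it would have been printed the same way.
    """
    as_double = float(logged)                 # nearest representable double
    center = int(as_double)
    half_ulp = int(math.ulp(as_double)) // 2  # half the spacing between doubles here
    low = (center - half_ulp) & 0xFFFFFFFF    # keep only the low 32 bits
    high = (center + half_ulp) & 0xFFFFFFFF
    return low, high


low, high = candidate_status_range(18446744072635812000)
print(f"0x{low:08X} .. 0x{high:08X}")
```

Under that assumption the candidates run from 0xC0000400 to 0xC0000C00. That range contains 0xC0000409 (STATUS_STACK_BUFFER_OVERRUN, which Windows also reports when a process aborts via fast-fail), so the crash would be consistent with the inference backend aborting. The log alone does not prove which code it was.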

Jan.ai also errors out:

Invalid API Response: The provider returned an empty or unparsable response. This is a provider-side issue where the model failed to generate valid output or returned tool calls that Cline cannot process. Retrying the request may help resolve this issue.

API Request Failed $0.0000

502 Proxy request to model failed: error sending request for url (http://127.0.0.1:3643/chat/completions): error trying to connect: tcp connect error: No connection could be made because the target machine actively refused it. (os error 10061)

[22:04:56] ERROR Proxy request to model failed: error sending request for url (http://127.0.0.1:3643/chat/completions): error trying to connect: tcp connect error: No connection could be made because the target machine actively refused it. (os error 10061)
[22:04:57] DEBUG Handling POST request to /chat/completions requiring model lookup in body
[22:04:57] DEBUG Extracted model_id: Kwaipilot.KAT-Dev.Q8_0
[22:04:57] DEBUG Found session for model_id Kwaipilot.KAT-Dev.Q8_0
[22:04:57] DEBUG Adding session Authorization header
[22:04:57] DEBUG Sending buffered body (221820 bytes)
[22:04:59] ERROR Proxy request to model failed: error sending request for url (http://127.0.0.1:3643/chat/completions): error trying to connect: tcp connect error: No connection could be made because the target machine actively refused it. (os error 10061)
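
`os error 10061` is the Windows socket error for a refused connection, which usually means nothing is listening on the target port at that moment (for example because the model process behind the proxy has exited). Here is a minimal sketch to check this from outside Jan.ai, using only the Python standard library. The host and port are taken from the log above and may differ per session; the helper name `is_listening` is hypothetical.

```python
import socket


def is_listening(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections (10061 on Windows) and timeouts.
        return False


if __name__ == "__main__":
    # Port 3643 is the one shown in the Jan.ai log above.
    print(is_listening("127.0.0.1", 3643))
```

If this prints `False` while Jan.ai still lists the model as loaded, the backend process has probably died, which would match the 502 returned by the proxy.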

Cline also throws errors like this:
The model used search patterns that don't match anything in the file. Retrying...

Kwaipilot org

Hello, thank you for your interest! We’re preparing a quantized & more stable version.

Your model produced better code than GLM 4.5 Air and Qwen3 Coder 30B, so when can we expect a proper Q8 version that works with Cline/RooCode?

V7 is DevQuasar Kwaipilot.KAT-Dev.Q8_0.gguf
V2 is GLM 4.5 Air

Compared by Claude
[Screenshot: menu]

Kwaipilot org

The proper Q8 version is expected to be released later this month.

Update:

It seems no errors are present when running:

Jan.ai v0.7.1 (recent versions are unstable; see the errors above)
Mungert KAT-Dev GGUF: https://huggingface.co/Mungert/KAT-Dev-GGUF
No Jinja template at all
Context set to 90k
GPU layers: 100
Other settings left at default

RooCode v3.28.15
VS Code:
Version: 1.105.1 (user setup)
Commit: 7d842fb85a0275a4a8e4d7e040d2625abbf7f084
Date: 2025-10-14T22:33:36.618Z
Electron: 37.6.0
ElectronBuildId: 12502201
Chromium: 138.0.7204.251
Node.js: 22.19.0
V8: 13.8.258.32-electron.0
OS: Windows_NT x64 10.0.26100

If you decide to test Jan.ai with other Jinja templates, you need to close both Jan.ai and VS Code and start fresh; otherwise false positives happen.

No template produces nicely formatted output; the formatting somehow gets stripped even with the official template.

[Screenshots: roo1, roo2, roo3, roo4, roo5, roo6]

This needs a permanent fix.

Any fixes coming, or is this dead?
