---
license: apache-2.0
language:
- en
base_model:
- prithivMLmods/Blitzar-Coder-4B-F.1
pipeline_tag: text-generation
library_name: transformers
tags:
- text-generation-inference
- code
- coder
---
|
|
|
|
|
# **Blitzar-Coder-4B-F.1-GGUF**
|
|
|
|
|
> **Blitzar-Coder-4B-F.1** is a high-efficiency, multi-language coding model fine-tuned from **Qwen3-4B** on large coding-trace datasets spanning **10+ programming languages**, including Python, Java, C#, C++, C, Go, JavaScript, TypeScript, Rust, and more. The model delivers exceptional code generation, debugging, and reasoning capabilities, making it an ideal tool for developers seeking advanced programming assistance under constrained compute.
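
Since the card lists `transformers` as the library, the full-precision checkpoint can also be run directly. A minimal sketch, assuming the weights live at the `base_model` repo `prithivMLmods/Blitzar-Coder-4B-F.1` and ship with a standard chat template (the GGUF files below are instead meant for llama.cpp-compatible runtimes):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "prithivMLmods/Blitzar-Coder-4B-F.1"  # base_model from the card metadata
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# Build a chat-formatted prompt and generate a completion.
messages = [{"role": "user", "content": "Write a Python function that checks whether a string is a palindrome."}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```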
|
|
|
|
|
## Model Files
|
|
|
|
|
| Filename | Size | Format | Description |
|----------|------|--------|-------------|
| Blitzar-Coder-4B-F.1.BF16.gguf | 8.05 GB | BF16 | 16-bit brain floating point |
| Blitzar-Coder-4B-F.1.F16.gguf | 8.05 GB | F16 | 16-bit half-precision floating point |
| Blitzar-Coder-4B-F.1.F32.gguf | 16.1 GB | F32 | 32-bit full-precision floating point |
| Blitzar-Coder-4B-F.1.Q2_K.gguf | 1.67 GB | Q2_K | 2-bit K-quant |
| Blitzar-Coder-4B-F.1.Q3_K_L.gguf | 2.24 GB | Q3_K_L | 3-bit K-quant (large) |
| Blitzar-Coder-4B-F.1.Q3_K_M.gguf | 2.08 GB | Q3_K_M | 3-bit K-quant (medium) |
| Blitzar-Coder-4B-F.1.Q3_K_S.gguf | 1.89 GB | Q3_K_S | 3-bit K-quant (small) |
| Blitzar-Coder-4B-F.1.Q4_K_M.gguf | 2.50 GB | Q4_K_M | 4-bit K-quant (medium) |
| Blitzar-Coder-4B-F.1.Q4_K_S.gguf | 2.38 GB | Q4_K_S | 4-bit K-quant (small) |
| Blitzar-Coder-4B-F.1.Q5_K_M.gguf | 2.89 GB | Q5_K_M | 5-bit K-quant (medium) |
| Blitzar-Coder-4B-F.1.Q5_K_S.gguf | 2.82 GB | Q5_K_S | 5-bit K-quant (small) |
| Blitzar-Coder-4B-F.1.Q6_K.gguf | 3.31 GB | Q6_K | 6-bit K-quant |
| Blitzar-Coder-4B-F.1.Q8_0.gguf | 4.28 GB | Q8_0 | 8-bit quantization |
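
To fetch a single quant instead of cloning the whole repository, the `huggingface_hub` client can download one file by name. A minimal sketch, assuming this repo is published as `prithivMLmods/Blitzar-Coder-4B-F.1-GGUF` (adjust `repo_id` if it differs):

```python
from huggingface_hub import hf_hub_download

# Download one quant from the table above into the local Hugging Face cache.
path = hf_hub_download(
    repo_id="prithivMLmods/Blitzar-Coder-4B-F.1-GGUF",  # assumed repo id, matching this card's title
    filename="Blitzar-Coder-4B-F.1.Q4_K_M.gguf",        # any filename from the table works
)
print(path)  # absolute path to the cached .gguf file
```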
|
|
|
|
|
### Recommended Usage
|
|
|
|
|
- **Q4_K_M** or **Q5_K_M**: Best balance of quality and performance for most users (see the loading sketch after this list)
- **Q6_K** or **Q8_0**: Higher quality, larger file sizes
- **Q2_K** or **Q3_K_S**: Fastest inference, lower quality
- **F16** or **BF16**: High quality, requires more VRAM
- **F32**: Highest quality, requires significant VRAM
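
These GGUF files target llama.cpp-compatible runtimes. A minimal loading sketch using the `llama-cpp-python` bindings (one of several llama.cpp frontends), with illustrative rather than prescriptive parameters:

```python
from llama_cpp import Llama

# Load the recommended Q4_K_M quant; adjust the path to where you saved the file.
llm = Llama(
    model_path="Blitzar-Coder-4B-F.1.Q4_K_M.gguf",
    n_ctx=4096,       # context window; raise it if memory allows
    n_gpu_layers=-1,  # offload all layers to GPU; use 0 for CPU-only
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Implement binary search in Rust."}],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```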
|
|
|
|
|
## Quants Usage
|
|
|
|
|
(sorted by size, not necessarily quality; IQ-quants are often preferable to similarly sized non-IQ quants)
|
|
|
|
|
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
|
|
|
|
|
 |