ArtusDev
/

Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3

ArtusDev commited on 12 days ago

Commit

be5ed5d

verified ·

1 Parent(s): 185b4c4

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md ADDED Viewed

+---
+base_model: Qwen/Qwen3-Coder-30B-A3B-Instruct
+base_model_relation: quantized
+quantized_by: ArtusDev
+---
+## EXL3 Quants of Qwen/Qwen3-Coder-30B-A3B-Instruct
+EXL3 quants of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) using <a href="https://github.com/turboderp-org/exllamav3/">exllamav3</a> for quantization.
+### Quants
+| Quant(Revision) | Bits per Weight | Head Bits |
+| -------- | ---------- | --------- |
+| [4.0_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/4.0bpw_H6) | 4.0 | 6 |
+| [4.5_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/4.5bpw_H6) | 4.5 | 6 |
+| [5.0_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/5.0bpw_H6) | 5.0 | 6 |
+| [6.0_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/6.0bpw_H6) | 6.0 | 6 |
+| [8.0_H8](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/8.0bpw_H8) | 8.0 | 8 |
+### Downloading quants with huggingface-cli
+<details>
+  <summary>Click to view download instructions</summary>
+Install hugginface-cli:
+```bash
+pip install -U "huggingface_hub[cli]"
+```
+Download quant by targeting the specific quant revision (branch):
+```
+huggingface-cli download ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3 --revision "5.0bpw_H6" --local-dir ./
+```
+</details>