Commit be5ed5d by ArtusDev · verified · 1 parent: 185b4c4

Upload README.md with huggingface_hub

  1. README.md +36 -0
README.md ADDED
@@ -0,0 +1,36 @@
+ ---
+ base_model: Qwen/Qwen3-Coder-30B-A3B-Instruct
+ base_model_relation: quantized
+ quantized_by: ArtusDev
+ ---
+
+ ## EXL3 Quants of Qwen/Qwen3-Coder-30B-A3B-Instruct
+
+ EXL3 quants of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct), quantized with [exllamav3](https://github.com/turboderp-org/exllamav3/).
+
+ ### Quants
+ | Quant (Revision) | Bits per Weight | Head Bits |
+ | -------- | ---------- | --------- |
+ | [4.0_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/4.0bpw_H6) | 4.0 | 6 |
+ | [4.5_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/4.5bpw_H6) | 4.5 | 6 |
+ | [5.0_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/5.0bpw_H6) | 5.0 | 6 |
+ | [6.0_H6](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/6.0bpw_H6) | 6.0 | 6 |
+ | [8.0_H8](https://huggingface.co/ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3/tree/8.0bpw_H8) | 8.0 | 8 |
+
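To compare the rows of the quant table, the bits-per-weight figure can be turned into a rough weight footprint with `params * bpw / 8` bytes. The sketch below is only an estimate under stated assumptions: the parameter count of about 30.5 billion is not given in this README, the head bits column is ignored (it is assumed to affect only the output layer), and runtime memory such as the KV cache is not counted. The names `ASSUMED_PARAMS` and `approx_weight_gib` are made up for illustration.

```python
# Rough weight-size estimate per quant revision.
# Assumption: about 30.5e9 total parameters (not stated in this README).
ASSUMED_PARAMS = 30.5e9

# Revision names and bits per weight, copied from the quant table.
QUANT_BPW = {
    "4.0bpw_H6": 4.0,
    "4.5bpw_H6": 4.5,
    "5.0bpw_H6": 5.0,
    "6.0bpw_H6": 6.0,
    "8.0bpw_H8": 8.0,
}


def approx_weight_gib(bits_per_weight: float, n_params: float = ASSUMED_PARAMS) -> float:
    """Convert bits per weight into an approximate weight size in GiB."""
    total_bytes = n_params * bits_per_weight / 8  # 8 bits per byte
    return total_bytes / 2**30  # bytes -> GiB


if __name__ == "__main__":
    for revision, bpw in QUANT_BPW.items():
        print(f"{revision}: ~{approx_weight_gib(bpw):.1f} GiB of weights")
```

Actual file sizes on the Hub will differ somewhat, since the output layer is stored at its own bit width and the repository also contains tokenizer and config files.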
+ ### Downloading quants with huggingface-cli
+
+ <details>
+ <summary>Click to view download instructions</summary>
+
+ Install huggingface-cli:
+
+ ```bash
+ pip install -U "huggingface_hub[cli]"
+ ```
+
+ Download a quant by targeting its specific revision (branch):
+
+ ```bash
+ huggingface-cli download ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3 --revision "5.0bpw_H6" --local-dir ./
+ ```
+ </details>
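The same download can likely be done from Python with `huggingface_hub.snapshot_download`, which accepts a `revision` argument for selecting a branch. This is a minimal sketch, not part of the README: the helpers `quant_revision` and `download_quant` are hypothetical names, and the branch naming pattern is inferred from the revisions listed in the quant table.

```python
REPO_ID = "ArtusDev/Qwen_Qwen3-Coder-30B-A3B-Instruct-EXL3"


def quant_revision(bits_per_weight: float, head_bits: int) -> str:
    """Build a branch name such as '5.0bpw_H6' (pattern inferred from the table)."""
    return f"{bits_per_weight:.1f}bpw_H{head_bits}"


def download_quant(bits_per_weight: float, head_bits: int, local_dir: str = "./") -> str:
    """Download one quant revision and return the local folder path."""
    # Imported lazily so the helper above works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        revision=quant_revision(bits_per_weight, head_bits),
        local_dir=local_dir,
    )
```

Calling `download_quant(5.0, 6)` would request the `5.0bpw_H6` branch, matching the CLI command in the README. It needs network access and enough disk space for the chosen quant.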