Update README.md
README.md
CHANGED
@@ -8,9 +8,9 @@ base_model:
 ---
 ## SmallThinker-21BA3B-Instruct-GGUF
 
-- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp)
+- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp) framework.
 
-- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker)
+- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker) framework.
 
 ## Introduction
 