PowerInfer/SmallThinker-21BA3B-Instruct-GGUF

yixinsong committed 4 days ago

Commit 1ebc147 · verified · 1 Parent(s): fc99d13

Update README.md

Files changed (1)

README.md +2 -2

@@ -8,9 +8,9 @@ base_model:
 ---
 ## SmallThinker-21BA3B-Instruct-GGUF
-- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp) framwork.
-- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker) framwork.
+- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp) framework.
+- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker) framework.
 ## Introduction