RWKV
SmerkyG committed on commit b4b912e · verified · 1 Parent(s): 1abf3cf

Upload folder using huggingface_hub

Files changed (2):
  1. README.md +9 -7
  2. model.safetensors +2 -2
README.md CHANGED
@@ -1,13 +1,13 @@
 ---
-license: apache-2.0
+base_model:
+- BlinkDL/rwkv-7-pile
 datasets:
 - EleutherAI/the_pile_deduplicated
 language:
 - en
+license: apache-2.0
 metrics:
 - accuracy
-base_model:
-- BlinkDL/rwkv-7-pile
 pipeline_tag: text-generation
 library_name: transformers
 ---
@@ -38,15 +38,17 @@ This is RWKV-7 model under flash-linear attention format.
 <!-- Provide the basic links for the model. -->
 
 - **Repository:** https://github.com/fla-org/flash-linear-attention ; https://github.com/BlinkDL/RWKV-LM
-- **Paper:** [RWKV-7 "Goose" with Expressive Dynamic State Evolution](https://arxiv.org/abs/2503.14456)
+- **Paper:** [RWKV: Parallelizable RNN with Transformer-level LLM Performance](https://huggingface.co/papers/2503.14456)
+- **Project Page:** [RWKV](https://huggingface.co/RWKV)
+
 
 ## Uses
 
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
-Install `flash-linear-attention` <= 0.1.2 and the latest version of `transformers` before using this model:
+Install `flash-linear-attention` and the latest version of `transformers` before using this model:
 
 ```bash
-pip install --no-use-pep517 flash-linear-attention==0.1.2
+pip install git+https://github.com/fla-org/flash-linear-attention
 pip install 'transformers>=4.48.0'
 ```
@@ -81,4 +83,4 @@ This model is trained on the Pile with a total of 332 billion tokens.
 ## FAQ
 Q: safetensors metadata is none.
 
-A: upgrade transformers to >=4.48.0: `pip install 'transformers>=4.48.0'`
+A: upgrade transformers to >=4.48.0: `pip install 'transformers>=4.48.0'`
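
The FAQ entry in this diff concerns safetensors header metadata coming back as `None`. As a hedged illustration of why that happens (assuming the standard safetensors layout: an 8-byte little-endian header length, then a JSON header whose optional `__metadata__` key carries format info), a minimal stdlib-only sketch of writing and reading that header; the file name and helper names here are made up for the example:

```python
import json
import struct

def write_safetensors_header(path, tensor_entries, metadata=None):
    """Write only the header portion of a safetensors file (sketch; no tensor data)."""
    header = dict(tensor_entries)
    if metadata is not None:
        header["__metadata__"] = metadata  # optional; readers see None when absent
    header_bytes = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(header_bytes)))  # 8-byte LE header length
        f.write(header_bytes)
        # raw tensor bytes would follow here in a real file

def read_metadata(path):
    """Return the __metadata__ dict, or None if the writer stored no metadata."""
    with open(path, "rb") as f:
        (n,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(n))
    return header.get("__metadata__")

write_safetensors_header("demo.safetensors", {}, metadata={"format": "pt"})
print(read_metadata("demo.safetensors"))  # {'format': 'pt'}
```

A file saved without the optional `__metadata__` key is still valid, which is why older tooling can produce checkpoints whose metadata reads as `None`.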
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a779f0d1b1814c6f5481328c30f4e3ac314ec227e528336c70c721a252b5ebfc
-size 842232712
+oid sha256:7cf7acea0d2becc8b7c5f085b3fefb1e32bd0b3c6dee4e39fd2a3e419c9aee35
+size 842244712