Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
zhixuan-lin
/
delta_net-760m-longcrawl64-48b
like
0
Text Generation
Transformers
Safetensors
delta_net-project_fox
long-context
forgetting-attention
deltanet
arxiv:
2503.02130
arxiv:
2406.06484
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
delta_net-760m-longcrawl64-48b
Ctrl+K
Ctrl+K
2 contributors
History:
6 commits
zhixuan-lin
nielsr
HF Staff
Add pipeline tag, tags and license to metadata (
#1
)
6cd9f55
verified
4 months ago
.gitattributes
Safe
1.52 kB
initial commit
5 months ago
README.md
5.06 kB
Add pipeline tag, tags and license to metadata (#1)
4 months ago
config.json
Safe
891 Bytes
Upload DeltaNetForCausalLM
5 months ago
generation_config.json
Safe
69 Bytes
Upload DeltaNetForCausalLM
5 months ago
merges.txt
Safe
456 kB
Upload tokenizer
5 months ago
model.safetensors
Safe
3.34 GB
LFS
Upload DeltaNetForCausalLM
5 months ago
special_tokens_map.json
Safe
438 Bytes
Upload tokenizer
5 months ago
tokenizer_config.json
Safe
519 Bytes
Upload tokenizer
5 months ago
vocab.json
Safe
999 kB
Upload tokenizer
5 months ago