Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
zhixuan-lin
/
delta_net-760m-longcrawl64-48b
like
0
Text Generation
Transformers
Safetensors
delta_net-project_fox
long-context
forgetting-attention
deltanet
arxiv:
2503.02130
arxiv:
2406.06484
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
delta_net-760m-longcrawl64-48b
/
merges.txt
zhixuan-lin
Upload tokenizer
b705461
verified
5 months ago
raw
Copy download link
history
contribute
delete
Safe
456 kB
File too large to display, you can
check the raw version
instead.