Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
13.5
TFLOPS
3
6
10
BinghengWu
wubingheng
Follow
bighu's profile picture
minpeter's profile picture
yangfa's profile picture
7 followers
·
7 following
https://github.com/wubingheng111
HangWu19938
wubingheng111
AI & ML interests
I like to fine-tune the small models of the Doge series.
Recent Activity
authored
a paper
about 1 month ago
Trainable Dynamic Mask Sparse Attention
upvoted
an
article
about 2 months ago
Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models
published
an
article
about 2 months ago
Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models
View all activity
Organizations
wubingheng
's models
3
Sort: Recently updated
wubingheng/Doge-20M-Medical-SFT
Text Generation
•
0.0B
•
Updated
Apr 16
•
4
wubingheng/Doge-20M-Chinese
Text Generation
•
0.0B
•
Updated
Apr 15
•
20
•
2
wubingheng/Doge-197M-Medical-SFT
Question Answering
•
0.2B
•
Updated
Jan 31
•
6
•
2