BinghengWu's picture

3 6 10

BinghengWu

wubingheng

·

https://github.com/wubingheng111

AI & ML interests

I like to fine-tune the small models of the Doge series.

Recent Activity

authored a paper about 1 month ago

Trainable Dynamic Mask Sparse Attention

upvoted an article about 2 months ago

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

published an article about 2 months ago

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

View all activity

Organizations

wubingheng 's models 3

wubingheng/Doge-20M-Medical-SFT

Text Generation • 0.0B • Updated Apr 16 • 4

wubingheng/Doge-20M-Chinese

Text Generation • 0.0B • Updated Apr 15 • 20 • 2

wubingheng/Doge-197M-Medical-SFT

Question Answering • 0.2B • Updated Jan 31 • 6 • 2