BinghengWu's picture

3 6 10

BinghengWu

wubingheng

·

https://github.com/wubingheng111

AI & ML interests

I like to fine-tune the small models of the Doge series.

Recent Activity

authored a paper about 1 month ago

Trainable Dynamic Mask Sparse Attention

upvoted an article about 1 month ago

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

published an article about 1 month ago

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

View all activity

Organizations

published an article about 1 month ago

Article

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

By

and 2 others •

Aug 5

• 6