Zhen Yang

andyyang

AI & ML interests

None yet

Organizations

Tencent

upvoted a paper 5 months ago

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published Mar 31 • 21
upvoted a paper 6 months ago

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21 • 15
upvoted a paper 8 months ago

Lossless KV Cache Compression to 2%

Paper • 2410.15252 • Published Oct 20, 2024 • 1