Ruisi Cai

CCCCRS
AI & ML interests

None yet

Organizations

DeepMamba

authored a paper 7 months ago

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Paper • 2501.00712 • Published Jan 1 • 6

authored 5 papers 9 months ago

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?

Paper • 2302.12480 • Published Feb 24, 2023

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Paper • 2306.14048 • Published Jun 24, 2023 • 12

Robust Mixture-of-Expert Training for Convolutional Neural Networks

Paper • 2308.10110 • Published Aug 19, 2023 • 2

Flextron: Many-in-One Flexible Large Language Model

Paper • 2406.10260 • Published Jun 11, 2024 • 2

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Paper • 2410.19123 • Published Oct 24, 2024 • 15