Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
slothCreepTree 's Collections
papers

papers

updated May 15
Upvote
-

  • Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

    Paper • 2412.17739 • Published Dec 23, 2024 • 42

  • SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM

    Paper • 2312.03788 • Published Dec 6, 2023 • 1

  • FlatQuant: Flatness Matters for LLM Quantization

    Paper • 2410.09426 • Published Oct 12, 2024 • 16

  • FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

    Paper • 2501.01005 • Published Jan 2 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs