papers - a slothCreepTree Collection

slothCreepTree 's Collections

papers

papers

updated May 15

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 42
SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM

Paper • 2312.03788 • Published Dec 6, 2023 • 1
FlatQuant: Flatness Matters for LLM Quantization

Paper • 2410.09426 • Published Oct 12, 2024 • 16
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

Paper • 2501.01005 • Published Jan 2 • 1