Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

igormolybog 's Collections
Domain spec fine-tuning
Inference speed
llama + WebWork
evals
Solver training
Datasets
Reasoning
Hetero training
Long context
Open
Agents
LM economy
Scaling laws
compression
robotics
Alignment
Imagen

compression

updated May 1, 2024
Upvote
-

  • BitDelta: Your Fine-Tune May Only Be Worth One Bit

    Paper • 2402.10193 • Published Feb 15, 2024 • 23

  • OneBit: Towards Extremely Low-bit Large Language Models

    Paper • 2402.11295 • Published Feb 17, 2024 • 25

  • BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

    Paper • 2402.04291 • Published Feb 6, 2024 • 51

  • GPTVQ: The Blessing of Dimensionality for LLM Quantization

    Paper • 2402.15319 • Published Feb 23, 2024 • 22

  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 624

  • SnapKV: LLM Knows What You are Looking for Before Generation

    Paper • 2404.14469 • Published Apr 22, 2024 • 28
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs