nthngdy's Collections
Q-Filters

updated Mar 3

Pre-computed Q-Filters for efficient KV cache compression.

  • nthngdy/Llama-3.1-8B-Instruct_qfilt

    0.0B • Updated Nov 28, 2024 • 451

  • nthngdy/Llama-3.2-1B-Instruct_qfilt

    0.0B • Updated Nov 28, 2024 • 176

  • nthngdy/Llama-3.2-3B-Instruct_qfilt

    0.0B • Updated Feb 6 • 170

  • nthngdy/Llama-3.2-3B_qfilt

    0.0B • Updated Nov 28, 2024 • 172

  • nthngdy/Llama-3.1-8B_qfilt

    0.0B • Updated Nov 28, 2024 • 171

  • nthngdy/Llama-3.1-70B-Instruct_qfilt

    0.0B • Updated Mar 7 • 172

  • nthngdy/Llama-3.1-70B_qfilt

    0.0B • Updated Feb 6 • 169

  • nthngdy/Meta-Llama-3.1-405B_qfilt

    0.0B • Updated Feb 6 • 172

  • nthngdy/Mistral-Small-24B-Instruct-2501_qfilt

    0.0B • Updated Feb 6 • 169

  • nthngdy/phi-4_qfilt

    0.0B • Updated Feb 6 • 171

  • nthngdy/Llama-3.2-1B_qfilt

    0.0B • Updated Nov 28, 2024 • 352

  • nthngdy/Qwen2.5-7B_qfilt

    0.0B • Updated Feb 6 • 171

  • nthngdy/Qwen2.5-7B-Instruct_qfilt

    0.0B • Updated Feb 6 • 172

  • nthngdy/DeepSeek-R1-Distill-Llama-8B_qfilt

    0.0B • Updated Mar 3 • 215

  • nthngdy/DeepSeek-R1-Distill-Qwen-1.5B_qfilt

    0.0B • Updated Mar 3 • 174