Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

arxiv: 2310.08659

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

25

Full-text search

Active filters: 2310.08659

LoftQ/Llama-2-7b-hf-4bit-64rank

Text Generation • 4B • Updated May 3, 2024 • 422 • 2

LoftQ/Llama-2-13b-hf-4bit-64rank

Text Generation • 13B • Updated Dec 19, 2023 • 6 • 2

LoftQ/Llama-2-70b-hf-4bit-64rank

Text Generation • 69B • Updated May 3, 2024 • 5 • 1

LoftQ/Mistral-7B-v0.1-4bit-32rank

Text Generation • 7B • Updated Dec 20, 2023 • 5

LoftQ/Mistral-7B-v0.1-4bit-64rank

Text Generation • 4B • Updated Apr 18, 2024 • 6 • 2

LoftQ/Llama-2-7b-hf-fp16-64rank-gsm8k

Updated Dec 20, 2023 • 3

LoftQ/phi-2-4bit-64rank

Text Generation • 3B • Updated Aug 15, 2024 • 3

LoftQ/Meta-Llama-3-8B-4bit-64rank

Text Generation • 5B • Updated May 3, 2024 • 106 • 1

LoftQ/CodeLlama-7b-hf-4bit-64rank

Text Generation • 4B • Updated Apr 20, 2024 • 3

LoftQ/CodeLlama-13b-hf-4bit-64rank

Text Generation • 7B • Updated Apr 20, 2024 • 2

LoftQ/Meta-Llama-3-8B-Instruct-4bit-64rank

Text Generation • 5B • Updated May 3, 2024 • 9 • 1

LoftQ/Meta-Llama-3-70B-4bit-64rank-1iter

Text Generation • 37B • Updated Apr 21, 2024 • 4 • 2

LoftQ/Meta-Llama-3-70B-4bit-64rank

Text Generation • 37B • Updated May 3, 2024 • 4 • 1

LoftQ/Meta-Llama-3-70B-Instruct-4bit-64rank

Text Generation • 37B • Updated May 3, 2024 • 4 • 1

LoftQ/Phi-3-mini-128k-instruct-4bit-64rank

Text Generation • 2B • Updated May 3, 2024 • 4

LoftQ/Phi-3-mini-4k-instruct-4bit-64rank

Text Generation • 2B • Updated May 3, 2024 • 7

anamikac2708/Llama3-8b-LoftQ-finetuned-investopedia-Lora-Adapters

Updated Jun 18, 2024

RichardErkhov/LoftQ_-_Llama-2-13b-hf-4bit-64rank-gguf

13B • Updated Aug 12, 2024 • 319

RichardErkhov/LoftQ_-_Mistral-7B-v0.1-4bit-32rank-gguf

7B • Updated Aug 18, 2024 • 42

RichardErkhov/LoftQ_-_Mistral-7B-v0.1-4bit-32rank-4bits

4B • Updated Oct 18, 2024 • 3

RichardErkhov/LoftQ_-_Mistral-7B-v0.1-4bit-32rank-8bits

7B • Updated Oct 18, 2024 • 3

RichardErkhov/LoftQ_-_Llama-2-13b-hf-4bit-64rank-4bits

7B • Updated Oct 26, 2024 • 2

RichardErkhov/LoftQ_-_phi-2-4bit-64rank-gguf

3B • Updated Oct 30, 2024 • 38

RichardErkhov/LoftQ_-_phi-2-4bit-64rank-4bits

2B • Updated Oct 30, 2024 • 5

RichardErkhov/LoftQ_-_phi-2-4bit-64rank-8bits

3B • Updated Oct 30, 2024 • 3