Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

Edit Models filters

Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Inference Providers
Novita
Nebius AI
Cerebras
Fireworks
Featherless AI
Together AI
Groq
fal
Hyperbolic
Replicate
SambaNova
Nscale
Cohere
HF Inference API
Misc
arxiv: 2501.13921
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

9
Full-text search
Active filters: 2501.13921

MediaTek-Research/Llama-Breeze2-8B-Instruct

8B • Updated Mar 2 • 1.45k • 46

MediaTek-Research/Llama-Breeze2-3B-Instruct

4B • Updated Mar 2 • 1.46k • 28

MediaTek-Research/BreezyVoice

Updated Feb 18 • 47

Qwe1325/Llama-Breeze2-8B-Instruct_4bit

5B • Updated Feb 26 • 4

Qwe1325/Llama-Breeze2-3B-Instruct_4bit

2B • Updated Feb 27 • 11

Qwe1325/Llama-Breeze2-8B-Instruct_8bit

8B • Updated Feb 28 • 4

Qwe1325/Llama-Breeze2-3B-Instruct_8bit

4B • Updated Feb 28 • 3

twinkle-ai/Llama-3.2-3B-F1-Instruct

Text Generation • 4B • Updated Apr 30 • 345 • 16

ThanatosDi/Llama-Breeze2-8B-Instruct

Updated May 6 • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs