Models

Active filters: text-generation-inference

deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated 2 days ago • 2.46k • 219

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 7.16M • 4.66k

Hcompany/Holo1.5-7B

Image-Text-to-Text • 8B • Updated about 8 hours ago • 1.28k • 81

google/gemma-3-270m

Text Generation • 0.3B • Updated Aug 14 • 158k • 838

driaforall/mem-agent

Text Generation • 4B • Updated 7 days ago • 1.18k • 71

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 499k • 12.7k

Qwen/Qwen3Guard-Gen-8B

Text Generation • 8B • Updated about 14 hours ago • 153 • 29

Qwen/Qwen3-4B-Instruct-2507

Text Generation • 4B • Updated 8 days ago • 1.53M • 328

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated 19 days ago • 262k • 763

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated 7 days ago • 2.77k • 28

Qwen/Qwen3Guard-Gen-0.6B

Text Generation • 0.8B • Updated about 14 hours ago • 293 • 24

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 4.38M • 1.26k

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26 • 2.09M • 624

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20 • 3.44M • 614

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26 • 6.15M • 641

LLM360/K2-Think

Text Generation • 33B • Updated 11 days ago • 17.3k • 302

Qwen/Qwen3Guard-Stream-8B

Feature Extraction • 8B • Updated about 9 hours ago • 26 • 16

meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.74M • 1.73k

meta-llama/Llama-3.1-8B

Text Generation • 8B • Updated Oct 16, 2024 • 941k • 1.82k

Qwen/Qwen3Guard-Gen-4B

Text Generation • 4B • Updated about 14 hours ago • 1.32k • 14

google/gemma-3-270m-it

Text Generation • 0.3B • Updated Aug 14 • 169k • 410

meta-llama/Llama-2-7b-chat-hf

Text Generation • 7B • Updated Apr 17, 2024 • 1.12M • 4.59k

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • 8B • Updated Jun 18 • 1.05M • 4.19k

meta-llama/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated Oct 24, 2024 • 7.48M • 1.08k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 966k • 1.62k

OpenGVLab/ScaleCUA-32B

Image-Text-to-Text • 33B • Updated 7 days ago • 169 • 15

meta-llama/Llama-2-7b-hf

Text Generation • 7B • Updated Apr 17, 2024 • 2.14M • 2.15k

Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7 • 492k • 373

Qwen/Qwen3Guard-Stream-4B

Feature Extraction • 4B • Updated about 9 hours ago • 65 • 11

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated Jul 24 • 860k • 2.96k