EmbeddedLLM's Collections

ONNX GenAI

Updated Jul 2, 2024

A collection of models that can be run with onnxruntime-genai and served through the embeddedllm library.

  • EmbeddedLLM/mistral-7b-instruct-v0.3-onnx

    Text Generation • Updated Jun 17, 2024 • 2

  • EmbeddedLLM/Starling-LM-7b-beta-onnx

    Text Generation • Updated Jun 17, 2024

  • EmbeddedLLM/gemma-2b-it-onnx

    Text Generation • Updated Jun 17, 2024 • 1

  • EmbeddedLLM/gemma-7b-it-onnx

    Text Generation • Updated Jun 20, 2024 • 1

  • EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml

    Text Generation • Updated Jun 19, 2024 • 11

  • EmbeddedLLM/llama-2-13b-chat-int4-onnx-directml

    Text Generation • Updated Jun 17, 2024 • 5

  • luweigen/Llama-3-8B-Instruct-int4-onnx-directml

    Text Generation • Updated Jun 15, 2024 • 4

  • EmbeddedLLM/Phind-CodeLlama-34B-v2-onnx

    Updated Jun 19, 2024

  • EmbeddedLLM/openchat-3.6-8b-20240522-onnx

    Text Generation • Updated Jun 17, 2024 • 1

  • EmbeddedLLM/01-ai_Yi-1.5-6B-Chat-onnx

    Text Generation • Updated Jun 20, 2024

  • EmbeddedLLM/01-ai_Yi-1.5-34B-Chat-16K-onnx

    Updated Jun 19, 2024 • 1

  • EmbeddedLLM/Phi-3-mini-4k-instruct-062024-onnx

    Text Generation • Updated Jul 5, 2024
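
As a rough illustration of how the repositories listed above might be fetched for local use, here is a minimal sketch. It assumes the `huggingface_hub` package is installed and uses its `snapshot_download` function. The helper names (`download_model`, `find_model_dirs`, `directml_builds`) and the `models` cache folder are hypothetical, introduced only for this example. The listing does not say how files are laid out inside each repository, so the sketch searches for `genai_config.json`, the configuration file that onnxruntime-genai expects in a model folder, instead of assuming a path.

```python
from pathlib import Path

# Repository IDs copied from the collection listing above.
COLLECTION = [
    "EmbeddedLLM/mistral-7b-instruct-v0.3-onnx",
    "EmbeddedLLM/Starling-LM-7b-beta-onnx",
    "EmbeddedLLM/gemma-2b-it-onnx",
    "EmbeddedLLM/gemma-7b-it-onnx",
    "EmbeddedLLM/llama-2-7b-chat-int4-onnx-directml",
    "EmbeddedLLM/llama-2-13b-chat-int4-onnx-directml",
    "luweigen/Llama-3-8B-Instruct-int4-onnx-directml",
    "EmbeddedLLM/Phind-CodeLlama-34B-v2-onnx",
    "EmbeddedLLM/openchat-3.6-8b-20240522-onnx",
    "EmbeddedLLM/01-ai_Yi-1.5-6B-Chat-onnx",
    "EmbeddedLLM/01-ai_Yi-1.5-34B-Chat-16K-onnx",
    "EmbeddedLLM/Phi-3-mini-4k-instruct-062024-onnx",
]


def download_model(repo_id, cache_root="models"):
    """Download every file of one repository into cache_root/<repo name>.

    Hypothetical helper; needs network access and huggingface_hub.
    """
    # Imported lazily so the other helpers work without huggingface_hub.
    from huggingface_hub import snapshot_download

    target = Path(cache_root) / repo_id.split("/")[-1]
    snapshot_download(repo_id=repo_id, local_dir=target)
    return target


def find_model_dirs(root):
    """Return, sorted, every folder under root that holds a genai_config.json."""
    return sorted(p.parent for p in Path(root).rglob("genai_config.json"))


def directml_builds(repo_ids):
    """Pick the repositories whose name marks them as DirectML builds."""
    return [r for r in repo_ids if r.endswith("-directml")]
```

Calling `download_model(COLLECTION[-1])` would fetch the Phi-3 mini repository, and `find_model_dirs` on the returned folder would list the candidate model directories. Any one of those could then be handed to onnxruntime-genai; which execution provider a given directory targets is something to confirm from the repository itself.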