AXERA-TECH's Collections

Multimodal Models

updated 21 days ago

  • AXERA-TECH/lcm-lora-sdv1-5

    Updated Jun 23 • 5 • 1

  • AXERA-TECH/InternVL3-2B

    Visual Question Answering • Updated 5 days ago • 10 • 2

  • AXERA-TECH/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • Updated 4 days ago • 29

  • AXERA-TECH/InternVL3-1B

    Image-Text-to-Text • Updated 14 days ago • 17

  • AXERA-TECH/SmolVLM2-500M-Video-Instruct

    Visual Question Answering • Updated Jul 14 • 12 • 2

  • AXERA-TECH/InternVL2_5-1B-MPO

    Image-Text-to-Text • Updated Aug 8 • 13

  • AXERA-TECH/InternVL2_5-1B

    Image-Text-to-Text • Updated Apr 4 • 4 • 1

  • AXERA-TECH/Janus-Pro-1B

    Visual Question Answering • Updated Apr 14 • 5 • 2

  • AXERA-TECH/SmolVLM-256M-Instruct

    Updated Apr 4 • 6 • 2

  • AXERA-TECH/LivePortrait

    Image-to-Video • Updated Jun 21 • 3 • 4

  • AXERA-TECH/cnclip

    Updated Aug 4 • 7 • 1

  • AXERA-TECH/clip

    Updated Aug 4 • 11

  • AXERA-TECH/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • Updated 18 days ago • 20

  • AXERA-TECH/YOLO-World-V2

    Zero-Shot Object Detection • Updated 5 days ago • 13 • 1