Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
XiaomiMiMo
/
MiMo-Audio-7B-Instruct
like
101
Follow
Xiaomi MiMo
866
Any-to-Any
Safetensors
qwen2
Audio-to-Text
Text-to-Audio
Audio-to-Audio
Text-to-Text
Audio-Text-to-Text
License:
mit
Model card
Files
Files and versions
xet
Community
7
No vision?
#4
by
yukiarimo
- opened
5 days ago
Discussion
yukiarimo
5 days ago
When the vision is coming?
See translation
yukiarimo
5 days ago
Like VLM + Audio
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment