Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
1809
277
576
merve
PRO
merve
Follow
iswarup's profile picture
jqop's profile picture
kramp's profile picture
8596 followers
·
369 following
https://github.com/merveenoyan/smol-vision
mervenoyann
merveenoyan
merve.bsky.social
AI & ML interests
I love this website VLMs, vision & co
Recent Activity
new
activity
1 day ago
openbmb/MiniCPM-V-4:
License?
posted
an
update
1 day ago
we're all sleeping on this OCR model https://huggingface.co/rednote-hilab/dots.ocr 🔥 dots.ocr is a new 3B model with sota performance, support for 100 languages & allowing commercial use! 🤯 single e2e model to extract image, convert tables, formula, and more into markdown 📝 try it https://huggingface.co/spaces/MohamedRashad/Dots-OCR
liked
a dataset
1 day ago
Qwen/PolyMath
View all activity
Organizations
merve
's models
98
Sort: Recently updated
merve/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v
Updated
14 days ago
merve/smol-vision
Image-Text-to-Text
•
Updated
15 days ago
•
93
merve/Qwen2.5-VL-7B-Instruct-trl-mpo-rlaif-v
Updated
15 days ago
merve/gemma-3n-finevideo
Updated
21 days ago
•
7
merve/vjepa2-vitl-fpc16-256-ssv2-ucf101
Video Classification
•
0.4B
•
Updated
Jun 13
•
79
merve/test
Updated
May 16
merve/SmolVLM2-500M-Video-Instruct-video-feedback
Image-Text-to-Text
•
0.5B
•
Updated
Feb 20
•
5
merve/SmolVLM2-500M-Video-Instruct-videofeedback
Image-Text-to-Text
•
0.5B
•
Updated
Feb 20
•
4
merve/SmolVLM2-500M-Video-Instruct-emotions
Image-Text-to-Text
•
0.5B
•
Updated
Feb 20
•
5
merve/colpali_ufo
Updated
Dec 20, 2024
•
7
merve/paligemma_vqav2
Image-to-Text
•
3B
•
Updated
Dec 18, 2024
•
45
•
13
merve/paligemma2-3b-vqav2
Updated
Dec 5, 2024
•
40
•
6
merve/google-ckpts
Updated
Oct 22, 2024
merve/google-tokenizers
Updated
Oct 22, 2024
merve/idefics3-llama-vqav2
Updated
Sep 11, 2024
merve/idefics3llama-vqav2
Updated
Sep 11, 2024
•
8
merve/flux-dreambooth-lora
Updated
Aug 16, 2024
•
1
merve/trained-flux-lora-lego
Text-to-Image
•
Updated
Aug 16, 2024
•
11
•
•
1
merve/flux-lego-lora-dreambooth
Text-to-Image
•
Updated
Aug 16, 2024
•
39
•
•
13
merve/sam2-hiera-large
Mask Generation
•
Updated
Aug 2, 2024
•
1.12k
•
2
merve/sam2-hiera-base-plus
Mask Generation
•
Updated
Aug 2, 2024
•
48
merve/sam2-hiera-small
Mask Generation
•
Updated
Aug 2, 2024
•
57
•
1
merve/sam2-hiera-tiny
Mask Generation
•
Updated
Aug 2, 2024
•
29
merve/vq-vae
Updated
Jul 18, 2024
•
18
•
2
merve/MobileNetV2-pascalvoc
Updated
Jul 15, 2024
merve/pg-vqav2
Updated
May 22, 2024
merve/VeCap-DFN-h14
Zero-Shot Image Classification
•
1.0B
•
Updated
Mar 26, 2024
•
3
merve/VeCap-DFN-l14
0.4B
•
Updated
Mar 26, 2024
merve/VeCap-DFN-b16
Zero-Shot Image Classification
•
0.1B
•
Updated
Mar 26, 2024
•
3
merve/VeCLIP-b16-100m
Zero-Shot Image Classification
•
0.1B
•
Updated
Mar 26, 2024
•
3
Previous
1
2
3
4
Next