Zero-Shot Image Classification
Transformers
Safetensors
siglip
vision

The accuracy on the ImageNet dataset is low

#13
by qingshuiL - opened

I used clip_benchmark to evaluate the model weights, and the accuracy on imagNet-1K is only 69.8. Is there anything to note here?

Hi, I'm struggling with the similar issue, and I posted a new post at: https://discuss.huggingface.co/t/siglip-2-models-show-lower-zero-shot-accuracy-than-reported/166735
Did you resolve this issue?

Sign up or log in to comment