Devstral-Small-2507-Rebased-Vision

This model was created by taking Mistral-Small-3.2-24B-Instruct-2506 and replacing the weights under the language_model with the weights from Devstral-Small-2507. The result is Devstral with vision capabilities, but you should expect a small quality degradation.
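Until the actual code is released, the swap described above can be illustrated with a minimal sketch. This is not the author's script. The function name `rebase_language_model`, the `language_model.` key prefix, and the assumption that the text-only checkpoint uses the same key names once that prefix is stripped are all hypothetical; real checkpoints may lay out their keys differently depending on the library version. Plain lists stand in for tensors so the example runs without downloading anything.

```python
def rebase_language_model(vision_state, text_state, prefix="language_model."):
    """Return a copy of vision_state with every entry under `prefix`
    replaced by the matching entry from text_state.

    Assumption (hypothetical): stripping `prefix` from a key in the vision
    checkpoint gives the key used by the text-only checkpoint.
    """
    merged = dict(vision_state)  # shallow copy; inputs are left untouched
    replaced = []
    for key in vision_state:
        if not key.startswith(prefix):
            continue  # vision tower, projector, etc. are kept as they are
        source_key = key[len(prefix):]
        if source_key not in text_state:
            raise KeyError(f"no counterpart for {key!r} in the text-only checkpoint")
        merged[key] = text_state[source_key]
        replaced.append(key)
    return merged, replaced


# Toy stand-ins for the two checkpoints (lists instead of tensors).
vision_ckpt = {
    "vision_tower.w": [1],
    "multi_modal_projector.w": [2],
    "language_model.model.layers.0.w": [3],
}
text_ckpt = {"model.layers.0.w": [9]}

merged_ckpt, replaced_keys = rebase_language_model(vision_ckpt, text_ckpt)
```

With real checkpoints it would also be worth checking that each replaced tensor has the same shape and dtype as the one it overwrites before saving the result.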

Notes: I used unsloth's uploads of both models for convenience, since they also include some extra files and configs. I didn't name this just "-Vision" because it was not trained or fine-tuned after the weight rebase, and because a future version by mistralai may have vision itself.

The code will be released soon.

Evaluation

Evaluation was performed on 7 benchmarks using lm_eval and sglang. Scripts and other details will also be released with the code. This is not a comprehensive evaluation, and it is not directly comparable to the official benchmark numbers from Mistral; the goal was only to approximate the quality degradation. Make sure to test on your own downstream tasks!

| Tasks | Metric | Devstral-Small-2507 | Devstral-Small-2507-rebased | Relative Loss (%) | Relative Stderr (%) |
|---|---|---|---|---|---|
| arc_challenge_chat | exact_match | 0.9292 | 0.9283 | 0.10% | ±0.81% |
| eq_bench | eqbench | 72.3376 | 73.7481 | -1.95% | ±3.52% |
| gsm8k | exact_match | 0.8643 | 0.862 | 0.27% | ±1.09% |
| gsm8k | exact_match | 0.8605 | 0.8567 | 0.44% | ±1.10% |
| ifeval | inst_level_loose_acc | 0.6631 | 0.6595 | 0.54% | N/A |
| ifeval | inst_level_strict_acc | 0.6067 | 0.6019 | 0.79% | N/A |
| ifeval | prompt_level_loose_acc | 0.5619 | 0.5545 | 1.32% | ±3.81% |
| ifeval | prompt_level_strict_acc | 0.4917 | 0.4861 | 1.14% | ±4.37% |
| mbpp | pass_at_1 | 0.118 | 0.112 | 5.08% | ±12.20% |
| mmlu_pro | exact_match | 0.5786 | 0.579 | -0.07% | ±0.76% |
| triviaqa | exact_match | 0.7075 | 0.7068 | 0.10% | ±0.48% |
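
The Relative Loss column is consistent with (original - rebased) / original, expressed as a percentage, so a negative value means the rebased model scored higher. The sketch below reproduces a few of the values from the scores above; the helper name `relative_loss` is hypothetical and this is not the author's evaluation script. How the Relative Stderr column was derived is not stated, so it is not reproduced here.

```python
def relative_loss(original, rebased):
    """Percentage of the original score lost after the rebase.

    Positive: the rebased model scored lower. Negative: it scored higher.
    """
    return (original - rebased) / original * 100


# (original, rebased) scores copied from the results above.
scores = {
    "arc_challenge_chat": (0.9292, 0.9283),
    "eq_bench": (72.3376, 73.7481),
    "mbpp": (0.118, 0.112),
}

losses = {task: round(relative_loss(o, r), 2) for task, (o, r) in scores.items()}
for task, loss in losses.items():
    print(f"{task}: {loss:.2f}%")
```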