Uploaded finetuned model
- Developed by: Nitish035
- License: apache-2.0
- Finetuned from model : Nitish035/merged16-sft_qwen3
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 51
Model tree for Nitish035/merged16_qwen_grpo-4000-2
Base model
Qwen/Qwen3-14B-Base
Finetuned
Qwen/Qwen3-14B
Quantized
unsloth/Qwen3-14B-unsloth-bnb-4bit
Finetuned
Nitish035/merged16-sft_qwen3