Post
2532
We just released TRL v0.20 with major multimodal upgrades!
ποΈ VLM support for GRPO (highly requested by the community!)
ποΈ New GSPO trainer (from @Qwen , released last week, VLM-ready)
π New MPO trainer (multimodal by design, as in the paper)
π Full release notes here: https://github.com/huggingface/trl/releases/tag/v0.20.0
ποΈ VLM support for GRPO (highly requested by the community!)
ποΈ New GSPO trainer (from @Qwen , released last week, VLM-ready)
π New MPO trainer (multimodal by design, as in the paper)
π Full release notes here: https://github.com/huggingface/trl/releases/tag/v0.20.0