Show-o-512x512-RecA (Paper Coming Soon)

A self-supervised training framework that aligns understanding and generation in modest compute, with huge zero-shot gain on generation and editing capability.

This repository hosts the model weights for Show-o-512x512-RecA. For installation, usage instructions, and further documentation, please visit Show-o's original GitHub repository.

🧠 Method

Coming soon! Stay tuned~

πŸ“Š Benchmarks

Model GenEval ↑ DPGBench ↑ WISE ↑
Show-o-512x512 0.67 82.21 0.40
Show-o-512x512-RecA 0.72 84.94 0.40

License

Show-o-512x512-RecA is licensed under the Apache 2.0 license.

✍️ Citation

If you find our work inspiring or use our codebase in your research, please consider giving a star ⭐ and a citation~

@misc{xie2025reconstructionalignmentimprovesunified, title={Reconstruction Alignment Improves Unified Multimodal Models}, author={Ji Xie and Trevor Darrell and Luke Zettlemoyer and XuDong Wang}, year={2025}, eprint={2509.07295}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2509.07295}, }

Downloads last month
22
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for sanaka87/Show-o-512x512-RecA

Finetuned
(1)
this model

Dataset used to train sanaka87/Show-o-512x512-RecA

Collection including sanaka87/Show-o-512x512-RecA