π Introduction
Lumina-mGPT 2.0 is a stand-alone, decoder-only autoregressive model, trained from scratch, that unifies a broad spectrum of image generation tasks, including text-to-image generation, image pair generation, subject-driven generation, multi-turn image editing, controllable generation, and dense prediction.
π Usage
We provide the implementation of Lumina-mGPT 2.0, as well as sampling code, visit our GitHub.
π½οΈ Demo Examples
Qualitative Performance

Comparison with Lumina-mGPT and Janus Pro

π Citation
If you find the provided code or models useful for your research, consider citing them as:
@article{xin2025lumina,
title={Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling},
author={Xin, Yi and Yan, Juncheng and Qin, Qi and Li, Zhen and Liu, Dongyang and Li, Shicheng and Huang, Victor Shea-Jay and Zhou, Yupeng and Zhang, Renrui and Zhuo, Le and others},
journal={arXiv preprint arXiv:2507.17801},
year={2025}
}
- Downloads last month
- 3
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support