📚 Introduction

Lumina-mGPT 2.0 is a stand-alone, decoder-only autoregressive model, trained from scratch, that unifies a broad spectrum of image generation tasks, including text-to-image generation, image pair generation, subject-driven generation, multi-turn image editing, controllable generation, and dense prediction.
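To make "decoder-only autoregressive" concrete, here is a minimal sketch of next-token sampling over discrete image tokens. It is a generic illustration, not the Lumina-mGPT 2.0 implementation: `toy_logits` is a hypothetical stand-in for the transformer, and `VOCAB_SIZE`, `GRID`, and all function names are assumptions chosen for the example.

```python
import math
import random

VOCAB_SIZE = 16  # hypothetical size of the discrete image-token codebook
GRID = 4         # hypothetical 4x4 grid of image tokens


def toy_logits(prefix):
    """Hypothetical stand-in for a decoder-only transformer.

    A real model would attend over the whole prefix (prompt plus the image
    tokens generated so far). This toy simply favours the token that
    follows the most recent one.
    """
    last = prefix[-1] if prefix else 0
    favoured = (last + 1) % VOCAB_SIZE
    return [2.0 if t == favoured else 0.0 for t in range(VOCAB_SIZE)]


def sample_next(logits, temperature, rng):
    """Sample one token id from softmax(logits / temperature)."""
    scaled = [value / temperature for value in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    return rng.choices(range(VOCAB_SIZE), weights=weights, k=1)[0]


def generate_image_tokens(prompt_tokens, temperature=1.0, seed=0):
    """Append GRID*GRID tokens one at a time, each conditioned on the prefix."""
    rng = random.Random(seed)
    sequence = list(prompt_tokens)
    for _ in range(GRID * GRID):
        sequence.append(sample_next(toy_logits(sequence), temperature, rng))
    image_tokens = sequence[len(prompt_tokens):]
    # Reshape the flat token stream into rows; a real pipeline would pass
    # these ids to an image tokenizer's decoder to obtain pixels.
    return [image_tokens[r * GRID:(r + 1) * GRID] for r in range(GRID)]


if __name__ == "__main__":
    for row in generate_image_tokens(prompt_tokens=[1, 2, 3]):
        print(row)
```

In this framing, the different tasks listed above would differ only in what is placed in the prefix (text, a reference image, a control map), while the sampling loop stays the same.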

🚀 Usage

The implementation of Lumina-mGPT 2.0 and the accompanying sampling code are available on our GitHub.

📽️ Demo Examples

Qualitative Performance

Comparison with Lumina-mGPT and Janus Pro

📖 Citation

If you find the provided code or models useful for your research, please consider citing:

@article{xin2025lumina,
  title={Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling},
  author={Xin, Yi and Yan, Juncheng and Qin, Qi and Li, Zhen and Liu, Dongyang and Li, Shicheng and Huang, Victor Shea-Jay and Zhou, Yupeng and Zhang, Renrui and Zhuo, Le and others},
  journal={arXiv preprint arXiv:2507.17801},
  year={2025}
}