Reconstruction Alignment Improves Unified Multimodal Models Paper • 2509.07295 • Published 3 days ago • 36
RecA Collection Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning! • 8 items • Updated 2 days ago • 9
CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation Paper • 2506.10890 • Published Jun 12 • 10
In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer Paper • 2504.20690 • Published Apr 29 • 19
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper • 2503.23461 • Published Mar 30 • 95
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models Paper • 2503.12885 • Published Mar 17 • 44
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published Feb 24 • 80
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering Paper • 2501.05131 • Published Jan 9 • 38
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances Paper • 2410.18775 • Published Oct 24, 2024 • 10