Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published May 5 • 79
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 44
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published Apr 29 • 44
stable-diffusion-v1-5/stable-diffusion-inpainting Text-to-Image • Updated Sep 6, 2024 • 431k • 74
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18 • 39
diffusers/stable-diffusion-xl-1.0-inpainting-0.1 Text-to-Image • Updated Sep 3, 2023 • 1.2M • 342
stabilityai/stable-diffusion-xl-refiner-1.0 Image-to-Image • Updated Sep 25, 2023 • 561k • 1.96k