HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration Paper • 2504.03536 • Published Apr 4 • 12
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model Paper • 2411.19108 • Published Nov 28, 2024 • 20
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation Paper • 2410.13571 • Published Oct 17, 2024 • 1
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond Paper • 2405.03520 • Published May 6, 2024 • 1
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation Paper • 2411.08380 • Published Nov 13, 2024 • 26
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception Paper • 2303.03991 • Published Mar 7, 2023 • 1
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving Paper • 2311.05332 • Published Nov 9, 2023 • 13
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens Paper • 2401.09985 • Published Jan 18, 2024 • 18
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving Paper • 2309.09777 • Published Sep 18, 2023 • 2