Sequential Modeling Enables Scalable Learning for Large Vision Models Paper • 2312.00785 • Published Dec 1, 2023
EgoPet: Egomotion and Interaction Data from an Animal's Perspective Paper • 2404.09991 • Published Apr 15, 2024
Forgotten Polygons: Multimodal Large Language Models are Shape-Blind Paper • 2502.15969 • Published Feb 21, 2025