TLDR: Token-Level Detective Reward Model for Large Vision Language Models Paper • 2410.04734 • Published Oct 7, 2024 • 17
LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning Paper • 2312.03849 • Published Dec 6, 2023 • 7