Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models Paper • 2506.04220 • Published Jun 4, 2025 • 4
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published Oct 14, 2024 • 17
NNsight and NDIF: Democratizing Access to Foundation Model Internals Paper • 2407.14561 • Published Jul 18, 2024 • 36