Submitted by ZacharyNovack 8 WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning · 7 authors 1
Submitted by taesiri 6 LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation · 10 authors 2
Submitted by lizizun 3 WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool · 10 authors 1
Submitted by taesiri 3 MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting · 6 authors 1
Submitted by kevinr 2 On Robustness and Reliability of Benchmark-Based Evaluation of LLMs · 4 authors 1
Submitted by Z-LIRA 1 U-ARM: Ultra low-cost general teleoperation interface for robot manipulation · 7 authors 12 1