Submitted by YannQi 83 R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning · 6 authors 37 1
Submitted by delinqu 58 EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control · 15 authors 123 2
Submitted by wanng 46 A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code · 21 authors 133 2
Submitted by lixiaochuan 32 Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation · 14 authors 1
Submitted by CoCoOne 24 A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers · 103 authors 153 1
Submitted by Shunian 15 TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis · 13 authors 70 2
Submitted by taesiri 10 Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models · 8 authors 2
Submitted by taesiri 7 UItron: Foundational GUI Agent with Advanced Perception and Planning · 10 authors 1
Submitted by XiaohuanZhou 7 TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training · 9 authors 1
Submitted by gemcollector 1 HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data for Mobile Dexterous Manipulation · 7 authors 1
Submitted by JiaaqiLiu 1 Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery · 15 authors 1
Submitted by nennomp - Deep Residual Echo State Networks: exploring residual orthogonal connections in untrained Recurrent Neural Networks · 3 authors 0 1
Submitted by AllanK24 - Quantization Robustness to Input Degradations for Object Detection · 3 authors 1
Submitted by yhua219 - EduRABSA: An Education Review Dataset for Aspect-based Sentiment Analysis Tasks · 4 authors 3 1