Submitted by richardxp888 84 WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent · 14 authors 6k 4
Submitted by xssstory 34 Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL · 8 authors 111 3
Submitted by Gaojunyao 28 CharacterShot: Controllable and Consistent 4D Character Animation · 8 authors 22 3
Submitted by wwen1997 26 Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models · 9 authors 30 2
Submitted by zstanjj 24 HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches · 7 authors 23 3
Submitted by yanyc 17 Test-Time Reinforcement Learning for GUI Grounding via Region Consistency · 8 authors 19 2
Submitted by wjkang 13 UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation · 7 authors 11 2
Submitted by hammh0a 11 Train Long, Think Short: Curriculum Learning for Efficient Reasoning · 6 authors 2
Submitted by huangsiteng 9 Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors · 13 authors 2
Submitted by Alex-GSL 9 Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy · 7 authors 2
Submitted by michaeltqw108 8 Adversarial Video Promotion Against Text-to-Video Retrieval · 6 authors 4 2
Submitted by Alex-xu 8 ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants · 12 authors 27 2
Submitted by Lakoc 7 DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition · 6 authors 2
Submitted by Junjie-Ye 7 Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments · 10 authors 12 2
Submitted by xxzcc 5 AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators · 16 authors 4
Submitted by Yinpei 4 AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies · 9 authors 10 2
Submitted by ArmelRandy 3 TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation · 3 authors 1 2
Submitted by FrancisRing 3 StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation · 9 authors 147 2
Submitted by sebasmos 2 Bridging Theory and Practice in Quantum Game Theory: Optimized Implementation of the Battle of the Sexes with Error Mitigation on NISQ Hardware · 5 authors 2
Submitted by ariG23498 1 Technical Report: Full-Stack Fine-Tuning for the Q Programming Language · 5 authors 1
Submitted by sofianebouaziz 1 WGAST: Weakly-Supervised Generative Network for Daily 10 m Land Surface Temperature Estimation via Spatio-Temporal Fusion · 4 authors 6 2
Submitted by Qznan 1 GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay · 7 authors 7 2
Submitted by ElmanGhazaei - Text-conditioned State Space Model For Domain-generalized Change Detection Visual Question Answering · 2 authors 2
Submitted by Hecheng0625 - NVSpeech: An Integrated and Scalable Pipeline for Human-Like Speech Modeling with Paralinguistic Vocalizations · 8 authors 2