Submitted by chengshuaizhao 141 Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens · 8 authors 24 4
Submitted by xavier-hu 51 Efficient Agents: Building Effective Agents While Reducing Cost · 14 authors 125 2
Submitted by Zery 37 SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience · 8 authors 55 2
Submitted by sbkarasik 30 Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning · 12 authors 4
Submitted by kefirski 28 Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success · 5 authors 0 1
Submitted by daixufang 21 Agent Lightning: Train ANY AI Agents with Reinforcement Learning · 8 authors 173 2
Submitted by P-YI 18 CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction · 7 authors 1
Submitted by Gnonymous 15 Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents · 15 authors 5 2
Submitted by starmage520 12 LaTCoder: Converting Webpage Design to Code with Layout-as-Thought · 13 authors 1
Submitted by BwZhang 12 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis · 7 authors 56 2
Submitted by zhangyik21 9 LeanK: Learnable K Cache Channel Pruning for Efficient Decoding · 7 authors 2
Submitted by Shuliang 8 DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework · 10 authors 1
Submitted by Mor-Li 6 Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management · 5 authors 2
Submitted by nuojohnchen 6 Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference · 6 authors 2
Submitted by YerbaPage 4 EVOC2RUST: A Skeleton-guided Framework for Project-Level C-to-Rust Translation · 8 authors 1
Submitted by tnlin 4 RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization · 14 authors 1
Submitted by nicopi 4 Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks · 7 authors 2
Submitted by songw-zju 3 A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding · 4 authors 4 2
Submitted by tianyilt 2 IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards · 9 authors 1
Submitted by xavier-hu 2 HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization · 7 authors 14 2
Submitted by jimbozhang 2 MiDashengLM: Efficient Audio Understanding with General Audio Captions · 10 authors 260 2
Submitted by SunZhigang7 2 DiffSemanticFusion: Semantic Raster BEV Fusion for Autonomous Driving via Online HD Map Diffusion · 16 authors 15 3
Submitted by MaziyarPanahi 2 OpenMed NER: Open-Source, Domain-Adapted State-of-the-Art Transformers for Biomedical NER Across 12 Public Datasets · 1 author 2
Submitted by Moon-bow 2 DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior · 11 authors 98 3
Submitted by alokabhishek 1 Data and AI governance: Promoting equity, ethics, and fairness in large language models · 3 authors 1
Submitted by dorienh 1 SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering · 3 authors 14 2
Submitted by wenliang1990 1 Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following · 5 authors 2
Submitted by HanzheL 1 C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor · 6 authors 2 2
Submitted by tianyilt 1 Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation · 9 authors 2
Submitted by sergiopicascia 1 The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models · 3 authors 2 2
Submitted by wyt2000 - StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion · 11 authors 2
Submitted by MahtabBg - MedBLINK: Probing Basic Perception in Multimodal Language Models for Medicine · 8 authors 2
Submitted by mingdachenmeta - FACTORY: A Challenging Human-Verified Prompt Set for Long-Form Factuality · 6 authors 1