Submitted by XingweiT 21 IntrEx: A Dataset for Modeling Engagement in Educational Conversations · 4 authors 1 1
Submitted by shash42 20 The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs · 5 authors 23 1
Submitted by taesiri 18 InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis · 7 authors 1
Submitted by HowieYan 18 X-Part: high fidelity and structure coherent shape decomposition · 11 authors 2
Submitted by prayerdan 13 HANRAG: Heuristic Accurate Noise-resistant Retrieval-Augmented Generation for Multi-hop Question Answering · 10 authors 4
Submitted by zhanjun 9 VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions · 14 authors 14 1
Submitted by mbreuss 7 FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies · 6 authors 1
Submitted by siyanzhao 6 Inpainting-Guided Policy Optimization for Diffusion Large Language Models · 11 authors 1
Submitted by JiahaoChen1 5 LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios · 4 authors 1
Submitted by taesiri 4 Color Me Correctly: Bridging Perceptual Color Spaces and Text Embeddings for Improved Diffusion Generation · 8 authors 1
Submitted by Wyattz23 4 QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading · 5 authors 1
Submitted by taesiri 3 MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools · 6 authors 2
Submitted by MarcHaraoui 3 Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images · 2 authors 3 1
Submitted by chnln 2 DeMeVa at LeWiDi-2025: Modeling Perspectives with In-Context Learning and Label Distribution Learning · 5 authors 1
Submitted by Kairong-Han 2 CAT: Causal Attention Tuning For Injecting Fine-grained Causal Knowledge into Large Language Models · 6 authors 1
Submitted by Geralt-Targaryen 1 CMHG: A Dataset and Benchmark for Headline Generation of Minority Languages in China · 7 authors 1
Submitted by joebaumann 1 Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation · 7 authors 2
Submitted by prnvpwr2612 - Interpretable Physics Reasoning and Performance Taxonomy in Vision-Language Models · 9 authors 1 1
Submitted by Rushi2002 - Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts · 9 authors 1