Submitted by CodeGoat24 124 Unified Reward Model for Multimodal Understanding and Generation · 5 authors 529 3
Submitted by Nicolas-BZRD 81 EuroBERT: Scaling Multilingual Encoders for European Languages · 19 authors 9
Submitted by akhaliq 58 R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model · 6 authors 606 2
Submitted by liuxuan320 48 S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information · 6 authors 16 2
Submitted by jinheon 47 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching · 3 authors 126 3
Submitted by akhaliq 39 R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning · 3 authors 936 3
Submitted by zhixuan-lin 32 Forgetting Transformer: Softmax Attention with a Forget Gate · 4 authors 124 4
Submitted by akhaliq 27 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning · 8 authors 2
Submitted by BianYx 24 VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control · 7 authors 473 3
Submitted by wbhu-tc 19 TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models · 4 authors 749 2
Submitted by akhaliq 18 Learning from Failures in Multi-Attempt Reinforcement Learning · 3 authors 19 2
Submitted by akhaliq 15 TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation · 17 authors 2
Submitted by yunfanj 11 BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities · 10 authors 138 2
Submitted by weigao266 8 Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts · 5 authors 118 2
Submitted by EliverQ 8 An Empirical Study on Eliciting and Improving R1-like Reasoning Models · 13 authors 726 3
Submitted by tobiaslee 6 LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding · 11 authors 2
Submitted by hongyanz 5 EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test · 4 authors 2
Submitted by SkiddieAhn 3 AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM · 6 authors 37 2
Submitted by wangkevin02 3 Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles · 6 authors 12 3