Reverse-Engineered Reasoning for Open-Ended Generation Paper • 2509.06160 • Published 6 days ago • 138
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published 4 days ago • 87
Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation Paper • 2509.02040 • Published 11 days ago • 14
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model Paper • 2509.00676 • Published 13 days ago • 78
Can MLLMs Understand the Deep Implication Behind Chinese Images? Paper • 2410.13854 • Published Oct 17, 2024 • 12