17 34 167

Yongliang Shen

tricktreat

tricktreat

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago

Qwen/Qwen3-30B-A3B

liked a model 18 days ago

Qwen/Qwen3-32B

authored a paper 21 days ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

View all activity

Organizations

authored a paper 21 days ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published 22 days ago • 36

authored 3 papers 23 days ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published 27 days ago • 17

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published 27 days ago • 19

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published 27 days ago • 21

authored 6 papers about 1 month ago

MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task

Paper • 2502.11684 • Published Feb 17 • 2

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 34

Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper • 2507.15844 • Published Jul 21 • 16

authored 5 papers 3 months ago

TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence

Paper • 2505.24500 • Published May 30 • 12

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

Paper • 2505.21500 • Published May 27 • 13

Let LLMs Break Free from Overthinking via Self-Braking Tuning

Paper • 2505.14604 • Published May 20 • 23

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning

Paper • 2505.14684 • Published May 20 • 24

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

Paper • 2505.15801 • Published May 21 • 17

authored 5 papers 4 months ago

DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL

Paper • 2503.04959 • Published Mar 6

AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification

Paper • 2503.01940 • Published Mar 3

A Survey on (M)LLM-Based GUI Agents

Paper • 2504.13865 • Published Mar 27 • 4

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Paper • 2503.06692 • Published Mar 9 • 2

Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems

Paper • 2503.06470 • Published Mar 9 • 2

Yongliang Shen

AI & ML interests

Recent Activity

Organizations

tricktreat's activity