Rin
hu5enpai
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
9 days ago
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
commented on
a paper
10 days ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised
Fine-Tuning and Reinforcement Learning via Dynamic Weighting
commented on
a paper
about 1 month ago
On the Generalization of SFT: A Reinforcement Learning Perspective with
Reward Rectification