tzjz89's picture

6 4

tzjz89

tzjz89

·

AI & ML interests

NLP

Recent Activity

upvoted a paper 11 days ago

Group Sequence Policy Optimization

upvoted a paper 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

upvoted a paper 3 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

View all activity

Organizations

models 0

None public yet

datasets 0

None public yet