tzjz89
tzjz89
AI & ML interests
NLP
Recent Activity
upvoted
a
paper
11 days ago
Group Sequence Policy Optimization
upvoted
a
paper
3 months ago
Understanding R1-Zero-Like Training: A Critical Perspective