The official datasets and model checkpoints of ARPO
KABI
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
collection
1 day ago
RL+reason model
upvoted
a
paper
4 days ago
RecGPT Technical Report
upvoted
a
paper
5 days ago
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving