Changyu Chen's picture

Changyu Chen PRO

Cameron-Chen

·

AI & ML interests

Generative Models, LLMs, Reinforcement Learning.

Recent Activity

updated a dataset 25 days ago

axon-rl/search-eval

published a dataset 25 days ago

axon-rl/search-eval

updated a dataset about 1 month ago

axon-rl/HotpotQA

View all activity

Organizations

upvoted 3 papers 2 months ago

Fostering Video Reasoning via Next-Event Prediction

Paper • 2505.22457 • Published May 28 • 29

Reinforcing General Reasoning without Verifiers

Paper • 2505.21493 • Published May 27 • 26

Lifelong Safety Alignment for Language Models

Paper • 2505.20259 • Published May 26 • 24

upvoted a paper 3 months ago

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 36

upvoted 2 papers 4 months ago

Efficient Process Reward Model Training via Active Learning

Paper • 2504.10559 • Published Apr 14 • 13

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 57

upvoted a collection 8 months ago

🔱 Sailor2 Language Models

Sailing in South-East Asia with Inclusive Multilingual LLMs • 34 items • Updated Jun 4 • 28

upvoted a paper 9 months ago

Sample-Efficient Alignment for LLMs

Paper • 2411.01493 • Published Nov 3, 2024 • 12

upvoted 2 collections about 1 year ago

Gemma 2 Release

15 items • Updated 28 days ago • 222

💡 DICE

Self-alignment with DPO Implicit Rewards • 5 items • Updated Jul 28, 2024 • 9

upvoted 2 papers about 1 year ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 41

Bootstrapping Language Models with DPO Implicit Rewards

Paper • 2406.09760 • Published Jun 14, 2024 • 41