Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated
a dataset
13 days ago
mehuldamani/synthtool-v1-modified
published
a dataset
13 days ago
mehuldamani/synthtool-v1-modified
updated
a dataset
21 days ago
mehuldamani/grok-4-trial
Organizations
None yet