Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Matthieu Lin's picture
7 1

Matthieu Lin

Hatm
yifanpu001's profile picture
·
https://linyuhongg.github.io/

AI & ML interests

RLHF

Organizations

None yet

upvoted 2 papers 3 months ago

Magistral

Paper • 2506.10910 • Published Jun 12 • 64

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 180
upvoted 2 papers 4 months ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 82

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 184
upvoted a paper 10 months ago

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Paper • 2411.02359 • Published Nov 4, 2024 • 13
upvoted a paper 11 months ago

LLM-based Optimization of Compound AI Systems: A Survey

Paper • 2410.16392 • Published Oct 21, 2024 • 17
upvoted a paper over 1 year ago

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints

Paper • 2405.19026 • Published May 29, 2024 • 8
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs