Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhiwei He's picture
7 9 36

Zhiwei He

zwhe99
Joelzhang's profile picture LiteMind's profile picture TristanDonze's profile picture
·
https://zwhe99.github.io/
  • zwhe99
  • zwhe99

AI & ML interests

Natural Language Processing

Recent Activity

updated a model about 1 month ago
zwhe99/OctoThinker-3B-Long-Base-orz
published a model about 1 month ago
zwhe99/OctoThinker-3B-Long-Base-orz
liked a dataset about 1 month ago
nvidia/OpenCodeReasoning-2
View all activity

Organizations

None yet

authored a paper 7 months ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 62
authored a paper 8 months ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 42
authored a paper 9 months ago

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

Paper • 2411.18462 • Published Nov 27, 2024 • 6
authored a paper almost 2 years ago

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

Paper • 2203.08394 • Published Mar 16, 2022
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs