Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
jwergieluk 's Collections
Papers inbox

Papers inbox

updated Feb 12
Upvote
-

  • Training Language Models to Self-Correct via Reinforcement Learning

    Paper • 2409.12917 • Published Sep 19, 2024 • 141

  • Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

    Paper • 2409.12191 • Published Sep 18, 2024 • 78

  • Expect the Unexpected: FailSafe Long Context QA for Finance

    Paper • 2502.06329 • Published Feb 10 • 132

  • Competitive Programming with Large Reasoning Models

    Paper • 2502.06807 • Published Feb 3 • 69

  • Retrieval-augmented Large Language Models for Financial Time Series Forecasting

    Paper • 2502.05878 • Published Feb 9 • 42

  • LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

    Paper • 2502.07374 • Published Feb 11 • 41

  • Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

    Paper • 2502.07640 • Published Feb 11 • 10

  • Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

    Paper • 2502.06703 • Published Feb 10 • 154
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs