adnaan525's Collections
To read

updated Jun 24

  • Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

    Paper • 2503.05179 • Published Mar 7 • 47

  • SafeArena: Evaluating the Safety of Autonomous Web Agents

    Paper • 2503.04957 • Published Mar 6 • 21

  • Learning from Failures in Multi-Attempt Reinforcement Learning

    Paper • 2503.04808 • Published Mar 4 • 18

  • START: Self-taught Reasoner with Tools

    Paper • 2503.04625 • Published Mar 6 • 114

  • LLM as a Broken Telephone: Iterative Generation Distorts Information

    Paper • 2502.20258 • Published Feb 27 • 27

  • How to Steer LLM Latents for Hallucination Detection?

    Paper • 2503.01917 • Published Mar 1 • 11

  • Identifying Sensitive Weights via Post-quantization Integral

    Paper • 2503.01901 • Published Feb 28 • 8

  • Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

    Paper • 2506.14245 • Published Jun 17 • 42