Hiroshi Yoshihara
RabotniKuma
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy
with SFT and Efficiency with Reinforcement Learning
liked
a model
about 1 month ago
openai/gpt-oss-20b
upvoted
a
paper
about 2 months ago
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive
Branching Tree Search