Rationale-aided Efficient 7B size Large Language and Vision Models. Let's enjoy it!
Byung-Kwan Lee
BK-Lee
AI & ML interests
Vision Language Models
Recent Activity
upvoted a paper about 13 hours ago
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
upvoted a paper about 13 hours ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
upvoted a paper about 13 hours ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey