Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30 • 270
Sadeed: Advancing Arabic Diacritization Through Small Language Model Paper • 2504.21635 • Published Apr 30 • 59
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 73
Agent models: Internalizing Chain-of-Action Generation into Reasoning models Paper • 2503.06580 • Published Mar 9 • 19
Adding Conditional Control to Text-to-Image Diffusion Models Paper • 2302.05543 • Published Feb 10, 2023 • 55