Article Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation By cbensimon and 3 others • about 20 hours ago • 25
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling Paper • 2508.16790 • Published 11 days ago • 7
STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models Paper • 2507.15375 • Published Jul 21 • 26
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching Paper • 2507.09318 • Published Jul 12 • 1
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning Paper • 2508.18966 • Published 7 days ago • 52
Representing Speech Through Autoregressive Prediction of Cochlear Tokens Paper • 2508.11598 • Published 18 days ago • 17
Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 29 days ago • 483
Article Vision Language Model Alignment in TRL ⚡️ By sergiopaniego and 4 others • 27 days ago • 75
Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • 26 days ago • 56
Article Introducing AI Sheets: a tool to work with datasets using open AI models! By dvilasuero and 5 others • 26 days ago • 77
Article TextQuests: How Good are LLMs at Text-Based Video Games? By justinphan3110 and 1 other • 22 days ago • 27
MiDashengLM: Efficient Audio Understanding with General Audio Captions Paper • 2508.03983 • Published 28 days ago • 8
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder Paper • 2505.07916 • Published May 12 • 132