nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated 23 days ago • 7.59k • 215
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published 14 days ago • 242
GPT-OSS-120B on AMD MI300X 💻 Space • Running on CPU Upgrade • 270 • gpt-oss-120b + web browsing + reasoning on AMD MI300X GPUs
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 298