Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence β’ 2 items β’ Updated 25 days ago β’ 115
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning Paper β’ 2506.24119 β’ Published Jun 30 β’ 46
view article Article π€ππ¬π₯οΈπ Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other β’ Jun 21 β’ 66
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? Paper β’ 2505.23359 β’ Published May 29 β’ 40
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking β’ 7 items β’ Updated Jul 1 β’ 73
VideoRoPE: What Makes for Good Video Rotary Position Embedding? Paper β’ 2502.05173 β’ Published Feb 7 β’ 65
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper β’ 2501.12599 β’ Published Jan 22 β’ 123
An Empirical Study of Autoregressive Pre-training from Videos Paper β’ 2501.05453 β’ Published Jan 9 β’ 42
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence Paper β’ 2407.16655 β’ Published Jul 23, 2024 β’ 31