
Leng Sicong PRO

Sicong

AI & ML interests

None yet

Recent Activity

updated a model about 13 hours ago

Sicong/mmr1_ckpts_0924

published a model about 18 hours ago

Sicong/mmr1_ckpts_0924

updated a model 5 days ago

Sicong/mmr1_mixv7_15k_ep10_0818_wollm_mixture00_rollout8_shuffle


Organizations

upvoted a paper 6 days ago

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Paper • 2509.15212 • Published 6 days ago • 20

upvoted a paper 24 days ago

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Paper • 2508.21112 • Published 27 days ago • 75

upvoted an article about 1 month ago

RynnEC: Bringing MLLMs into Embodied World

Article • and 6 others • Aug 14 • 6

upvoted a paper about 1 month ago

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Paper • 2508.08896 • Published Aug 12 • 10

upvoted 2 articles about 1 month ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Article • and 8 others • Jun 3 • 253

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Article • and 9 others • Aug 11 • 27

upvoted a paper about 2 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30 • 46

upvoted a paper 2 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 131

upvoted 3 papers 4 months ago

Through the Valley: Path to Effective Long CoT Training for Small Language Models

Paper • 2506.07712 • Published Jun 9 • 18

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 113

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Paper • 2505.23604 • Published May 29 • 23

upvoted a collection 4 months ago

Multimodal Reasoning

Collection

120 items • Updated 3 days ago • 29

upvoted a paper 7 months ago

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19 • 28

upvoted 3 papers 8 months ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 51

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 417

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 90

upvoted 2 papers 9 months ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 47

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 106

upvoted a paper 11 months ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45

upvoted a collection 11 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 688
