wangrui's picture

wangrui

varuy322

·

varuy322

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

upvoted a paper 9 days ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

liked a dataset 10 days ago

jupyter-agent/jupyter-agent-dataset

View all activity

Organizations

None yet

liked a model about 3 hours ago

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated 1 day ago • 1.03k • 304

upvoted a paper 9 days ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published 21 days ago • 88

liked a dataset 10 days ago

jupyter-agent/jupyter-agent-dataset

Viewer • Updated 8 days ago • 95.8k • 5.19k • 138

liked a model 10 days ago

google/embeddinggemma-300m

Sentence Similarity • 0.3B • Updated 8 days ago • 204k • • 827

upvoted an article 12 days ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

By

and 6 others •

May 21

• 214

upvoted a paper 12 days ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published 16 days ago • 116

upvoted a collection 13 days ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 20 days ago • 92

liked a dataset 13 days ago

HuggingFaceM4/FineVision

Viewer • Updated 14 days ago • 24.2M • 231k • 329

upvoted a paper 17 days ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 185

upvoted a collection 24 days ago

Seed-X

A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated 27 days ago • 65

liked a model 27 days ago

internlm/Intern-S1-mini

Image-Text-to-Text • 9B • Updated 24 days ago • 8.42k • 96

upvoted a paper 27 days ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published 28 days ago • 246

upvoted a collection 27 days ago

Intern-S1

7 items • Updated 27 days ago • 23

liked 2 datasets about 1 month ago

nvidia/Nemotron-CC-v2

Viewer • Updated about 20 hours ago • 5.81B • 96.3k • 75

tokyotech-llm/swallow-code

Viewer • Updated Jul 4 • 129M • 1.74k • 54

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24 • 603k • • 1.34k

HuggingFaceTB/SmolLM2-135M-Instruct

Text Generation • 0.1B • Updated Apr 21 • 381k • 242

liked 2 datasets about 1 month ago

Jackrong/GPT-OSS-20B-Distilled-Reasoning-Mini

Viewer • Updated Aug 11 • 1.96k • 521 • 16

LLM360/guru-RL-92k

Viewer • Updated 29 days ago • 91.9k • 1.67k • 27

upvoted a collection about 1 month ago

agent

195 items • Updated about 9 hours ago • 12