1 41 258

Xi Yang

ianyeung

IanYeung

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

tencent/Hunyuan-MT-7B

liked a model 4 days ago

laion/CLIP-convnext_xxlarge-laion2B-s34B-b82K-augreg-soup

liked a model 6 days ago

tencent/HunyuanWorld-Voyager

View all activity

Organizations

None yet

upvoted a paper 12 days ago

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Paper • 2508.15774 • Published 18 days ago • 20

upvoted a paper 13 days ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published 14 days ago • 182

upvoted 2 papers 20 days ago

Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models

Paper • 2508.12945 • Published 21 days ago • 12

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Paper • 2508.13009 • Published 21 days ago • 24

upvoted 2 papers 21 days ago

DINOv3

Paper • 2508.10104 • Published 26 days ago • 241

Thyme: Think Beyond Images

Paper • 2508.11630 • Published 24 days ago • 79

upvoted 2 papers 24 days ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6 • 58

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published 25 days ago • 141

upvoted a paper 26 days ago

Matrix-3D: Omnidirectional Explorable 3D World Generation

Paper • 2508.08086 • Published 28 days ago • 70

upvoted a paper 27 days ago

Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control

Paper • 2508.08134 • Published 28 days ago • 9

upvoted a paper 28 days ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published about 1 month ago • 175

upvoted a paper about 1 month ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 242

upvoted a collection about 1 month ago

Hunyuan Dense Model

Collection

19 items • Updated Aug 4 • 11

upvoted a paper 2 months ago

StreamDiT: Real-Time Streaming Text-to-Video Generation

Paper • 2507.03745 • Published Jul 4 • 29

upvoted a collection 2 months ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Jul 1 • 76

upvoted an article 2 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 128

upvoted a paper 3 months ago

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Paper • 2506.17612 • Published Jun 21 • 63

upvoted an article 3 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

and 4 others •

Jun 19

• 86

upvoted 2 papers 3 months ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18 • 65

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation

Paper • 2506.09350 • Published Jun 11 • 48