Yuying Ge

tttoaster

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

updated a model 27 days ago

TencentARC/ARC-Hunyuan-Video-7B

updated a dataset 28 days ago

TencentARC/ShortVid-Bench

upvoted a paper 4 days ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published 6 days ago • 18

upvoted a paper 2 months ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published Jun 19 • 27

upvoted 3 papers 3 months ago

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5 • 27

AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation

Paper • 2506.03126 • Published Jun 3 • 22

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Paper • 2505.21374 • Published May 27 • 27

upvoted 5 papers 5 months ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1 • 71

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 39

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Paper • 2503.19480 • Published Mar 25 • 16

Long-Context Autoregressive Video Modeling with Next-Frame Prediction

Paper • 2503.19325 • Published Mar 25 • 73

upvoted a paper 7 months ago

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published Jan 22 • 72

upvoted a collection 9 months ago

DC-AE

Deep Compression Autoencoder • 18 items • Updated May 30 • 19

upvoted 2 papers 9 months ago

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published Dec 5, 2024 • 16

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 23