
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
β’
0.7B
β’
Updated
β’
24.5k
β’
1.52k
Generate MIDI music from prompts
Segment and track objects in a video
Demo for multimodal understanding and generation