Xizhou Zhu
Einsiedler
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal
Large Language Models
authored
a paper
5 months ago
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal
Large Language Models
authored
a paper
5 months ago
Dita: Scaling Diffusion Transformer for Generalist
Vision-Language-Action Policy