---
license: mit
language:
- en
pipeline_tag: robotics
---

# villa-X: A Vision-Language-Latent-Action Model

[![arXiv](https://img.shields.io/badge/arXiv-Paper-red?logo=arxiv&logoColor=white)](https://arxiv.org/abs/2507.23682)   [![Project](https://img.shields.io/badge/Project-Page-blue?logo=homepage&logoColor=white)](https://microsoft.github.io/villa-x)   [![Code](https://img.shields.io/badge/GitHub-Code-blue?logo=github&logoColor=white)](https://github.com/microsoft/villa-x/)
## How to use

Check out [https://github.com/microsoft/villa-x/](https://github.com/microsoft/villa-x/)