---
license: mit
language:
- en
pipeline_tag: robotics
---
villa-X: A Vision-Language-Latent-Action Model
[](https://arxiv.org/abs/2507.23682) [](https://microsoft.github.io/villa-x) [](https://github.com/microsoft/villa-x/)
## How to use
Check out [https://github.com/microsoft/villa-x/](https://github.com/microsoft/villa-x/)