
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition
•
6B
•
Updated
•
367k
•
1.48k
Try on clothes virtually by uploading images
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
View AI model releases for 2024