Collections
Discover the best community collections!
Collections including paper arxiv:2302.05543
-
coqui/XTTS-v2
Text-to-Speech • Updated • 5.05M • 3.04k -
deepseek-ai/DeepSeek-V3-0324
Text Generation • 685B • Updated • 230k • • 3.06k -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.52M • • 4.93k -
Distilling an End-to-End Voice Assistant Without Instruction Training Data
Paper • 2410.02678 • Published • 23
-
Adding Conditional Control to Text-to-Image Diffusion Models
Paper • 2302.05543 • Published • 57 -
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Paper • 2308.06721 • Published • 33 -
High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published • 14
-
coqui/XTTS-v2
Text-to-Speech • Updated • 5.05M • 3.04k -
deepseek-ai/DeepSeek-V3-0324
Text Generation • 685B • Updated • 230k • • 3.06k -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.52M • • 4.93k -
Distilling an End-to-End Voice Assistant Without Instruction Training Data
Paper • 2410.02678 • Published • 23
-
Adding Conditional Control to Text-to-Image Diffusion Models
Paper • 2302.05543 • Published • 57 -
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Paper • 2308.06721 • Published • 33 -
High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published • 14