469
MeloTTS
🗣
Fast, efficient, & multilingual text-to-speech
Helpful tools for day-to-day uses...
Fast, efficient, & multilingual text-to-speech
Analyze images to generate descriptive prompts
Note Score: 9/10 - Very useful ↳ Analyzes Image into Prompts and keywords (Deep CLIP into styles, medium, artist, etc.)
Transcribe audio to text
Note Score: 9/10 - Very useful ↳ Doesn't use Whisper-Large-V3 - onnx version is loaded instead for lower delay
Create a 1M faces 3D colored model from an image!
Convert GUI screen to structured elements
Try on clothes virtually by uploading images
The most opinionated, anime-themed SDXL model
Segment objects in images using points
Download and process LoRA models from CivitAI
mcp_server & FLUX 4-bit Quantization + Enhanced
State-of-the-art target speech extractor