A text-to-speech model powered by SparkAudio and Mobvoi.
Gradio demo of CogView4-6B
GPUs go brrrr
Enhance and upscale images with Real-ESRGAN
Upgraded to v1.0!
Transform research papers and mathematical concepts into stu
Vision Transformer Attention Visualization
Image to Video Generation
Implement test-time compute scaling for math problems
Submit Hugging Face model links for quantization requests
Convert images to anime-style bodies and sketches
Complete list of past Daily Papers