Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kilian303 's Collections
test

test

updated 28 days ago
Upvote
-

  • Running
    347
    347

    Qwen2.5 Omni 7B Demo

    🏆

    Generate text and speech from text, audio, images, and videos


  • Running on Zero
    2.6k
    2.6k

    F5-TTS

    🗣

    F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)


  • Runtime error
    309
    309

    Kokoro TTS Zero

    🎴

    ✨[With v1.0.0] Accelerated TTS on Kokoro-82M


  • fixie-ai/ultravox-v0_5-llama-3_2-1b

    Audio-Text-to-Text • 0.7B • Updated May 6 • 385k • 57

  • Running on Zero
    833
    833

    Sesame CSM

    🌱

    Conversational speech generation


  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10 • 3.21M • • 5.03k

  • OuteAI/Llama-OuteTTS-1.0-1B

    Text-to-Speech • 1B • Updated 7 days ago • 113k • 210

  • suno/bark

    Text-to-Speech • Updated Oct 4, 2023 • 23.9k • 1.42k

  • senstella/csm-expressiva-1b

    Text-to-Speech • Updated Apr 17 • 19 • 33

  • Sleeping

    Sesame AI POC

    ⚡

    Full working POC demonstrating text to speech and speech


  • Running
    37
    37

    Spark-TTS

    ⚡

    (Unofficial) Gradio demo for Spark-TTS


  • Sleeping
    2
    2

    VoiceBloom

    ✨

    Generate audio from text with customizable voice and speed


  • Running on Zero
    828
    828

    TripoSG

    🔮

    Generate 3D models from images


  • Running on Zero
    4.97k
    4.97k

    FLUX.1 [Schnell]

    🏎

    Generate images from text prompts


  • MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

    Paper • 2508.01242 • Published Aug 2 • 9
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs