--- license: mit base_model: - nari-labs/Dia-1.6B pipeline_tag: text-to-speech --- ## Purpose The purpose of this repository is to store various [TTS.cpp](https://github.com/mmwillet/TTS.cpp) compatible GGUF encoded model files for the [Dia model](https://github.com/nari-labs/dia). ### Model Types Currently the model is supported with 4-bit, 5-bit, 8-bit, F16bit and F32bit quantization / precision and all modes are supported with F16 and F32 bit precision DAC. `Dia.gguf` is the non-quantized 32 bit floating point version, `Dia_Q4.gguf`, `Dia_Q5.gguf`, `Dia_Q8.gguf` and `Dia_F16.gguf` are the 4bit, 5bit, 8bit and 16bit quantized versions respectively, and all versions with the prefix `_DAC_F16.gguf` are encoded with a 16bit version of the DAC audio encoder. ## Dia This page only contains the GGUF encoded model files of the original Dia model. For the original model please see the repository [here](https://github.com/nari-labs/dia). ## How to use See the github repo [here](https://github.com/mmwillet/TTS.cpp) for more information general usage. To compile TTS.cpp simple git clone and then run the the following in the repository's directory to compile (cmake is required): ```bash cmake -B build cmake --build build --config Release ``` After compilation is complete you can download a model file generate speech to a file from the same directory like so: ```bash build/bin/tts-cli --model-path /model/path/to/downloaded_gguf_file.gguf --prompt "I am saying some words" --save-path /tmp/test.wav ```