---
license: mit
language: so
tags:
- tts
- speaker-embedding
- somali
- speechbrain
- vits
- speecht5
library_name: speechbrain
base_model: speechbrain/spkrec-ecapa-voxceleb
pipeline_tag: feature-extraction
---
# Ali Speaker Embedding Dataset
This dataset contains a PyTorch `.pt` file holding a speaker embedding for the Somali male speaker **Ali**.
The embedding was generated using the [`speechbrain/spkrec-ecapa-voxceleb`](https://huggingface.co/speechbrain/spkrec-ecapa-voxceleb) speaker recognition model from over 300 audio clips of the speaker's voice.
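The card does not say how the per-clip embeddings were combined into a single vector. One common approach, assumed here purely for illustration, is to average them and L2-normalize the result. The sketch below uses random tensors as stand-ins for real ECAPA outputs, and `aggregate_embeddings` is a hypothetical helper, not part of SpeechBrain.

```python
import torch
import torch.nn.functional as F


def aggregate_embeddings(per_clip: torch.Tensor) -> torch.Tensor:
    """Average per-clip embeddings of shape (num_clips, dim) into one unit-length vector."""
    if per_clip.dim() != 2:
        raise ValueError(f"expected (num_clips, dim), got {tuple(per_clip.shape)}")
    mean = per_clip.mean(dim=0)      # average over clips -> shape (dim,)
    return F.normalize(mean, dim=0)  # scale to unit L2 norm


# Stand-in data (assumption): 300 random vectors in place of real per-clip embeddings.
torch.manual_seed(0)
fake_clip_embeddings = torch.randn(300, 192)

speaker_embedding = aggregate_embeddings(fake_clip_embeddings)
print(speaker_embedding.shape)  # torch.Size([192])
```

With real audio, the per-clip vectors would come from the speaker recognition model rather than `torch.randn`. Normalizing after averaging keeps later cosine-similarity comparisons independent of the vector's scale.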
## Dataset Details
- **File**: `Ali_speaker_embedding.pt`
- **Format**: PyTorch tensor (`.pt`)
- **Embedding Size**: 192-dimensional
- **Language**: Somali (`so`)
- **Gender**: Male
- **Audio Source**: 300 high-quality `.wav` files from speaker Ali
- **Sample Rate**: 16 kHz
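The details above give the embedding size but not the exact shape of the stored tensor, which could plausibly be `(192,)`, `(1, 192)`, or `(1, 1, 192)`. As a minimal sketch under that assumption, a loaded tensor can be flattened and checked against the listed size. `as_flat_embedding` is a hypothetical helper name, and the example runs on a zero-filled stand-in tensor rather than the real file.

```python
import torch

EXPECTED_DIM = 192  # embedding size listed in the dataset details


def as_flat_embedding(tensor: torch.Tensor, expected_dim: int = EXPECTED_DIM) -> torch.Tensor:
    """Drop singleton dimensions and verify the embedding length."""
    flat = tensor.detach().squeeze()
    if flat.dim() != 1 or flat.shape[0] != expected_dim:
        raise ValueError(
            f"expected a {expected_dim}-dimensional embedding, got shape {tuple(tensor.shape)}"
        )
    return flat.to(torch.float32)


# Stand-in tensor (assumption) shaped like a batched model output: (1, 1, 192).
example = torch.zeros(1, 1, 192)
print(as_flat_embedding(example).shape)  # torch.Size([192])
```

Checking the shape right after loading catches a mismatched file before it reaches a downstream model.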
## Usage Example
```python
import torch
# Load the embedding onto the CPU, regardless of the device it was saved from
embedding = torch.load("Ali_speaker_embedding.pt", map_location="cpu")