Update README.md
README.md
CHANGED
@@ -19,20 +19,21 @@ Fine-tuning was done using the Hugging Face Transformers library with a modified
 
 | Training data | Hours |
 |:---|---:|
-| Latvian
-| LATE-Media train set | 42 |
+| Latvian Common Voice 17.0 train set (the [V1 split](https://analyzer.cv-toolbox.web.tr/examine/lv/17.0)) | 167 |
+| LATE-Media 1.0 train set | 42 |
 | Total | 209 |
 
 ## Evaluation
 
-The model is evaluated on both a Latvian CV 17.0 test set (the [V1 split](https://analyzer.cv-toolbox.web.tr/examine/lv/17.0)) and a [LATE-Media test set](http://hdl.handle.net/20.500.12574/99).
-
 | Testing data | WER | CER |
 |:---|---:|---:|
-| Latvian
-| Latvian
-| LATE-Media 1.0 - formatted | 20.8 | 8.2 |
-| LATE-Media 1.0 - normalized | 14.1 | 5.9 |
+| Latvian Common Voice 17.0 test set (V1) - formatted | 5.0 | 1.6 |
+| Latvian Common Voice 17.0 test set (V1) - normalized | 3.4 | 1.0 |
+| LATE-Media 1.0 test set - formatted | 20.8 | 8.2 |
+| LATE-Media 1.0 test set - normalized | 14.1 | 5.9 |
+
+The Latvian CV 17.0 test set is available [here](https://analyzer.cv-toolbox.web.tr/examine/lv/17.0).
+The LATE-Media 1.0 test set is available [here](http://hdl.handle.net/20.500.12574/99).
 
 ## Citation
 
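The evaluation table in this change reports WER and CER for both "formatted" and "normalized" text. The README does not describe how the scores were computed or what the normalization does, so the following is only a minimal sketch of the general idea: word and character error rates as edit distance divided by reference length, scored once on raw text and once after a hypothetical normalization (lowercasing and stripping ASCII punctuation). The `normalize` function and the sample sentences are assumptions for illustration, not the authors' evaluation pipeline.

```python
import string


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of words)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def normalize(text):
    # Hypothetical normalization (an assumption, not taken from the README):
    # lowercase, drop ASCII punctuation, collapse whitespace.
    table = str.maketrans("", "", string.punctuation)
    return " ".join(text.lower().translate(table).split())


def wer(ref, hyp):
    """Word error rate in percent; assumes a non-empty reference."""
    ref_words = ref.split()
    return 100.0 * edit_distance(ref_words, hyp.split()) / len(ref_words)


def cer(ref, hyp):
    """Character error rate in percent; assumes a non-empty reference."""
    return 100.0 * edit_distance(ref, hyp) / len(ref)


# Made-up example pair: the hypothesis differs only in casing and punctuation.
reference = "Labdien, pasaule!"
hypothesis = "labdien pasaule"

formatted_wer = wer(reference, hypothesis)
normalized_wer = wer(normalize(reference), normalize(hypothesis))
print(f"formatted WER:  {formatted_wer:.1f}")
print(f"normalized WER: {normalized_wer:.1f}")
```

In this toy pair every word differs only by casing or punctuation, so the formatted score counts each word as an error while the normalized score counts none. This is one plausible reason why the normalized rows in the table are lower than the formatted ones.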