Skip to content

Commit

Permalink
Add Welsh from Dewi Byrn Jones
Browse files Browse the repository at this point in the history
  • Loading branch information
JRMeyer committed Apr 3, 2021
1 parent 0b1dc36 commit 036adee
Show file tree
Hide file tree
Showing 13 changed files with 33 additions and 21 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

This repository tracks releases of open models for 🐸STT.


| Language | Creator | Release |
|----------|---------|---------|
| [Ukrainian](https://en.wikipedia.org/wiki/Ukrainian_language) | [Yurii Paniv](https://github.com/robinhad/) | [`0.4`](https://github.com/coqui-ai/STT-models/releases/ukrainian/robinhad/0.4) |
| [Ukrainian](https://en.wikipedia.org/wiki/Ukrainian_language) | [Yurii Paniv](https://github.com/robinhad/) | [`v0.4`](https://github.com/coqui-ai/STT-models/releases/tag/ukrainian/robinhad/v0.4) |
| [Welsh](https://en.wikipedia.org/wiki/Welsh_language) | [Dewi Bryn Jones](https://github.com/dewibrynjones/) | [`v21.03`](https://github.com/coqui-ai/STT-models/releases/tag/welsh/techiaith/v21.03) |
3 changes: 2 additions & 1 deletion catalan/ccoreilly/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Github](https://github.com/ccoreilly/deepspeech-catala/releases/tag/0.14.0) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.14.0`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [deepspeech-catala](https://github.com/ccoreilly/deepspeech-catala)
- License: MIT
- Citation details: `@misc{catalan-ccoreilly,
Expand Down Expand Up @@ -82,7 +83,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion french/commonvoice-fr/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Github](https://github.com/common-voice/commonvoice-fr/releases/tag/fr-v0.6) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.6`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [commonvoice-fr](https://github.com/common-voice/commonvoice-fr)
- License: MPL 2.0
- Citation details: `@misc{commonvoice-fr,
Expand Down Expand Up @@ -101,7 +102,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion french/jaco-polyglot/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Gitlab](https://gitlab.com/Jaco-Assistant/Scribosermo) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [scribosermo](https://gitlab.com/Jaco-Assistant/Scribosermo/-/tree/master/#old-experiments)
- License: GNU Lesser General Public License
- Citation details: `@misc{french-jaco,
Expand Down Expand Up @@ -80,7 +81,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
17 changes: 9 additions & 8 deletions german/aashishag/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,16 +17,17 @@ Jump to section:
- Model date: Accessed from [deepspeech-german](https://github.com/AASHISHAG/deepspeech-german) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.9.0`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [deepspeech-german](https://github.com/AASHISHAG/deepspeech-german)
- License: Apache 2.0
- Citation details: `@inproceedings{agarwal-zesch-2019-german,
author = "Aashish Agarwal and Torsten Zesch",
title = "German End-to-end Speech Recognition based on DeepSpeech",
booktitle = "Preliminary proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019): Long Papers",
year = "2019",
address = "Erlangen, Germany",
publisher = "German Society for Computational Linguistics \& Language Technology",
pages = "111--119"
author = "Aashish Agarwal and Torsten Zesch",
title = "German End-to-end Speech Recognition based on DeepSpeech",
booktitle = "Preliminary proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019): Long Papers",
year = "2019",
address = "Erlangen, Germany",
publisher = "German Society for Computational Linguistics \& Language Technology",
pages = "111--119"
}`
- Where to send questions or comments about the model: You can leave an issue on [`STT-model` issues](https://github.com/coqui-ai/STT-models/issues), open a new discussion on [`STT-model` discussions](https://github.com/coqui-ai/STT-models/discussions), or chat with us on [Gitter](https://gitter.im/coqui-ai/).

Expand Down Expand Up @@ -77,7 +78,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion german/jaco-polyglot/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Gitlab](https://gitlab.com/Jaco-Assistant/Scribosermo) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [scribosermo](https://gitlab.com/Jaco-Assistant/Scribosermo/-/tree/master/#old-experiments)
- License: GNU Lesser General Public License
- Citation details: `@misc{german-jaco,
Expand Down Expand Up @@ -82,7 +83,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion italian/jaco-polyglot/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Gitlab](https://gitlab.com/Jaco-Assistant/Scribosermo) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [scribosermo](https://gitlab.com/Jaco-Assistant/Scribosermo/-/tree/master/#old-experiments)
- License: GNU Lesser General Public License
- Citation details: `@misc{italian-jaco,
Expand Down Expand Up @@ -80,7 +81,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion kinyarwanda/digital-umuganda/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Github](https://github.com/Digital-Umuganda/Deepspeech-Kinyarwanda/tree/master/jan-8-2021-best-kinya-deepspeech) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [deepspeech-kinyarwanda](https://github.com/Digital-Umuganda/Deepspeech-Kinyarwanda)
- License: MPL 2.0
- Citation details: `@misc{deepspeech-kinyarwanda,
Expand Down Expand Up @@ -78,7 +79,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion komi/itml/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- License: AGPL
- Citation details: `@inproceedings{hjortnaes-etal-2020-towards,
title = "Towards a Speech Recognizer for {K}omi, an Endangered and Low-Resource Uralic Language",
Expand Down Expand Up @@ -82,7 +83,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion polish/jaco-polyglot/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Gitlab](https://gitlab.com/Jaco-Assistant/Scribosermo) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [scribosermo](https://gitlab.com/Jaco-Assistant/Scribosermo/-/tree/master/#old-experiments)
- License: GNU Lesser General Public License
- Citation details: `@misc{polish-jaco,
Expand Down Expand Up @@ -80,7 +81,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion spanish/jaco-polyglot/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Gitlab](https://gitlab.com/Jaco-Assistant/Scribosermo) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.0.1`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [scribosermo](https://gitlab.com/Jaco-Assistant/Scribosermo/-/tree/master/#old-experiments)
- License: GNU Lesser General Public License
- Citation details: `@misc{spanish-jaco,
Expand Down Expand Up @@ -80,7 +81,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion ukrainian/robinhad/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Github](https://github.com/robinhad/voice-recognition-ua/releases/tag/v0.4) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v0.4`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [voice-recognition-ua](https://github.com/robinhad/voice-recognition-ua)
- License: CC BY-NC 4.0
- Citation details: `@misc{ukrainian-stt-paniv,
Expand Down Expand Up @@ -80,7 +81,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down
3 changes: 2 additions & 1 deletion welsh/techiaith/MODEL_CARD.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@ Jump to section:
- Model date: Accessed from [Github](https://github.com/techiaith/docker-deepspeech-cy/releases/tag/21.03) on March 31, 2021
- Model type: `Speech-to-Text`
- Model version: `v21.03`
- Compatible with 🐸 STT version: `v0.9.3`
- Code: [docker-deepspeech-cy](https://github.com/techiaith/docker-deepspeech-cy)
- License: MIT
- Citation details: `@misc{welsh-stt-dewibrynjones,
Expand Down Expand Up @@ -75,7 +76,7 @@ Deploying a Speech-to-Text model into any production setting has ethical implica

You should assume every machine learning model has demographic bias unless proven otherwise. For STT models, it is often the case that transcription accuracy is better for men than it is for women. If you are using this model in production, you should acknowledge this as a potential issue.

### Surveillence
### Surveillance

Speech-to-Text may be mis-used to invade the privacy of others by recording and mining information from private conversations. This kind of individual privacy is protected by law in may countries. You should not assume consent to record and analyze private speech.

Expand Down

0 comments on commit 036adee

Please sign in to comment.