Skip to content

DeepSpeech 0.7.x

Compare
Choose a tag to compare
@DewiBrynJones DewiBrynJones released this 18 Jun 10:45
· 14 commits to master since this release

Dyma'r sgriptiau a ddefnyddiwyd i hyfforddi a gwerthuso modelau DeepSpeech Mozilla a ddefnyddir ar gyfer adnabod lleferydd Cymraeg o fewn y rhaglenni cod agored Macsen a'r Trawsgrifiwr Cymraeg ym mis Mehefin 2020 (20.06) ac i gydfynd â DeepSpeech fersiwn 0.7.x

These are the scripts used to train the Mozilla DeepSpeech models used for Welsh language speech recognition in the open source Macsen - Welsh Language Voice Assistant App and Welsh Transcriber applications in June 2020 (20.06) and to coincide with DeepSpeech 0.7.x

Mae'r sgriptiau yn defnyddio data CommonVoice Mozilla Cymraeg a chyhoeddwyd ym mis Rhagfyr 2019 sydd yn cynnwys 77 awr o recordiadau sain. Er bod maint y data Cymraeg yn CommonVoice yn annigonol ar gyfer hyfforddi adnabod lleferydd Cymraeg cyflawn, mae'r sgriptiau yn llwyddo i ddarparu modelau defnyddiol ac ymarferol drwy ddefnyddio dulliau dysgu trosglwyddol i addasu modelau Saesneg DeepSpeech gyda data Cymraeg. Mewn arbrofion syml a chynhelir gan y sgriptiau, mae’r model trawsgrifio yn camddeall tua 27% o eiriau mewn brawddeg, tra bo'r modelau ar gyfer cynorthwyydd Macsen yn camddeall 14%.

The scripts use Welsh language data from Mozilla CommonVoice that was published in December 2019 and contains 77 hours of speech recordings. Although the amount of Welsh data in CommonVoice is insufficient for training a complete Welsh language speech recognition engine, these scripts still succeed in providing a practical capability by using transfer learning methods to adapt DeepSpeech project's English models with Welsh data. In simple evaluations that can be repeated with these scripts, the models for transcribing exhibit a word error rate of 27%, while our voice assistant Macsen models have an error rate of 14%.

Modelau / Models

Mae'r modelau sydd wedi eu hyfforddi gyda'r sgriptiau hyn ac ar gael isod, sef / The trained models with these scripts are available below are

Enghreifftiau Defnydd / Example Usage

Mae enghraifft o sut ddefnyddir y ffeiliau modelau o fenw cynnyrch feddalwedd yn ein projectau Trawsgrifiwr.
Our Trawsgrifiwr (Transcriber) provides an example of how the models can be used in a software project

https://github.com/techiaith/trawsgrifiwr-windows/blob/v1.0.3/DeepSpeechLib/DeepSpeechTranscriber.cs

Yn ogystal a'n project gwasanaeth API adnabod lleferydd ar-lein
As well as our online speech recognition API project

https://github.com/techiaith/docker-deepspeech-cy-server/blob/v20.06/server/wsgi.py

Mae rhagor o enghreifftiau ar gael o ddogfennaeth Mozilla / Further examples are available from the Mozilla website

Python: https://deepspeech.readthedocs.io/en/latest/Python-Examples.html

C/C++: https://deepspeech.readthedocs.io/en/latest/C-Examples.html

Diolch / Thank you

Rydym yn ddiolchgar iawn i Rhoslyn Prys (meddal.com) a ymgymerodd â nifer o ymgyrchoedd torfoli ar sail wirfoddol, i'r Mentrau Iaith, Cyngor Gwynedd, Llyfrgell Genedlaethol Cymru a weithiodd gyda Rhoslyn ar rai o'r ymgyrchoedd hyn, hefyd i Lywodraeth Cymru, ac i liaws o gyfranogwyr ar draws Cymru a thu hwnt sydd wedi cyfrannu eu lleisiau i Common Voice Cymraeg.

We are very grateful to Rhoslyn Prys (meddal.com) who undertook on a voluntary basis a number of crowdsourcing campaigns, to the Mentrau Iaith, Gwynedd Council, the National Library of Wales who worked with Rhoslyn on some of these campaigns, to the Welsh Government, and to the many participants across Wales and beyond who have contributed their voices to the Welsh Common Voice datasets.