Skip to content

Commit

Permalink
Update docs
Browse files Browse the repository at this point in the history
  • Loading branch information
wasertech committed Jun 4, 2022
1 parent 74a0c6a commit b1c5249
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions STT/CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,7 @@ Some parameters for the model itself:
- `duplicate_sentence_count` to control if Common Voice dataset might need
to be regenerated with more duplicated allowed using Corpora Creator
**USE WITH CAUTION**
- `enable_augments` to help the model to better genralise on noisy data by augmenting the data in various ways.
- `cv_personal_first_url` to download only your own voice instead of all Common Voice dataset (first url)
- `cv_personal_second_url` to download only your own voice instead of all Common Voice dataset (second url)

Expand Down Expand Up @@ -84,6 +85,8 @@ files, with proper `checkpoint` descriptor as TensorFlow produces.
To use an existing checkpoint, just ensure the `docker run` includes a mount such as:
`type=bind,src=PATH/TO/CHECKPOINTS,dst=/transfer-checkpoint`. Upon running, the checkpoints will be automatically used as starting point.

Checkpoints don't typically use automatic mixed precision nor fully-connected layer normalization and mostly use a standard number of hidden layers (2048 unless specified otherwise). So don't change those parameters to fine-tune from them.

## Hardware

Training successfull on:
Expand Down

0 comments on commit b1c5249

Please sign in to comment.