Update docs

wasertech · Jun 4, 2022 · b1c5249 · b1c5249
1 parent 74a0c6a
commit b1c5249
Showing 1 changed file with 3 additions and 0 deletions.
diff --git a/STT/CONTRIBUTING.md b/STT/CONTRIBUTING.md
@@ -45,6 +45,7 @@ Some parameters for the model itself:
  - `duplicate_sentence_count` to control if Common Voice dataset might need
     to be regenerated with more duplicated allowed using Corpora Creator
     **USE WITH CAUTION**
+ - `enable_augments` to help the model to better genralise on noisy data by augmenting the data in various ways.
  - `cv_personal_first_url` to download only your own voice instead of all Common Voice dataset (first url)
  - `cv_personal_second_url` to download only your own voice instead of all Common Voice dataset (second url)
 
@@ -84,6 +85,8 @@ files, with proper `checkpoint` descriptor as TensorFlow produces.
 To use an existing checkpoint, just ensure the `docker run` includes a mount such as:
 `type=bind,src=PATH/TO/CHECKPOINTS,dst=/transfer-checkpoint`. Upon running, the checkpoints will be automatically used as starting point.
 
+Checkpoints don't typically use automatic mixed precision nor fully-connected layer normalization and mostly use a standard number of hidden layers (2048 unless specified otherwise). So don't change those parameters to fine-tune from them.
+
 ## Hardware
 
 Training successfull on: