Replies: 2 comments
-
>>> shahdloo |
Beta Was this translation helpful? Give feedback.
0 replies
-
>>> reuben |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
>>> shahdloo
[July 14, 2018, 12:33pm]
In the documentation for ''audiofile_to_input_vector'' function it reads
that ''MFCC features
at every 0.01s time step with a window length of 0.025s'' are
calculated. I tried to confirm this statement.
I have a 16kHz wav file containing 9631014 samples. the MFCC features I
get from the ''audiofile_to_input_vector'' function have dimension
30097 /*494 which I read as [9631014/320 ]
[26+2 / *26 / *9 /].
I conclude that 494 MFCC features are extracted for every 320 samples
which results in 0.02s time steps. Is my reasoning correct? So is this
really 0.02s time step instead of 0.01s?
[This is an archived TTS discussion thread from discourse.mozilla.org/t/mfcc-feature-dimensions]
Beta Was this translation helpful? Give feedback.
All reactions