awesome-speech

A curated collection of papers, notes, and code for speech recognition, speech synthesis, signal processing...

Challenge for accuracy

Deep Speech: Scaling up end-to-end speech recognition https://arxiv.org/pdf/1412.5567v2.pdf

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin https://arxiv.org/pdf/1512.02595v1.pdf

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/pdf/1904.08779v3.pdf

English Conversational Telephone Speech Recognition by Humans and Machines https://arxiv.org/pdf/1703.02136v1.pdf

State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention with Dilated 1D Convolutions https://arxiv.org/pdf/1910.00716v1.pdf

Purely sequence-trained neural networks for ASR based on lattice-free MMI https://www.danielpovey.com/files/2016_interspeech_mmi.pdf
