Commit

upload fixed samples (#22)
seungwonpark committed Nov 11, 2019
1 parent 7541497 commit c26733f
Showing 19 changed files with 11 additions and 15 deletions.
Binary file added audios/LJ001-0005.mel
Binary file not shown.
Binary file modified audios/LJ001-0005_reconstructed_epoch0400.wav
Binary file not shown.
Binary file modified audios/LJ001-0005_reconstructed_epoch0800.wav
Binary file not shown.
Binary file removed audios/LJ001-0005_reconstructed_epoch1350.wav
Binary file not shown.
Binary file modified audios/LJ001-0005_reconstructed_epoch1600.wav
Binary file not shown.
Binary file modified audios/LJ001-0005_reconstructed_epoch3200.wav
Binary file not shown.
Binary file added audios/LJ001-0015.mel
Binary file not shown.
Binary file modified audios/LJ001-0015_reconstructed_epoch0400.wav
Binary file not shown.
Binary file modified audios/LJ001-0015_reconstructed_epoch0800.wav
Binary file not shown.
Binary file removed audios/LJ001-0015_reconstructed_epoch1350.wav
Binary file not shown.
Binary file modified audios/LJ001-0015_reconstructed_epoch1600.wav
Binary file not shown.
Binary file modified audios/LJ001-0015_reconstructed_epoch3200.wav
Binary file not shown.
Binary file added audios/LJ014-0285.mel
Binary file not shown.
Binary file modified audios/LJ014-0285_reconstructed_epoch0400.wav
Binary file not shown.
Binary file modified audios/LJ014-0285_reconstructed_epoch0800.wav
Binary file not shown.
Binary file removed audios/LJ014-0285_reconstructed_epoch1350.wav
Binary file not shown.
Binary file modified audios/LJ014-0285_reconstructed_epoch1600.wav
Binary file not shown.
Binary file modified audios/LJ014-0285_reconstructed_epoch3200.wav
Binary file not shown.
26 changes: 11 additions & 15 deletions index.html
@@ -8,19 +8,24 @@

<body>
<div>
-<h2>Audio samples from <a href="https://arxiv.org/abs/1910.06711">MelGAN</a> vocoder (Training in progress)</h2>
-<p><i>Disclaimer: This is a third-party implementation. The original authors stated that they will be releasing their code soon.</i></p>
-<p><b>Implementation GitHub repo link:</b> <a href="https://github.com/seungwonpark/melgan">https://github.com/seungwonpark/melgan</a></p>
-<p>Audio samples from original authors: <a href="https://melgan-neurips.github.io/">https://melgan-neurips.github.io/</a></p>
-<br>
+<h2>Audio samples from <a href="https://arxiv.org/abs/1910.06711">MelGAN</a> vocoder</h2>
+<p><i>Disclaimer: This is a third-party implementation.</i></p>
+<ul>
+<li><b>Implementation GitHub repo:</b> <a href="https://github.com/seungwonpark/melgan">https://github.com/seungwonpark/melgan</a></li>
+<ul>
+<li>Contains pretrained model compatible with <a href="https://github.com/NVIDIA/tacotron2">NVIDIA/tacotron2</a> on LJSpeech-1.1</li>
+</ul>
+<li>Audio samples from original authors: <a href="https://melgan-neurips.github.io/">https://melgan-neurips.github.io/</a></li>
+<li>Official implementation: <a href="https://github.com/descriptinc/melgan-neurips">https://github.com/descriptinc/melgan-neurips</a></li>
+</ul>

<p>In summary, MelGAN can convert mel-spectrograms into raw audio at real-time on CPU,
and it generalizes to unseen speakers with significantly fewer parameters than previous state-of-the-art,
<a href="https://arxiv.org/abs/1811.00002">WaveGlow</a>.
</p>

<h2>LJSpeech-1.1</h2>
-<p>All audios below are unseen during training. We split LJSpeech-1.1 into 9:1 for train/validation. </p>
+<p>All audios below are unseen during training. We split LJSpeech-1.1 into 9:1 for train/validation. (Files with suffix "*5.wav" are for validation) </p>
<table>
<thead>
<tr>
@@ -58,15 +63,6 @@ <h2>LJSpeech-1.1</h2>
<td><audio controls class="audio-player" preload="metadata">
<source src="audios/LJ014-0285_reconstructed_epoch0800.wav" type="audio/wav"></audio></td>
</tr>
-<tr>
-<td>Epoch 1350</td>
-<td><audio controls class="audio-player" preload="metadata">
-<source src="audios/LJ001-0005_reconstructed_epoch1350.wav" type="audio/wav"></audio></td>
-<td><audio controls class="audio-player" preload="metadata">
-<source src="audios/LJ001-0015_reconstructed_epoch1350.wav" type="audio/wav"></audio></td>
-<td><audio controls class="audio-player" preload="metadata">
-<source src="audios/LJ014-0285_reconstructed_epoch1350.wav" type="audio/wav"></audio></td>
-</tr>
<tr>
<td>Epoch 1600</td>
<td><audio controls class="audio-player" preload="metadata">
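Note on the split described in index.html: the page states that LJSpeech-1.1 is split 9:1 and that files matching "*5.wav" are used for validation. The sketch below shows one way such a rule could be applied, assuming it means "the file stem ends in the digit 5". The function name `split_by_suffix` and its `val_suffix` parameter are hypothetical and are not taken from the repository; the actual split script is not part of this commit.

```python
from pathlib import PurePosixPath


def split_by_suffix(filenames, val_suffix="5"):
    """Partition file names into (train, validation) lists.

    Assumption: a file belongs to validation when its stem (the name
    without directory and extension) ends with `val_suffix`.
    """
    train, val = [], []
    for name in filenames:
        stem = PurePosixPath(name).stem  # "audios/LJ001-0005.wav" -> "LJ001-0005"
        (val if stem.endswith(val_suffix) else train).append(name)
    return train, val


if __name__ == "__main__":
    # Twenty made-up LJSpeech-style names: LJ001-0001.wav .. LJ001-0020.wav
    names = [f"LJ001-{i:04d}.wav" for i in range(1, 21)]
    train, val = split_by_suffix(names)
    print(len(train), len(val))
```

Because utterance IDs ending in 5 make up roughly one in ten of consecutively numbered files, this rule gives an approximately 9:1 split. It is also consistent with the three samples in this commit (LJ001-0005, LJ001-0015, LJ014-0285), whose IDs all end in 5.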