Skip to content

Latest commit

 

History

History
205 lines (183 loc) · 11.4 KB

index.md

File metadata and controls

205 lines (183 loc) · 11.4 KB

{:.no_toc}

  • toc {:toc}

Multi-speaker modeling

<thead>
<th style="text-align: center">Utterance</th>
<th style="text-align: center">Recording</th>
<th style="text-align: center">WaveRNN</th>
<th style="text-align: center">SC-WaveRNN</th>
<th style="text-align: center">Parallel WaveGAN</th>
<th style="text-align: center">MelGAN</th>
<th style="text-align: center">Multi-Singer</th>
</thead>
<tbody>
    <tr>
        <th>#1</th>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/GT/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/WaveRNN/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/SC_WaveRNN/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/ParallWaveGAN/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/MelGAN/10.wav" type="audio/wav"></audio></td>
        <<td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/FMSing_total/10.wav" type="audio/wav"></audio></td>
    </tr>
</tbody>
<tbody>
    <tr>
        <th>#2</th>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/GT/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/WaveRNN/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/SC_WaveRNN/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/ParallWaveGAN/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/MelGAN/11.wav" type="audio/wav"></audio></td>
        <<td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/
	    
	    _total/11.wav" type="audio/wav"></audio></td>
    </tr>
</tbody>
<tbody>
    <tr>
        <th>#3</th>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/GT/15.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/WaveRNN/15.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/SC_WaveRNN/15.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/ParallWaveGAN/15.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/MelGAN/15.wav" type="audio/wav"></audio></td>
        <<td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_woman/FMSing_total/15.wav" type="audio/wav"></audio></td>
    </tr>
</tbody>
    <tbody>
    <tr>
        <th>#4</th>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/GT/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/WaveRNN/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/SC_WaveRNN/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/PWG/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/MelGAN/10.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/FMSing_total/10.wav" type="audio/wav"></audio></td>
    </tr>
</tbody>
<tbody>
    <tr>
        <th>#5</th>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/GT/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/WaveRNN/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/SC_WaveRNN/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/PWG/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/MelGAN/11.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/FMSing_total/11.wav" type="audio/wav"></audio></td>
    </tr>
</tbody>
<tbody>
    <tr>
        <th>#6</th>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/GT/12.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/WaveRNN/12.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/SC_WaveRNN/12.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/PWG/12.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/MelGAN/12.wav" type="audio/wav"></audio></td>
        <td style="text-align: center"><audio controls style="width: 150px;"><source src="wav_for_demo/singing/Test_seen_man/FMSing_total/12.wav" type="audio/wav"></audio></td>
    </tr>
</tbody>

Singing Voice Synthesis

带 我 飞 飞 过 绝 望dai wo fei fei guo jue wang

Recording FastSpeech2+Multi-Singer

就 飞 多 远 吧jiu fei duo yuan ba

Recording FastSpeech2+Multi-Singer

就 算 很 受 伤 也 不 闪 泪 光jiu suan hen shou shang ye bu shan lei guang

Recording FastSpeech2+Multi-Singer

我 松 开 时 间 的 绳 索wo song kai shi jian de sheng suo

Recording FastSpeech2+Multi-Singer

原 来 你 生 来 就 属 于 天 际yuan lai ni sheng lai jiu shu yu tian ji

Recording FastSpeech2+Multi-Singer

庄 稼 早 已 收 割 完zhuang jia zao yi shou ge wan

Recording FastSpeech2+Multi-Singer

Data

We provide a Google Drive share link for all the applicants. Please note that anyone who downloads this dataset will be deemed to agree to the dataset sharing license.

Questions

Please contact [email protected]