语音相关dataset

http://www.openslr.org/resources.php

https://test.data-baker.com/#/data/index/compose

22050 Hz sampling rate.

*Total Clips 13,100
Total Words 225,715
Total Characters 1,308,678
*Total Duration 23:55:17
Mean Clip Duration 6.57 sec
Min Clip Duration 1.11 sec
Max Clip Duration 10.10 sec
Mean Words per Clip 17.23
Distinct Words 13,821

585 hours of read English speech at 24kHz sampling rate.

110 English speakers with various accents. Each speaker reads out about 400 sentences. roughly 44 hours

roughly 85 hours of emotion-neutral recordings spoken by 218 native Chinese mandarin speakers and total 88035 utterances.

posted on 2021-03-27 21:33  HolaWorld  阅读(199)  评论(0编辑  收藏  举报

导航