Audio
Emotional Speech Dataset

This is a Japanese speech dataset of 100 Japanese men and women in their teens to fifties.
Subject Attributes
100 Japanese professional voice actors and voice acting students in their teens to fifties, 51 men and 49 women
Data Size
5.6GB
Number of Files
6,800
Data Format
wav
(Linear PCM, bit depth: 16 bits or more, sampling frequency: 48,000 Hz)
License
Commercial and research use permitted / Rights-cleared / Free for academic use (details here)
Notes
[Emotions] 9 types (normal, calm, joy, sadness, anger, fear, disgust, surprise, impatience)
[Speech Style] Normal/Strong
[Dialogue] ① There's a cell phone on the table. ② This news is coming on the radio.
Sample
Request a Free Sample































