Emotional Speech Dataset

This is a Japanese emotional speech dataset recorded by 100 Japanese speakers, both men and women, ranging in age from their teens to their fifties.

Subject Attributes

100 Japanese professional voice actors and voice-acting students in their teens to fifties (51 men and 49 women)

Data Size

5.6 GB

Number of Files

6,800

Data Format

WAV
(linear PCM, bit depth: 16 bits or more, sampling frequency: 48,000 Hz)
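
A received file can be checked against the format stated above with a short script. The following is a minimal sketch using Python's standard `wave` module, which reads uncompressed PCM WAV files only. The function names `check_wav_format` and `make_silent_wav` are hypothetical helpers introduced for illustration and are not part of the dataset or any tooling supplied with it. The channel count is reported but not checked, because the listing does not state whether files are mono or stereo.

```python
import io
import wave

# Values taken from the "Data Format" entry of the listing.
EXPECTED_RATE_HZ = 48_000
MIN_SAMPLE_WIDTH_BYTES = 2  # 16 bits or more


def check_wav_format(source):
    """Return (ok, details) for a WAV file path or binary file-like object.

    Hypothetical helper: `ok` is True when the sampling rate is 48,000 Hz
    and the bit depth is at least 16 bits.
    """
    with wave.open(source, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()      # bytes per sample
        channels = wav.getnchannels()
        frames = wav.getnframes()
    details = {
        "sample_rate_hz": rate,
        "bit_depth": width * 8,
        "channels": channels,           # reported only; not part of the check
        "duration_s": frames / rate if rate else 0.0,
    }
    ok = rate == EXPECTED_RATE_HZ and width >= MIN_SAMPLE_WIDTH_BYTES
    return ok, details


def make_silent_wav(rate=48_000, width=2, seconds=1):
    """Build an in-memory mono WAV of silence, used only to demo the check."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(width)
        wav.setframerate(rate)
        wav.writeframes(b"\x00" * (rate * width * seconds))
    buf.seek(0)
    return buf


if __name__ == "__main__":
    # Synthetic input; replace with a path to a real file from the dataset.
    print(check_wav_format(make_silent_wav()))
```

To check a real file, pass its path to `check_wav_format` in place of the synthetic buffer.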

License

Commercial and research use permitted / Rights-cleared / Free for academic use

Notes

[Emotions] 9 types (normal, calm, joy, sadness, anger, fear, disgust, surprise, impatience)
[Speech Style] Normal/Strong
[Script] ① "There's a cell phone on the table." ② "This news is coming on the radio."

amana images inc.

Visual Bank Inc.


© amanaimages inc.