12/18/2025

Qlean Dataset Launches a Single-Speaker Japanese Regional Dialect Speech Dataset

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai; hereinafter “Visual Bank”) has launched a new dataset titled Japanese Single-Speaker Regional Dialect Monologue Speech Dataset as part of its AI training data solution, Qlean Dataset, operated through its subsidiary, Amana Images Inc.


This dataset contains single-speaker Japanese speech recorded in regional dialects from across Japan and is newly added to Qlean Dataset’s AI Data Recipe lineup.

It supports research and development in Japanese speech AI, including ASR and speech language models, with a focus on evaluating dialectal speech.

The audio features dialects such as Kansai, Okayama, Iyo, and Tosa, spoken by Japanese men and women in their 20s to 60s. The recordings are scripted monologues on everyday topics or personal thoughts, delivered in a way that preserves natural rhythm, pauses, and region-specific expressions.

The dataset can also be customized or newly recorded to meet specific development needs, enabling practical evaluation and development of speech models for real-world applications.

Overview of the “Single-Speaker Japanese Regional Dialect Speech Dataset”

Data Type: Audio
Speaker Attributes: Japanese men and women in their 20s to 60s
File Formats: MP3 / WAV
Total Duration: Several hundred hours (approximately 10 minutes per recording)
Sampling Rate / Bit Depth: 44.1 kHz, 48 kHz / 16-bit, 24-bit
Dialects Included: Kansai, Okayama, Iyo, Tosa, and others (to be expanded)
Recording Scenes: Single-speaker monologues on personal thoughts and everyday topics; script-based recordings with natural rhythm, pauses, and expressions
Sample Details: https://qleandataset.visual-bank.co.jp/en/lineup/ds-099
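
For teams receiving WAV files, the listed sampling rates and bit depths can serve as a simple acceptance check. The following is a minimal sketch using Python's standard `wave` module; the function names, the idea of treating the overview as a checklist, and the silent test file are assumptions made for illustration and are not part of the Qlean Dataset delivery format.

```python
import io
import wave

# Values copied from the overview; using them as a checklist is an assumption.
ALLOWED_SAMPLE_RATES = {44_100, 48_000}
ALLOWED_BIT_DEPTHS = {16, 24}


def check_wav_spec(source):
    """Return (sample_rate, bit_depth, duration_seconds, ok) for a PCM WAV.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        sample_rate = wav.getframerate()
        bit_depth = wav.getsampwidth() * 8  # bytes per sample -> bits
        duration = wav.getnframes() / sample_rate
    ok = sample_rate in ALLOWED_SAMPLE_RATES and bit_depth in ALLOWED_BIT_DEPTHS
    return sample_rate, bit_depth, duration, ok


def make_silent_wav(sample_rate, sample_width_bytes, seconds):
    """Build a mono, silent PCM WAV in memory.

    Hypothetical stand-in for a real recording, used only to try the check.
    """
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(sample_width_bytes)
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00" * (sample_rate * sample_width_bytes * seconds))
    buffer.seek(0)
    return buffer
```

Calling `check_wav_spec(make_silent_wav(48_000, 3, 2))` exercises the check on a two-second, 24-bit file. MP3 files are outside the scope of this sketch, since the `wave` module reads uncompressed PCM WAV only.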

Use Case Examples for the Japanese Regional Dialect Speech Dataset

[Research Use Cases]

  • Dialect-Aware Japanese ASR Research
    Region-specific dialect speech enables evaluation of phonetic variation and recognition accuracy across regions beyond standard Japanese datasets.

  • Generalization Evaluation of Speech Language Models
    Long-form, single-speaker dialect recordings support assessment of model generalization and condition-dependent behavior.

  • Prosody and Intonation Analysis for Dialect Speech Synthesis
    Dialect speech data enables analysis of prosody, rhythm, and intonation for training and evaluating natural-sounding speech synthesis models.
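
As one illustration of the ASR evaluation described above, recognition accuracy can be compared across dialects with a per-dialect character error rate (CER). The sketch below is an assumption about how such an evaluation might be set up, not a method specified by Qlean Dataset; the function names and the `(dialect, reference, hypothesis)` input layout are hypothetical.

```python
from collections import defaultdict


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (per character for strings)."""
    previous = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        current = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def cer_by_dialect(samples):
    """Aggregate CER per dialect.

    `samples` is an iterable of (dialect, reference, hypothesis) triples
    (hypothetical layout). CER is total edit distance divided by total
    reference length, so long recordings weigh more than short ones.
    """
    errors = defaultdict(int)
    ref_chars = defaultdict(int)
    for dialect, ref, hyp in samples:
        errors[dialect] += edit_distance(ref, hyp)
        ref_chars[dialect] += len(ref)
    return {d: errors[d] / ref_chars[d] for d in errors if ref_chars[d] > 0}
```

Character-level scoring is used here because Japanese text has no whitespace word boundaries; a word-level metric would first require a tokenizer, which this sketch deliberately leaves out.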

[Industrial Use Cases]

  • Development of Dialect-Responsive ASR Systems
    Dialect speech supports training and validation of ASR models for real-world applications such as call centers and voice-input interfaces.

  • Use-Case-Specific Data Design for Japanese Speech Models
    Combining dialect and standard Japanese speech enables broader model coverage and performance evaluation tailored to specific applications.

  • Validation of Speech Synthesis and Conversational AI Using Dialect Speech
    Dialect monologue recordings enable evaluation of naturalness and intonation control in speech synthesis and conversational AI.

[Educational and Training Use Cases]

  • Educational Materials for Speech and Audio AI
    The dataset can be used as real-world audio material for teaching and training in speech recognition, speech synthesis, and speech language models, incorporating regional and speaker variability into educational design.

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

  • Existing datasets deliverable within one business day

  • Custom data collection and recording services available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building next-generation data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists, and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest stock photo providers; Qlean Dataset, which leads research and development in AI data; and THE PEN Inc., which develops an AI-assisted creative tool for manga artists.

CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview


    © amanaimages inc.