2/25/2026
Qlean Dataset Launches Japanese Read-Aloud Speech Dataset on Subculture and Spiritual Topics

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai) has launched a new dataset under its AI training data solution, Qlean Dataset, through its subsidiary amanaimages Inc. The newly released dataset, titled Japanese Single-Speaker Read Speech Corpus on Subculture and Spiritual Themes with Transcripts, is designed for use in Automatic Speech Recognition (ASR), speech understanding, and speech–language foundation model development.
This dataset consists of Japanese-language texts related to subculture, spirituality, and healing themes, read aloud in a calm and steady tone by a single native Japanese speaker. Each audio recording is paired with a transcript that faithfully reflects the spoken content. The material includes conceptual and introspective narratives delivered in continuous read-aloud form, making it suitable for training and evaluating alignment between natural spoken language and structured text.
Because the corpus is recorded by a single speaker, it enables model evaluation and training with reduced variability in speaker characteristics. The read-aloud format, rather than conversational dialogue, allows clearer analysis of syntactic structure, vocabulary flow, and the correspondence between speech signals and linguistic expressions.
The dataset is provided as part of Qlean Dataset’s original AI development lineup, AI Data Recipe, and is intended for a wide range of use cases—from academic research to commercial AI system development. Visual Bank Inc. and amanaimages Inc. will continue supporting AI research and development globally by delivering structured datasets aligned with evolving needs in generative AI and speech-language technologies.
Japanese Single-Speaker Read Speech Corpus on Subculture and Spiritual Themes with Transcripts
Data Types | Audio, Text |
|---|---|
Speaker Attributes | Native Japanese speaker |
File Formats | Audio: MP3 |
Recording Length | 30 seconds to 22 minutes per audio file |
Sampling Rate | 44.1 kHz / 48 kHz |
Recording Scenarios | ・A single speaker reading texts related to subculture and spirituality |
Sample Details |
Use Case Examples
Research Applications
Evaluation of Read Speech Processing in ASR and Speech Understanding Models
Using paired Japanese read-aloud audio and transcripts, researchers can analyze recognition accuracy and error patterns in models handling continuous, structured speech.Foundational Research on Speech–Language Correspondence
The dataset enables examination of relationships between speech signals and linguistic meaning, particularly in concept-driven or introspective text, supporting studies on semantic modeling and speech-language alignment.
Industrial Applications
Accuracy Evaluation for Voice-Input AI Assistants
The read-aloud format can be used to assess and improve recognition performance in AI products designed to process narration-style or structured spoken input.Fine-Tuning of Speech–Language Foundation Models
Paired single-speaker audio and text data can support training and behavioral validation of foundation models that integrate speech and language processing.
About Qlean Dataset
Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.
▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup




Key Features of Qlean Dataset
Existing datasets deliverable within one business day
Custom data collection and recording services available
▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact
About Visual Bank Inc.
Visual Bank Inc. is a Tokyo-based startup building Next-Generation Data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest photostock providers; Qlean Dataset, which leads research and development in AI data; and THE PEN Inc., an AI-assisted creative tool for manga artists.
CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview





