2/25/2026

Qlean Dataset Launches Japanese Read-Aloud Speech Dataset on Subculture and Spiritual Topics

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai) has launched a new dataset under its AI training data solution, Qlean Dataset, through its subsidiary amanaimages Inc. The newly released dataset, titled Japanese Single-Speaker Read Speech Corpus on Subculture and Spiritual Themes with Transcripts, is designed for use in Automatic Speech Recognition (ASR), speech understanding, and speech–language foundation model development.

This dataset consists of Japanese-language texts related to subculture, spirituality, and healing themes, read aloud in a calm and steady tone by a single native Japanese speaker. Each audio recording is paired with a transcript that faithfully reflects the spoken content. The material includes conceptual and introspective narratives delivered in continuous read-aloud form, making it suitable for training and evaluating alignment between natural spoken language and structured text.

Because the corpus is recorded by a single speaker, it enables model evaluation and training with reduced variability in speaker characteristics. The read-aloud format, rather than conversational dialogue, allows clearer analysis of syntactic structure, vocabulary flow, and the correspondence between speech signals and linguistic expressions.

The dataset is provided as part of Qlean Dataset’s original AI development lineup, AI Data Recipe, and is intended for a wide range of use cases—from academic research to commercial AI system development. Visual Bank Inc. and amanaimages Inc. will continue supporting AI research and development globally by delivering structured datasets aligned with evolving needs in generative AI and speech-language technologies.

Japanese Single-Speaker Read Speech Corpus on Subculture and Spiritual Themes with Transcripts

Data Types

Audio, Text

Speaker Attributes

Native Japanese speaker

File Formats

Audio: MP3
Text: TXT, JSON, CSV

Recording Length

30 seconds to 22 minutes per audio file

Sampling Rate

44.1 kHz / 48 kHz

Recording Scenarios

・A single speaker reading texts related to subculture and spirituality
・Calm narration of conceptual and reflective content

Sample Details

https://qleandataset.visual-bank.co.jp/en/lineup/pn-038

Use Case Examples 

Research Applications 

  • Evaluation of Read Speech Processing in ASR and Speech Understanding Models
    Using paired Japanese read-aloud audio and transcripts, researchers can analyze recognition accuracy and error patterns in models handling continuous, structured speech. 

  • Foundational Research on Speech–Language Correspondence
    The dataset enables examination of relationships between speech signals and linguistic meaning, particularly in concept-driven or introspective text, supporting studies on semantic modeling and speech-language alignment. 

Industrial Applications 

  • Accuracy Evaluation for Voice-Input AI Assistants
    The read-aloud format can be used to assess and improve recognition performance in AI products designed to process narration-style or structured spoken input. 

  • Fine-Tuning of Speech–Language Foundation Models
    Paired single-speaker audio and text data can support training and behavioral validation of foundation models that integrate speech and language processing.

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

  • Existing datasets deliverable within one business day

  • Custom data collection and recording services available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building Next-Generation Data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest photostock providers; Qlean Dataset, which leads research and development in AI data; and THE PEN Inc., an AI-assisted creative tool for manga artists.

CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview

    amana images inc.

    Visual Bank Inc.


    © amanaimages inc.