12/4/2025

Qlean Dataset Releases a Japanese Business Single-Speaker Narrative Monologue Speech Corpus for ASR and Language AI 

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai; “Visual Bank”) has launched the “Japanese Business Single-Speaker Narrative Monologue Speech Corpus” as part of its AI training data solution Qlean Dataset, operated through its subsidiary Amana Images Inc.
This corpus contains long-form Japanese speech recorded by male and female speakers in their 20s to 40s. It can be used for research and implementation in automatic speech recognition (ASR), natural language processing (NLP), dialogue models, and foundational generative AI systems.

This corpus consists of approximately 473 hours of single-speaker audio recorded in mp3 format (44.1 kHz).
It features extended monologues in which speakers continuously explain topics related to business, management, and work styles. The recordings are characterized by long-form narrative speech, context-dependent transitions, and natural prosody.
Because the speech is unscripted, the dataset can serve as cross-modal training and validation data for ASR evaluation, semantic understanding, dialogue generation, and generative AI models that take speech as input.


The dataset includes long-duration single-speaker speech that resembles natural speaking conditions and covers diverse topic flows. It is suitable for validating the generalization performance of speech models and for developing enterprise AI applications, educational support AI, and voice-UI systems for organizations.
All speech is fully rights-cleared, making it safe for both research use and commercial AI development.

Overview of the “Japanese Business Single-Speaker Narrative Monologue Speech Corpus”

Data Type: Audio
Speaker Attributes: Male and female speakers in their 20s–40s
Format: mp3
Total Duration: Approx. 473 hours (per-file length: 5–40 minutes)
Sampling Rate: 44.1 kHz

Scene Description:
・Continuous explanations and commentary by a single speaker on business-related themes
・Long-form monologues and narrative-style natural speech
— Includes everyday topic development, structured opinions, and anecdotal descriptions
・Unscripted speech reflecting natural rhythm and pacing
— Includes context-dependent narration, topic shifts, and natural emotional intonation

Details: https://qleandataset.visual-bank.co.jp/en/lineup/pn-007
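
The release gives the total duration and the per-file length range but not the number of files. As a rough planning aid, the short Python sketch below derives the range of file counts consistent with those figures. The function name `file_count_bounds` is hypothetical, and because the 473-hour total is approximate, the result should be read as an estimate rather than a published specification.

```python
import math

TOTAL_HOURS = 473        # approximate total duration stated in the overview
MIN_FILE_MINUTES = 5     # shortest stated per-file length
MAX_FILE_MINUTES = 40    # longest stated per-file length


def file_count_bounds(total_hours, min_minutes, max_minutes):
    """Return (fewest, most) files consistent with the stated durations."""
    total_minutes = total_hours * 60
    fewest = math.ceil(total_minutes / max_minutes)  # every file at the maximum length
    most = math.floor(total_minutes / min_minutes)   # every file at the minimum length
    return fewest, most


fewest, most = file_count_bounds(TOTAL_HOURS, MIN_FILE_MINUTES, MAX_FILE_MINUTES)
print(f"Between {fewest} and {most} files")  # prints "Between 710 and 5676 files"
```

The actual file count depends on how recording lengths are distributed within the 5–40 minute range, which the release does not describe.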


Use Case Examples

– Research (Academia)

  • ASR Research
    Suitable for evaluating ASR models due to long-form continuous speech with vocabulary diversity and context-dependent expressions.

  • Speech-Language Understanding / NLP Research
    Useful for tasks involving context retention, topic shifts, and semantic analysis, including summarization, topic classification, and intent recognition.

  • Generative AI / Dialogue Model Input Evaluation
    Applicable to evaluating end-to-end pipelines that run from speech to text to response generation, as well as to multimodal generative AI benchmarking.

– Industry

  • Meeting-minute generation and speech summarization AI
    Rich business-context narrative speech enables evaluation of summarization, intent extraction, and information-structuring AI.

  • Voice-UI / Enterprise AI Assistants
    Long-form explanatory speech helps improve comprehension accuracy in internal dialogue systems and automated FAQ response models.

  • Multimodal AI – Audio Understanding
    Natural speech characteristics allow use in training and validating models integrating speech and text reasoning.

– Education and Social Implementation

  • Includes natural narration-style speech, making it suitable for evaluating educational audio-content generation AI.
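
To illustrate the ASR evaluation use case listed above, the following Python sketch computes character error rate (CER), a metric commonly used for Japanese ASR because Japanese text is not separated into words by spaces. This is a minimal illustration, not part of Qlean Dataset: the function names and the sample sentences are hypothetical, and it assumes reference transcripts are available, which the release does not state.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,           # deletion
                            curr[j - 1] + 1,       # insertion
                            prev[j - 1] + cost))   # substitution or match
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    ref = reference.replace(" ", "")
    hyp = hypothesis.replace(" ", "")
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical sample: one substituted character out of ten.
reference = "本日の会議を始めます"
hypothesis = "本日の会議を初めます"
print(f"CER: {cer(reference, hypothesis):.2f}")  # prints "CER: 0.10"
```

For long recordings such as the 5–40 minute files in this corpus, CER is usually computed per segment or per file and then aggregated, so that a single long recording does not mask localized errors.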

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports diverse data types including images, videos, audio, 3D, and text—enabling both research and commercial AI development in a legally safe environment.

Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continuously expands its specialized, industry-relevant lineup known as the “AI Data Recipe.”

By reducing the operational burden of data collection and preparation, Qlean Dataset helps build legally compliant and risk-free AI development environments.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en/
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

  • Full consent obtained from all subjects; compliant with GDPR and CCPA

  • Existing datasets deliverable within one business day

  • Custom data collection and recording available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building next-generation data infrastructure to maximize AI development capabilities under the mission, “Unlock the potential of all data.”
The company operates THE PEN, an AI-assisted creative tool for manga artists, and wholly owns Amana Images Inc., which provides the Qlean Dataset service.

CEO: Saneyuki Nagai
Address: C-Cube Minami Aoyama Building 6F, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en/
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview


    © amanaimages inc.