11/26/2025

Qlean Dataset Releases Authentic Japanese Narrative Monologue Corpus for Speech and Language AI

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai; hereinafter “Visual Bank”) has launched a new dataset titled the 「Japanese Narrative Monologue Speech Corpus Dataset」 under its AI training data solution Qlean Dataset, operated through its subsidiary Amanaimages Inc.

This corpus contains natural monologue-style speech by Japanese speakers, based on their personal experiences, interests, and life stories. The dataset is provided in high-quality WAV format (48 kHz / 16-bit) and includes natural intonation and prosody, enabling applications in automatic speech recognition (ASR), speaker feature extraction, text-to-speech (TTS), summarization models, natural language processing (NLP), and multimodal AI systems including speech-enabled large language models.
Long-form single-speaker recordings captured in natural environments are suitable for evaluating AI model generalization and real-world speech understanding performance. The dataset can be used across a wide range of fields, including Japanese speech research at universities and research institutions, enterprise dialogue systems and call-center optimization, and speech-understanding AI for education and social support services.

Dataset Specifications

  • Subject Attributes:Japanese speakers aged 30s–60s, including men, women, and children

  • File Format:WAV

  • Duration:Approximately 15 minutes per recording

  • Content:Speakers talk freely about their own experiences and areas of interest

  • Audio Specifications:48 kHz / 16-bit

  • Sample Data:https://qleandataset.visual-bank.co.jp/en/lineup/pn-031

Example Use Cases

【Research Use (Academia)】

  • ASR Model Improvement
    Long-form natural monologues provide data for training and evaluation of ASR models handling sentence endings, punctuation variation, and topic shifts. Suitable for baseline benchmarking in research settings.

  • Speaker Recognition and Acoustic Feature Analysis
    The dataset includes natural speech from Japanese speakers across multiple age groups, supporting research in voiceprint feature extraction, clustering, age estimation, and acoustic phonetics.

【Industrial Use (Enterprises)】

  • Performance Enhancement for Voice Applications
    Useful for improving recognition accuracy in applications relying on single-speaker input, such as voice UI, voice search, and smart devices.

  • Improved Speech Input for Generative and Multimodal AI
    The long-form contextual narrative speech is well-suited for improving preprocessing accuracy in multimodal AI systems that convert audio to text and meaning (embedding generation). Applicable to speech-based Q&A, dialogue generation, summarization, and speech-enabled LLM performance enhancement.

  • Natural Dialogue Models for Robotics and Agents
    Supports evaluation of context-retention models for long-form speech, contributing to natural conversational performance in care robots, reception AI, and home robotics.

【Other Practical Uses (Education & Social Implementation)】

  • Educational AI and Japanese Language Learning Support
    The dataset covers various speaking styles, vocabulary choices, and narrative structures, enabling use in Japanese-language education, speech instruction, and pronunciation-training AI tools.

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports diverse data types including images, videos, audio, 3D, and text—enabling both research and commercial AI development in a legally safe environment.

Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continuously expands its specialized, industry-relevant lineup known as the “AI Data Recipe.”

By reducing the operational burden of data collection and preparation, Qlean Dataset helps build legally compliant and risk-free AI development environments.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en/
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

  • Full consent obtained from all subjects; compliant with GDPR and CCPA

  • Existing datasets deliverable within one business day

  • Custom data collection and recording available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building next-generation data infrastructure to maximize AI development capabilities under the mission, “Unlock the potential of all data.”
The company operates THE PEN, an AI-assisted creative tool for manga artists, and wholly owns Amana Images Inc., which provides the Qlean Dataset service.

CEO: Saneyuki Nagai
Address: C-Cube Minami Aoyama Building 6F, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en/
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview

    amana images inc.

    Visual Bank Inc.


    © amanaimages inc.