11/26/2025
Qlean Dataset Releases Authentic Japanese Narrative Monologue Corpus for Speech and Language AI

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai; hereinafter “Visual Bank”) has launched a new dataset, the “Japanese Narrative Monologue Speech Corpus Dataset,” under its AI training data solution Qlean Dataset, operated through its subsidiary Amana Images Inc.
This corpus contains natural monologue-style speech by Japanese speakers, based on their personal experiences, interests, and life stories. The dataset is provided in high-quality WAV format (48 kHz / 16-bit) and includes natural intonation and prosody, enabling applications in automatic speech recognition (ASR), speaker feature extraction, text-to-speech (TTS), summarization models, natural language processing (NLP), and multimodal AI systems including speech-enabled large language models.
Long-form single-speaker recordings captured in natural environments are suitable for evaluating AI model generalization and real-world speech understanding performance. The dataset can be used across a wide range of fields, including Japanese speech research at universities and research institutions, enterprise dialogue systems and call-center optimization, and speech-understanding AI for education and social support services.
Dataset Specifications
Subject Attributes: Japanese speakers aged 30s–60s, including men, women, and children
File Format: WAV
Duration: Approximately 15 minutes per recording
Content: Speakers talk freely about their own experiences and areas of interest
Audio Specifications: 48 kHz / 16-bit
Sample Data: https://qleandataset.visual-bank.co.jp/en/lineup/pn-031
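For readers who want to confirm that a downloaded recording matches the specifications above, the following is a minimal sketch using only Python's standard `wave` module. The function names (`describe_wav`, `matches_spec`, `uncompressed_size_bytes`) are hypothetical and not part of any Qlean Dataset tooling, and the channel count is reported rather than checked because the specifications do not state whether recordings are mono or stereo.

```python
import wave

# Values taken from the specification list above.
EXPECTED_SAMPLE_RATE_HZ = 48_000
EXPECTED_BIT_DEPTH = 16


def describe_wav(path):
    """Read a PCM WAV header and return its basic audio parameters."""
    with wave.open(path, "rb") as wav:
        sample_rate = wav.getframerate()
        return {
            "sample_rate_hz": sample_rate,
            "bit_depth": wav.getsampwidth() * 8,  # bytes per sample -> bits
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / sample_rate,
        }


def matches_spec(info):
    """Return True when sample rate and bit depth match the published values."""
    return (
        info["sample_rate_hz"] == EXPECTED_SAMPLE_RATE_HZ
        and info["bit_depth"] == EXPECTED_BIT_DEPTH
    )


def uncompressed_size_bytes(duration_s, channels=1):
    """Estimate raw PCM payload size; channels=1 (mono) is an assumption."""
    bytes_per_sample = EXPECTED_BIT_DEPTH // 8
    return int(duration_s * EXPECTED_SAMPLE_RATE_HZ * bytes_per_sample * channels)
```

Under the mono assumption, `uncompressed_size_bytes(15 * 60)` gives 86,400,000 bytes, or roughly 86 MB of audio data per 15-minute recording, which may help when planning storage for the corpus.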
Example Use Cases
【Research Use (Academia)】
ASR Model Improvement
Long-form natural monologues provide data for training and evaluation of ASR models handling sentence endings, punctuation variation, and topic shifts. Suitable for baseline benchmarking in research settings.
Speaker Recognition and Acoustic Feature Analysis
The dataset includes natural speech from Japanese speakers across multiple age groups, supporting research in voiceprint feature extraction, clustering, age estimation, and acoustic phonetics.
【Industrial Use (Enterprises)】
Performance Enhancement for Voice Applications
Useful for improving recognition accuracy in applications relying on single-speaker input, such as voice UI, voice search, and smart devices.
Improved Speech Input for Generative and Multimodal AI
The long-form contextual narrative speech is well-suited for improving preprocessing accuracy in multimodal AI systems that convert audio to text and meaning (embedding generation). Applicable to speech-based Q&A, dialogue generation, summarization, and speech-enabled LLM performance enhancement.
Natural Dialogue Models for Robotics and Agents
Supports evaluation of context-retention models for long-form speech, contributing to natural conversational performance in care robots, reception AI, and home robotics.
【Other Practical Uses (Education & Social Implementation)】
Educational AI and Japanese Language Learning Support
The dataset covers various speaking styles, vocabulary choices, and narrative structures, enabling use in Japanese-language education, speech instruction, and pronunciation-training AI tools.
About Qlean Dataset
Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports diverse data types including images, videos, audio, 3D, and text—enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continuously expands its specialized, industry-relevant lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps build legally compliant and risk-free AI development environments.
▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en/
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup




Key Features of Qlean Dataset
Full consent obtained from all subjects; compliant with GDPR and CCPA
Existing datasets deliverable within one business day
Custom data collection and recording available
▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact
About Visual Bank Inc.
Visual Bank Inc. is a Tokyo-based startup building next-generation data infrastructure to maximize AI development capabilities under the mission, “Unlock the potential of all data.”
The company operates THE PEN, an AI-assisted creative tool for manga artists, and wholly owns Amana Images Inc., which provides the Qlean Dataset service.
CEO: Saneyuki Nagai
Address: C-Cube Minami Aoyama Building 6F, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en/
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview





