11/4/2025

Qlean Dataset Releases Japanese Two-Person Business Dialogue Corpus for Conversational AI and Speech Recognition

Visual Bank Inc. (Tokyo, Japan; CEO Saneyuki Nagai) has announced the release of the “Japanese Two-Person Business Dialogue Speech Corpus and Text Dataset” through its AI training data solution, Qlean Dataset, developed under its subsidiary Amana Images Inc. The dataset contains hundreds of hours of natural Japanese two-person dialogues with transcriptions, speaker labels, and timestamps for AI applications such as ASR, conversation understanding, and text summarization.
It can be used for both research and commercial purposes to improve Japanese speech recognition accuracy, CX analysis AI, and conversational LLM training.

▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup
▶ About Qlean Dataset: https://qleandataset.visual-bank.co.jp/en

About the “AI Data Recipe” of Qlean Dataset

The “AI Data Recipe” within Qlean Dataset is its commercially available lineup of original datasets.
The datasets are designed for flexible combination based on usage, accuracy, and delivery requirements, and include both annotated and non-annotated data. Each dataset can be customized or expanded to meet specific needs.
Through partnerships with organizations such as Chiba Lotte Marines and Toyo Keizai Inc., as well as domestic and international networks and new recording projects, Qlean Dataset continues to expand its lineup.
This approach significantly reduces the workload required for data collection and preparation in AI development and accelerates project execution.

Overview of the Newly Released Dataset

  • Data Types: Audio, Text

  • Subjects: Japanese male and female speakers

  • Formats: Audio (wav), Text (txt)

  • Recording Length: Hundreds of hours

  • Scenes: Business meetings, SaaS inquiries, outbound calls, and other professional interactions

  • Transcription Structure: Line number, start time, end time, speaker label, utterance content

  • Sample Details: https://qleandataset.visual-bank.co.jp/lineup/pn-013
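
As an illustration of how the transcription structure listed above (line number, start time, end time, speaker label, utterance content) might be read into a program, here is a minimal Python sketch. The release does not specify the file layout, so the tab separator, the times expressed in seconds, and the sample lines are all assumptions made for this example and are not taken from the dataset.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Utterance:
    line_no: int
    start: float   # seconds (assumed unit)
    end: float     # seconds (assumed unit)
    speaker: str
    text: str


def parse_transcript(raw: str, sep: str = "\t") -> List[Utterance]:
    """Parse lines of: line number, start time, end time, speaker label, utterance.

    The separator and time format are assumptions; adjust them to the real files.
    """
    utterances = []
    for line in raw.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # maxsplit=4 keeps any separator inside the utterance text intact.
        line_no, start, end, speaker, text = line.split(sep, 4)
        utterances.append(
            Utterance(int(line_no), float(start), float(end),
                      speaker.strip(), text.strip())
        )
    return utterances


# Invented sample lines, for illustration only.
sample = (
    "1\t0.00\t2.35\tA\tお世話になっております。\n"
    "2\t2.50\t4.10\tB\tこちらこそ、よろしくお願いします。\n"
)

utterances = parse_transcript(sample)
for u in utterances:
    print(u.line_no, u.speaker, f"{u.end - u.start:.2f}s", u.text)
```

If the delivered files use a different delimiter or a clock-style time format (for example hh:mm:ss), only `parse_transcript` would need to change.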

Use Case Examples of the Dataset

  1. Enhancing ASR and Speaker Diarization Models
    Collected across multiple environments (including online and face-to-face dialogues), the data supports robust speech recognition and speaker separation research, including overlapping speech handling and noise resilience.

  2. Training for Conversation Understanding and Summarization AI
    Precise transcriptions with timestamps and speaker segmentation enable development of models for long-form conversation summarization and next-utterance prediction.

  3. CX and Emotional Speech Recognition AI
    Incorporating emotional nuances such as tone and pauses, the dataset is ideal for customer satisfaction analysis and automated call quality evaluation AI.

  4. Sales Intelligence and Negotiation Analysis
    Covers practical dialogues from sales meetings and interviews, supporting AI that quantifies listening skills and conversation patterns.

  5. Contact Center Automation and FAQ Generation AI
    Includes authentic customer support interactions, useful for training voice bots and FAQ generation models.

  6. Speech UX and Conversational Interface Design
    Natural tempo and response patterns make it ideal for training AI assistants and smart-speaker voice UX systems.

  7. Emotional Change Detection and Experience Quality Evaluation
    Supports AI research on emotion detection and experience quality assessment through pitch and pause analysis.

  8. Japanese LLM and Multimodal Generative AI Training
    The paired speech-text structure enables multimodal LLM development and Japanese dialogue generation research.
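
Several of the use cases above (speaker diarization, overlapping speech handling, quantifying listening skills) rely on the timestamps and speaker labels alone. The following Python sketch shows one possible way to derive per-speaker talk time and overlapping-speech duration from such segments. The segment values are invented, the function names are hypothetical, and the approach assumes a two-speaker dialogue in which a single speaker's own segments do not overlap each other.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (start_sec, end_sec, speaker_label); a hypothetical in-memory representation.
Segment = Tuple[float, float, str]


def talk_time(segments: List[Segment]) -> Dict[str, float]:
    """Sum the speaking duration of each speaker."""
    totals: Dict[str, float] = defaultdict(float)
    for start, end, speaker in segments:
        totals[speaker] += end - start
    return dict(totals)


def overlap_seconds(segments: List[Segment]) -> float:
    """Total time during which two different speakers talk at once."""
    total = 0.0
    for i, (s1, e1, spk1) in enumerate(segments):
        for s2, e2, spk2 in segments[i + 1:]:
            if spk1 != spk2:
                # Length of the intersection of the two intervals, if any.
                total += max(0.0, min(e1, e2) - max(s1, s2))
    return total


# Invented example values, not taken from the dataset.
segments = [(0.0, 4.0, "A"), (3.5, 6.0, "B"), (6.0, 7.0, "A")]

totals = talk_time(segments)
overlap = overlap_seconds(segments)
share_a = totals["A"] / sum(totals.values())

print(totals, overlap, round(share_a, 3))
```

The pairwise comparison is quadratic in the number of segments, which is acceptable for a single conversation; a sweep over sorted segment boundaries would be a better fit for hours of audio.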

Features of Qlean Dataset

All datasets are rights-cleared and commercially usable, and are collected with full participant consent and in compliance with international privacy requirements.
They are delivered via flexible “Data Recipes” for rapid deployment and customizable dataset creation.

Contact form: https://qleandataset.visual-bank.co.jp/en/contact
Service site: https://qleandataset.visual-bank.co.jp/en

About Visual Bank Inc.

Visual Bank Inc. is a next-generation data infrastructure company committed to “unleashing the potential of all data.”
The company operates THE PEN, an AI-powered assistance tool for manga artists, and wholly owns Amana Images Inc., which provides the AI training data service Qlean Dataset.
Visual Bank has been recognized in national R&D programs and continues to advance initiatives toward real-world AI implementation.

CEO: Saneyuki Nagai
Address: C-Cube Minami Aoyama Bldg. 6F, 7-1-7 Minami Aoyama, Minato-ku, Tokyo 107-0062
Corporate website: https://visual-bank.co.jp/en/
Amana Images overview: https://qleandataset.visual-bank.co.jp/en/company-overview
