12/24/2025

Qlean Dataset Launches Japanese Two-Speaker Comedy Dialogue Speech Corpus

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai; hereinafter “Visual Bank”) has launched the Japanese Two-Speaker Comedy-Themed Dialogue Speech Corpus Dataset as part of its AI training data solution, Qlean Dataset, which is operated through its subsidiary, Amana Images Inc.

 This dataset is offered as one of the lineups within AI Data Recipe, Qlean Dataset’s machine learning dataset collection. It contains natural Japanese dialogue speech recorded by two speakers—male and female individuals in their 20s to 50s.

 The recorded audio consists of casual, comedy-style conversations characterized by humor, laughter, and lively exchanges. Because the dialogues progress without scripts, the dataset captures spontaneous reactions, variations in conversational tempo, topic digressions, and natural comedic elements commonly found in real-world Japanese conversations.

 The conversations include natural speaker turn-taking as well as overlapping speech between the two speakers. These characteristics make the dataset suitable for training and evaluation tasks such as turn-taking analysis, speaker identification, and dialogue structure modeling.

 All recordings were conducted in relaxed communication settings that resemble real-world usage scenarios. As a result, the dataset can be used under conditions close to actual deployment environments for research and development of conversational AI systems, speech assistants, and dialogue-based applications that rely on automatic speech recognition (ASR) and natural language processing (NLP) technologies.

Overview of the “Japanese Two-Speaker Fashion & Beauty Dialogue Speech Corpus”

 

Overview

A Japanese dialogue speech dataset featuring two speakers engaging in light, humorous conversations with a natural conversational flow.

Data Type

Audio

Speaker Attributes

Male and female speakers in their 20s to 50s

File Format

MP3 / WAV

Total Duration

Approximately 330 hours
(Each recording ranges from approximately 5 to 60 minutes)

Sampling Rate

44.1 kHz

Recorded Scene Characteristics

Casual exchanges between two speakers with humor and laughterConversations reflecting spontaneous reactions and changes in tempoFreely progressing dialogues without scripted structuresNaturally occurring comedic elements such as playful remarks and topic digressionsDialogue scenes centered on relaxed and informal communication

Sample Details

https://qleandataset.visual-bank.co.jp/lineup/pn-020

Use Case Examples for the Japanese Two-Speaker Comedy Dialogue Speech Corpus 

For Research Applications

  • Dialogue Structure Analysis
    This dataset can be used to evaluate dialogue structure analysis methods, including turn-taking detection, speaker alternation, and segmentation of conversational units between two speakers.

  • Natural Language Processing Research on Casual Dialogue
    By using unscripted casual conversations, researchers can study topic development and response generation behaviors in non-task-oriented dialogue systems.

For Industrial Applications

  • Response Generation and Understanding Models for Conversational AI
    The dataset can be used to train and evaluate response generation and dialogue understanding models for voice assistants and conversational services that require natural conversational flow.

  • Speaker Identification and Turn-Taking Technologies
    Two-speaker conversational audio enables validation of speaker change detection, utterance boundary estimation, and other dialogue control-related technologies.

For Educational and Practical Use

  • Educational Data for Speech Processing and Dialogue AI
    The dataset can serve as training material for speech recognition and conversational AI courses at universities and professional education institutions, allowing learners to work with real dialogue-specific processing challenges.

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

  • Existing datasets deliverable within one business day

  • Custom data collection and recording services available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building Next-Generation Data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest photostock providers; Qlean Dataset, which leads research and development in AI data; and THE PEN Inc., an AI-assisted creative tool for manga artists.

CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview

    amana images inc.

    Visual Bank Inc.


    © amanaimages inc.