12/24/2025

Qlean Dataset Launches Japanese Two-Speaker Comedy Dialogue Speech Corpus

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai; hereinafter “Visual Bank”) has launched the Japanese Two-Speaker Comedy-Themed Dialogue Speech Corpus Dataset as part of its AI training data solution, Qlean Dataset, which is operated through its subsidiary, Amana Images Inc.

This dataset is part of AI Data Recipe, Qlean Dataset’s collection of machine learning datasets. It contains natural Japanese dialogue speech between two speakers, recorded with male and female speakers in their 20s to 50s.

The recorded audio consists of casual, comedy-style conversations characterized by humor, laughter, and lively exchanges. Because the dialogues progress without scripts, the dataset captures spontaneous reactions, variations in conversational tempo, topic digressions, and natural comedic elements commonly found in real-world Japanese conversations.

The conversations include natural speaker turn-taking as well as overlapping speech between the two speakers. These characteristics make the dataset suitable for training and evaluation tasks such as turn-taking analysis, speaker identification, and dialogue structure modeling.
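As a rough illustration of what turn-taking analysis on two-speaker audio involves, the sketch below counts speaker changes and measures overlapping speech from time-stamped segments. It assumes segment annotations of the form (speaker, start, end) in seconds; the announcement does not say whether the dataset ships with such labels, so the `Segment` format, the function names, and the sample values are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical annotation format: (speaker label, start in seconds, end in seconds).
Segment = Tuple[str, float, float]


def speaker_changes(segments: List[Segment]) -> int:
    """Count how often the speaker differs between consecutive segments (by start time)."""
    ordered = sorted(segments, key=lambda seg: seg[1])
    return sum(1 for prev, cur in zip(ordered, ordered[1:]) if prev[0] != cur[0])


def overlap_seconds(segments: List[Segment]) -> float:
    """Sum the time during which segments from different speakers overlap.

    Assumes a single speaker's own segments never overlap each other.
    """
    total = 0.0
    for i, (spk_a, start_a, end_a) in enumerate(segments):
        for spk_b, start_b, end_b in segments[i + 1:]:
            if spk_a == spk_b:
                continue
            # Length of the intersection of the two intervals, if any.
            total += max(0.0, min(end_a, end_b) - max(start_a, start_b))
    return total


# Made-up example values, not taken from the dataset.
example = [
    ("A", 0.0, 2.5),
    ("B", 2.0, 4.0),
    ("A", 4.2, 6.0),
    ("B", 5.5, 7.0),
]
print(speaker_changes(example))   # 3
print(overlap_seconds(example))   # 1.0
```

In practice the segments would come from manual annotation or a speaker diarization system rather than being written by hand.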

All recordings were conducted in relaxed communication settings that resemble real-world usage scenarios. As a result, the dataset supports research and development under conditions close to actual deployment, for conversational AI systems, speech assistants, and dialogue-based applications that rely on automatic speech recognition (ASR) and natural language processing (NLP) technologies.

Overview of the “Japanese Two-Speaker Comedy Dialogue Speech Corpus”

Overview	A Japanese dialogue speech dataset featuring two speakers engaging in light, humorous conversations with a natural conversational flow.
Data Type	Audio
Speaker Attributes	Male and female speakers in their 20s to 50s
File Format	MP3 / WAV
Total Duration	Approximately 330 hours (each recording runs approximately 5 to 60 minutes)
Sampling Rate	44.1 kHz
Recorded Scene Characteristics	Casual exchanges between two speakers with humor and laughter; conversations reflecting spontaneous reactions and changes in tempo; freely progressing dialogues without scripted structures; naturally occurring comedic elements such as playful remarks and topic digressions; dialogue scenes centered on relaxed and informal communication
Sample Details	https://qleandataset.visual-bank.co.jp/lineup/pn-020
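
For readers who want to check a received WAV file against the specifications above, the following is a minimal sketch using Python's standard `wave` module. It reads the header and reports whether the sampling rate matches the 44.1 kHz listed in the table. The function name and the returned dictionary keys are illustrative assumptions; the table does not state bit depth or channel count, so those are only reported, not validated.

```python
import wave

EXPECTED_RATE_HZ = 44_100  # sampling rate listed in the overview table


def describe_wav(path: str) -> dict:
    """Read a WAV header and summarize its basic properties."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        frames = wav.getnframes()
        return {
            "sample_rate_hz": rate,
            "channels": wav.getnchannels(),            # not specified in the table
            "sample_width_bytes": wav.getsampwidth(),  # not specified in the table
            "duration_sec": frames / rate,
            "rate_matches": rate == EXPECTED_RATE_HZ,
        }
```

Calling `describe_wav("recording.wav")` on a local file (the file name here is a placeholder) returns the summary as a dictionary. MP3 files are not handled by the `wave` module and would need a separate decoder.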

Use Case Examples for the Japanese Two-Speaker Comedy Dialogue Speech Corpus

For Research Applications

Dialogue Structure Analysis
This dataset can be used to evaluate dialogue structure analysis methods, including turn-taking detection, speaker alternation, and segmentation of conversational units between two speakers.
Natural Language Processing Research on Casual Dialogue
By using unscripted casual conversations, researchers can study topic development and response generation behaviors in non-task-oriented dialogue systems.

For Industrial Applications

Response Generation and Understanding Models for Conversational AI
The dataset can be used to train and evaluate response generation and dialogue understanding models for voice assistants and conversational services that require natural conversational flow.
Speaker Identification and Turn-Taking Technologies
Two-speaker conversational audio enables validation of speaker change detection, utterance boundary estimation, and other dialogue control-related technologies.

For Educational and Practical Use

Educational Data for Speech Processing and Dialogue AI
The dataset can serve as training material for speech recognition and conversational AI courses at universities and professional education institutions, allowing learners to work with real dialogue-specific processing challenges.

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

Existing datasets deliverable within one business day
Custom data collection and recording services available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building Next-Generation Data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists, and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest photostock providers; Qlean Dataset, which leads research and development in AI data; and THE PEN Inc., an AI-assisted creative tool for manga artists.

CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview
