4/3/2026
New Multimodal Dataset: High-Stakes Self-PR Scenarios in Japanese

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai) is pleased to announce the release of the "Japanese Job Candidate Self-PR Video Dataset" through Qlean Dataset, the AI training data solution operated by its subsidiary, amanaimages Inc. The dataset is designed for extracting dynamic human features and training multimodal analysis models in a recruitment context.
This dataset reproduces modern hiring scenarios, such as online interviews and video screenings, which have become standard practice in recruitment. The collection features young Japanese adults, playing the role of new graduates, speaking directly to the camera about their strengths and personal experiences. Each video uses a front-facing, waist-up (bust-up) angle typical of online meetings, providing visual and audio data that closely mirrors real-world selection environments.
The content includes two distinct formats: "free-talk," which naturally captures the speaker's emotions and intonation, and "scripted reading," where the speech content is fixed. This dual approach is designed not only to improve Automatic Speech Recognition (ASR) accuracy but also to facilitate the analysis of qualitative communication elements, such as changes in gaze, facial expressions, and speech fluency. Furthermore, because Qlean Dataset offers custom data collection by assigning specific on-camera models (performers), the dataset can be expanded to meet more demanding R&D requirements, such as diversifying speaker attributes or securing longer-form speech data for linguistic modeling.
This release is part of the "AI Data Recipe," Qlean Dataset's lineup of original data solutions for AI development. It is intended to support real-world deployment of AI, ranging from automated recruitment screening to educational and training products for interview preparation. Visual Bank and amanaimages remain committed to supporting the research and development of AI that can accurately understand and analyze human behavior by providing structured data that captures the diverse realities of Japanese life and society.
Dataset Overview
| Item | Details |
|---|---|
| Data Type | Video |
| Subject Attributes | Japanese (young adults/prospective graduates), including gender metadata |
| Data Volume | 5,764.40 MB |
| Quantity | 72 clips |
| Format | mp4 |
| Duration | Approx. 1 minute per clip |
| Recording Environment | Angle: frontal/bust-up (simulating online interviews) |
| Metadata | List format providing gender and "Scripted/Non-scripted" flags |
| Sample Page | |
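
As a rough illustration of what the overview figures imply, the sketch below derives the average clip size and approximate bitrate from the stated volume, clip count, and duration, and shows one way the scripted/non-scripted flag could be used to split clips. The metadata layout is an assumption: the release only says the metadata is a list with gender and "Scripted/Non-scripted" flags, so the CSV form, the column names (`clip_id`, `gender`, `scripted`), and the sample rows here are hypothetical. The calculation also assumes 1 MB means 10^6 bytes and that every clip runs about 60 seconds.

```python
import csv
import io

# Figures stated in the Dataset Overview.
TOTAL_MB = 5764.40
CLIP_COUNT = 72
APPROX_SECONDS_PER_CLIP = 60  # "approx. 1 minute per clip"


def average_clip_stats(total_mb, clip_count, seconds_per_clip):
    """Return (average MB per clip, approximate Mbit/s per clip)."""
    mb_per_clip = total_mb / clip_count
    mbit_per_s = mb_per_clip * 8 / seconds_per_clip
    return mb_per_clip, mbit_per_s


# Hypothetical metadata list; the real file format and column names
# are not specified in the release.
SAMPLE_METADATA = """clip_id,gender,scripted
clip_001,female,true
clip_002,male,false
clip_003,female,false
"""


def split_by_script_flag(csv_text):
    """Split clip IDs into (scripted, free_talk) lists using the flag column."""
    scripted, free_talk = [], []
    for row in csv.DictReader(io.StringIO(csv_text)):
        is_scripted = row["scripted"].strip().lower() == "true"
        (scripted if is_scripted else free_talk).append(row["clip_id"])
    return scripted, free_talk


mb_per_clip, mbit_per_s = average_clip_stats(
    TOTAL_MB, CLIP_COUNT, APPROX_SECONDS_PER_CLIP
)
scripted_ids, free_talk_ids = split_by_script_flag(SAMPLE_METADATA)
print(f"{mb_per_clip:.2f} MB per clip, about {mbit_per_s:.1f} Mbit/s")
print("scripted:", scripted_ids, "free-talk:", free_talk_ids)
```

Under these assumptions the clips average about 80 MB each, or roughly 10.7 Mbit/s, which can help when estimating storage and decoding cost before requesting the data.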

Use Case Examples
【Research & Academia】
Development of Non-Verbal Communication Analysis Models: Can be used for multimodal research to analyze how psychological states—such as tension or confidence—affect facial expressions, eye movement, and speech pitch in high-stakes evaluative settings like job interviews.
【Industrial Applications】
AI-Driven Video Interview Screening in HR Tech: Serves as training data for algorithms that transcribe candidate speech (ASR) or quantify features such as facial brightness and gaze stability to assist in candidate evaluation.
Development of TTS and Voice Conversion Models for Specific Scenarios: Utilizes the unique high-pressure environment of "Self-PR" to train generative AI in reproducing specific emotions and stress levels, or as base data for customizing vocal tones in conversational agents.
Validation of Virtual Backgrounds and Lighting Correction for Web Conferencing: Ideal for evaluating the accuracy of human segmentation and natural skin tone correction algorithms specifically for the bust-up framing used in professional online interactions.
About Qlean Dataset
Qlean Dataset is a commercially cleared AI training data solution provided by amanaimages Inc., a subsidiary of Visual Bank Group. The platform offers diverse data formats, including image, video, audio, 3D, and text, as well as a specialized AI Data Recipe lineup developed through collaborations with major media organizations and data rights holders.
URL: https://qleandataset.visual-bank.co.jp/en

About Visual Bank Inc.
Visual Bank Group is a technology company developing data infrastructure and AI solutions that support advanced AI development. The company operates THE PEN, an AI tool for manga creators, and its subsidiary, amanaimages Inc., provides commercial digital content and AI training data solutions, including Qlean Dataset. Visual Bank is also a selected participant in GENIAC, a Japanese government initiative supporting the advancement of next-generation AI technologies.
CEO: Saneyuki Nagai
Website: https://visual-bank.co.jp/en