2/17/2026

Qlean Dataset Launches Construction Site Worker & Heavy Equipment Image Dataset

Visual Bank Inc. (Minato-ku, Tokyo; CEO: Saneyuki Nagai), through its subsidiary amanaimages Inc., has released the Construction Site Worker & Heavy Equipment Image Dataset under its AI training data solution, Qlean Dataset. 

The dataset supports pre-training and fine-tuning of generative and multimodal foundation models, as well as AI development in areas such as object detection, human detection, image classification, and semantic segmentation.

The dataset contains images captured at real construction sites, featuring heavy machinery—including excavators, bulldozers, and dump trucks—and workers wearing helmets, work uniforms, and reflective safety vests. Machinery is photographed with most of the body visible within the frame, enabling recognition of equipment-specific visual characteristics. Worker images reflect typical safety gear and site conditions, supporting person detection and recognition tasks in construction environments.

Scenes include machinery, materials, scaffolding, and other structural elements, allowing model evaluation under complex background conditions. Each image is accompanied by structured metadata to support flexible data selection and annotation design.

All data is commercially cleared for enterprise use, supporting foundation model development and applied AI implementation in the construction and infrastructure sectors.

Overview of the Construction Site Worker & Heavy Equipment Image

Data Type

image

Subject Attributes

① Construction machinery (power shovels, bulldozers, dump trucks, heavy machinery, etc.)
② On-site workers (wearing helmets, work clothes, safety reflective vests, hand towels, etc.)

Data Size

1.55GB

Number of Files

496

Data Format

jpeg/png

Inclusion Requirements

1. Construction Machinery
- Power shovels, bulldozers, dump trucks, heavy machinery, etc.
- At least two-thirds of the equipment must be included.
- People: No requirement.
2. Site Workers
- Wearing helmets, work clothes, reflective safety vests, hand towels, etc.
- At least two-thirds of the body must be included (face or entire body not required).

Other

Metadata available.

Sample Details

https://qleandataset.visual-bank.co.jp/en/lineup/sp-004

Use Case Examples

[Research Applications]

  • Additional training for construction-specific multimodal foundation models
    The dataset can be used as supplementary training data to teach foundation models visual concepts unique to construction sites, such as heavy machinery and safety equipment. It supports domain adaptation and transfer learning experiments for specialized environments.

[Industrial Applications]

  • Development of AI systems for construction site safety management
    The dataset can be used to train models that detect workers and heavy machinery from site footage. It supports development of functions such as proximity monitoring between equipment and personnel and object detection within designated areas.

  • Heavy equipment type classification models
    The dataset can be used to train classification models for machinery types such as excavators and bulldozers, enabling automatic identification of equipment categories from construction site images.

[Additional Practical Applications]

  • AI implementation training data for construction DX talent development
    The dataset can be used in AI training programs for the construction industry, supporting hands-on exercises in object detection and image classification using real construction site imagery.

About Qlean Dataset

Qlean Dataset is a commercial-use-ready AI training data solution provided by Amana Images Inc., a subsidiary of Visual Bank Inc.
It supports a wide range of data types, including images, videos, audio, 3D assets, and text, enabling both research and commercial AI development in a legally safe environment.
Through collaborations with data partners such as Chiba Lotte Marines Co., Ltd. and Toyo Keizai Inc., Qlean Dataset continues to expand its specialized, industry-focused lineup known as the “AI Data Recipe.”
By reducing the operational burden of data collection and preparation, Qlean Dataset helps organizations establish AI development environments that are both legally compliant and risk-free.

▶ Qlean Dataset: https://qleandataset.visual-bank.co.jp/en
▶ AI Data Recipe: https://qleandataset.visual-bank.co.jp/en/lineup

Key Features of Qlean Dataset

  • Existing datasets deliverable within one business day

  • Custom data collection and recording services available

▶ Contact: https://qleandataset.visual-bank.co.jp/en/contact

About Visual Bank Inc.

Visual Bank Inc. is a Tokyo-based startup building Next-Generation Data infrastructure to enhance AI development capabilities under the mission “Unlocking Data Accessibility.”
The company operates THE PEN, an AI-assisted creative tool for manga artists and the Qlean Dataset service.
Its subsidiaries include Amana Images Inc., one of Japan’s largest photostock providers; Qlean Dataset, which leads research and development in AI data; and THE PEN Inc., an AI-assisted creative tool for manga artists.

CEO: Saneyuki Nagai
Address: 6F, C-Cube Minami Aoyama Building, 7-1-7 Minami-Aoyama, Minato-ku, Tokyo 107-0062
Corporate Site: https://visual-bank.co.jp/en
Amana Images: https://qleandataset.visual-bank.co.jp/en/company-overview

    amana images inc.

    Visual Bank Inc.


    © amanaimages inc.