STAIR: Software Technology and Artificial Intelligence Research Laboratory

STAIR Captions : A Large-Scale Japanese Image Caption Dataset STAIR Captions : A Large-Scale Japanese Image Caption Dataset

2017.04.05

Akikazu Takeuchi, Yuya Yoshikawa, Yutaro Shigeto

In recent years, automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention. Most studies on image captioning target English language, and there are few image caption datasets in Japanese. To tackle this problem, we construct a large-scale Japanese image caption dataset based on images from MS-COCO. Our dataset consists of 820,310 Japanese captions for 164,062 images.

For more information, see http://captions.stair.center

Members

竹内彰一 Akikazu Takeuchi
吉川友也 Yuya Yoshikawa
重藤優太郎 Yutaro Shigeto

STAIR Captions : A Large-Scale Japanese Image Caption Dataset STAIR Captions : A Large-Scale Japanese Image Caption Dataset

Members

関連記事Related Articles

STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset (ACL2017 Short) Constructing Large-Scale Japanese Image Caption Dataset (ACL2017 Short)

STAIR Captions: 大規模日本語画像キャプションデータセット (NLP2017) STAIR Captions: a Large-Scale Japanese Image Caption Dataset (NLP2017)

日本語画像キャプションデータセット「STAIR Captions」公開 STAIR Captions: a new large-scale Japanese image caption dataset