Dahyun Kang | Minsu Cho | ||
Pohang University of Science and Technology (POSTECH), South Korea |
We introduce the integrative task of few-shot classification and segmentation (FS-CS) that aims to both classify and segment target objects in a query image when the target classes are given with a few examples. This task combines two conventional few-shot learning problems, few-shot classification and segmentation. FS-CS generalizes them to more realistic episodes with arbitrary image pairs, where each target class may or may not be present in the query. To address the task, we propose the integrative few-shot learning (iFSL) framework for FS-CS, which trains a learner to construct class-wise foreground maps for multi-label classification and pixel-wise segmentation. We also develop an effective iFSL model, attentive squeeze network (ASNet), that leverages deep semantic correlation and global self-attention to produce reliable foreground maps. In experiments, the proposed method shows promising performance on the FS-CS task and also achieves the state of the art on standard few-shot segmentation benchmarks.
1. A new task of integrative few-shot classification and segmentation (FS-CS)
which combines few-shot classification and few-shot segmentation into an integrative task by addressing their limitations.
2. A new learning framework of integrative few-shot learning framework (iFSL)
which learns to both classify and segment a query image using class-wise foreground maps.
3. A new neural architecture of attentive squeeze network (ASNet)
which squeezes semantic correlations into a foreground map for iFSL via strided global self-attention.
(b) | (c) |
(a) Multi-way classification and segmentation. | (b) Task transfer between FS-S, FS-C, and FS-CS. |
This work was supported by Samsung Advanced Institute of Technology (SAIT) and also by Center for Applied Research in Artificial Intelligence (CARAI) grant funded by DAPA and ADD (UD190031RD).
Check our GitHub repository: [GitHub]