Multimodal salient object detection via adversarial learning with collaborative generator

Research Output

Multimodal salient object detection(MSOD), which utilizes multimodal information (e.g., RGB image and thermal infrared or depth image) to detect common salient objects, has received much attention recently. Different modalities reflect different appearance properties of salient objects, some of which could contribute to improving the precision and/or recall of MSOD. To greatly improve both Precision and Recall by fully exploring multimodal data, in this work, we propose an effective adversarial learning framework based on a novel collaborative generator for accurate multimodal salient object detection. In particular, the collaborative generator consists of three generators (generator1, generator2 and generator3), which aim at decreasing the false positive and false negative of the generated saliency maps and improving F-measure of the final saliency maps respectively. Generator1 and generator2 contain two encoder–decoder networks for multimodal inputs, and we propose a new co-attention model to perform adaptive interactions between different modalities. Furthermore, we apply generator3 to integrate feature maps from generator1 and generator2 in a complementary way. Through adversarially learning the collaborative generator and discriminator, both Precision and Recall of the predicted maps are boosted with the complementary benefits of multimodal data. Extensive experiments on three RGBT datasets and six RGBD datasets show that our method performs quite well against state-of-the-art MSOD methods.

Type:

Article
Date:

19 December 2022
Publication Status:

Published
Publisher

Elsevier BV
DOI:

10.1016/j.engappai.2022.105707
ISSN:

0952-1976
Funders:

EPSRC Engineering and Physical Sciences Research Council; Engineering and Physical Sciences Research Council; New Funder

http://researchrepository.napier.ac.uk/output/3020268 Tu, Z., Yang, W., Wang, K., Hussain, A., Luo, B., & Li, C. (2023). Multimodal salient object detection via adversarial learning with collaborative generator. Engineering Applications of Artificial Intelligence, 119, Article 105707. https://doi.org/10.1016/j.engappai.2022.105707

Citation

Tu, Z., Yang, W., Wang, K., Hussain, A., Luo, B., & Li, C. (2023). Multimodal salient object detection via adversarial learning with collaborative generator. Engineering Applications of Artificial Intelligence, 119, Article 105707. https://doi.org/10.1016/j.engappai.2022.105707

Authors

Prof Amir Hussain

Professor
School of Computing Engineering and the Built Environment

0131 455 2239

A.Hussain@napier.ac.uk

Keywords

Multimodal salient object detection, Collaborative generator, Adversarial learning

Monthly Views:

Linked Projects

Available Documents

Files currently unavailable for download , please contact repository@napier.ac.uk to request a copy
Downloadable citations
HTML BIB RTF

Type:

Date:

Publication Status:

Publisher

DOI:

ISSN:

Funders: