Attentive Contextual Network for Image Captioning

J. Prudviraj; C. Vishnu; C Krishna Mohan

doi:10.1109/IJCNN52387.2021.9533970

Profiles Research Units Publications

Conferences

Attentive Contextual Network for Image Captioning

J. Prudviraj, C. Vishnu,

Published in Institute of Electrical and Electronics Engineers Inc.

2021

DOI: 10.1109/IJCNN52387.2021.9533970

Volume: 2021-July

Abstract

Existing image captioning approaches fail to generate fine-grained captions due to the lack of rich encoding representation of an image. In this paper, we present an attentive contextual network (ACN) to learn the spatially transformed image features and dense multi-scale contextual information of an image to generate semantically meaningful captions. At first, we construct deformable network on intermediate layers of convolutional neural network (CNN) to cultivate spatial invariant features. And the multi-scale contextual features are produced by employing contextual network on top of last layers of CNN. Then, we exploit attention mechanism on contextual network to extract dense contextual features. Further, the extracted spatial and contextual features are combined to encode the holistic representation of an image. Finally, a multi-stage caption decoder with visual attention module is incorporated to generate fine-grained captions. The performance of the proposed approach is demonstrated on COCO dataset, the largest dataset for image captioning. © 2021 IEEE.

About the journal

Journal	Data powered by TypesetProceedings of the International Joint Conference on Neural Networks
Publisher	Data powered by TypesetInstitute of Electrical and Electronics Engineers Inc.

Authors (1)

C Krishna Mohan
- Department of Computer Science and Engineering

ACADEMICS

FACILITIES

CAMPUS LIFE

COUNCILS

QUICK LINKS