File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Image tagging by joint deep visual-semantic propagation

TitleImage tagging by joint deep visual-semantic propagation
Authors
KeywordsCNN-LSTM
Image tagging
Visual-semantic
Issue Date2018
PublisherSpringer.
Citation
18th Pacific Rim Conference on Multimedia, Harbin, China, 28-29 September 2017. In Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I, p. 25-35. Cham: Springer, 2018 How to Cite?
AbstractImage tagging has attracted much research interest due to its wide applications. Many existing methods have gained impressive results, however, they have two main limitations: (1) only focus on tagging images, but ignore the tags’ influences on visual feature modeling. (2) model the tag correlation without considering visual contents of image. In this paper, we propose a joint visual-semantic propagation model (JVSP) to address these two issues. First, we leverage a joint visual-semantic modeling to harvest integrated features which can accurately reflect the relationship between tags and image regions. Second, we introduce a visual-guided LSTM to capture the co-occurrence relation of the tags. Third, we also design a diversity loss to enforce that our model learns to focus on different regions. Experimental results on three challenging datasets demonstrate that our proposed method leads to significant performance gains over existing methods.
Persistent Identifierhttp://hdl.handle.net/10722/307235
ISBN
ISSN
2023 SCImago Journal Rankings: 0.606
ISI Accession Number ID
Series/Report no.Lecture Notes in Computer Science ; 10735

 

DC FieldValueLanguage
dc.contributor.authorMa, Yuexin-
dc.contributor.authorZhu, Xinge-
dc.contributor.authorSun, Yujing-
dc.contributor.authorYan, Bingzheng-
dc.date.accessioned2021-11-03T06:22:12Z-
dc.date.available2021-11-03T06:22:12Z-
dc.date.issued2018-
dc.identifier.citation18th Pacific Rim Conference on Multimedia, Harbin, China, 28-29 September 2017. In Advances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I, p. 25-35. Cham: Springer, 2018-
dc.identifier.isbn9783319773797-
dc.identifier.issn0302-9743-
dc.identifier.urihttp://hdl.handle.net/10722/307235-
dc.description.abstractImage tagging has attracted much research interest due to its wide applications. Many existing methods have gained impressive results, however, they have two main limitations: (1) only focus on tagging images, but ignore the tags’ influences on visual feature modeling. (2) model the tag correlation without considering visual contents of image. In this paper, we propose a joint visual-semantic propagation model (JVSP) to address these two issues. First, we leverage a joint visual-semantic modeling to harvest integrated features which can accurately reflect the relationship between tags and image regions. Second, we introduce a visual-guided LSTM to capture the co-occurrence relation of the tags. Third, we also design a diversity loss to enforce that our model learns to focus on different regions. Experimental results on three challenging datasets demonstrate that our proposed method leads to significant performance gains over existing methods.-
dc.languageeng-
dc.publisherSpringer.-
dc.relation.ispartofAdvances in Multimedia Information Processing – PCM 2017:18th Pacific-Rim Conference on Multimedia, Harbin, China, September 28-29, 2017, Revised Selected Papers, Part I-
dc.relation.ispartofseriesLecture Notes in Computer Science ; 10735-
dc.subjectCNN-LSTM-
dc.subjectImage tagging-
dc.subjectVisual-semantic-
dc.titleImage tagging by joint deep visual-semantic propagation-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1007/978-3-319-77380-3_3-
dc.identifier.scopuseid_2-s2.0-85047464475-
dc.identifier.spage25-
dc.identifier.epage35-
dc.identifier.eissn1611-3349-
dc.identifier.isiWOS:000460422000003-
dc.publisher.placeCham-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats