File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Video classification via relational feature encoding networks

TitleVideo classification via relational feature encoding networks
Authors
KeywordsVideo classification
Relational Feature Encoding
Temporal aggregation
Temporal segment networks
Issue Date2017
Citation
LSVC 2017 - Proceedings of the Workshop on Large-Scale Video Classification Challenge, co-located with MM 2017, 2017, p. 9-13 How to Cite?
Abstract© 2017 Association for Computing Machinery. In this paper, we propose a novel Relational Feature Encoding Network for video classification. The proposed network uses a set of relational functions wired on top of a backbone convolutional neural network (ConvNet) to generate multiple complementary feature streams on the fy, which are then combined by an aggregation module to form a video-level representation for recognition. The relational functions compute new relational features by applying element-wise operations or a simple projection to pairs of raw ConvNet features, and thus encode the underlying temporal dynamics and relationship of contextual frames which are critical for recognizing video contents. In this work, we explore a number of design choices for both the relational functions and the aggregation functions, and evaluate the resulting deep model on a number of video classification benchmarks, including the extended Fudan-Columbia Video dataset, UCF101, and Kinetics. Experimental results demonstrate that our model is not only well-suited for action recognition, but also exhibits promising performance for general videos.
Persistent Identifierhttp://hdl.handle.net/10722/273731

 

DC FieldValueLanguage
dc.contributor.authorZhou, Yao-
dc.contributor.authorFeng, Litong-
dc.contributor.authorRen, Jiamin-
dc.contributor.authorQiu, Shi-
dc.contributor.authorLi, Jingyu-
dc.contributor.authorLuo, Ping-
dc.date.accessioned2019-08-12T09:56:30Z-
dc.date.available2019-08-12T09:56:30Z-
dc.date.issued2017-
dc.identifier.citationLSVC 2017 - Proceedings of the Workshop on Large-Scale Video Classification Challenge, co-located with MM 2017, 2017, p. 9-13-
dc.identifier.urihttp://hdl.handle.net/10722/273731-
dc.description.abstract© 2017 Association for Computing Machinery. In this paper, we propose a novel Relational Feature Encoding Network for video classification. The proposed network uses a set of relational functions wired on top of a backbone convolutional neural network (ConvNet) to generate multiple complementary feature streams on the fy, which are then combined by an aggregation module to form a video-level representation for recognition. The relational functions compute new relational features by applying element-wise operations or a simple projection to pairs of raw ConvNet features, and thus encode the underlying temporal dynamics and relationship of contextual frames which are critical for recognizing video contents. In this work, we explore a number of design choices for both the relational functions and the aggregation functions, and evaluate the resulting deep model on a number of video classification benchmarks, including the extended Fudan-Columbia Video dataset, UCF101, and Kinetics. Experimental results demonstrate that our model is not only well-suited for action recognition, but also exhibits promising performance for general videos.-
dc.languageeng-
dc.relation.ispartofLSVC 2017 - Proceedings of the Workshop on Large-Scale Video Classification Challenge, co-located with MM 2017-
dc.subjectVideo classification-
dc.subjectRelational Feature Encoding-
dc.subjectTemporal aggregation-
dc.subjectTemporal segment networks-
dc.titleVideo classification via relational feature encoding networks-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1145/3134263.3134265-
dc.identifier.scopuseid_2-s2.0-85035790237-
dc.identifier.spage9-
dc.identifier.epage13-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats