File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Patch-Set-Based Representation for Alignment-Free Image Set Classification

TitlePatch-Set-Based Representation for Alignment-Free Image Set Classification
Authors
KeywordsAlignment free
image set classification
patch-set-based representation
Video-Based face recognition
Issue Date2016
Citation
IEEE Transactions on Circuits and Systems for Video Technology, 2016, v. 26, n. 9, p. 1646-1658 How to Cite?
AbstractThis paper presents a patch-set-based sparse representation for image set classification. Compared with image-based image set representation, our patch-set-based representation is alignment free and thus has an advantage for tasks like video-based face recognition, image-set-based object recognition, and video-based hand gesture recognition, where precious alignment is usually difficult or even impossible due to large variance in view angle or pose. Specifically, to bypass the alignment issue, we propose to adopt the patch-based image set representation by dividing each image within each set into patches, then we cluster all the training patches into multiple clusters and classify the test patches based on the cluster centers of training patches. The labels of test patches within each cluster are inferred from a patch-set-based sparse representation for classification, and the labels of all test patches from all the clusters are then aggregated to predict a single label for the test set. Experimental results on video-based face recognition data sets (CMU-MoBo and YouTube Celebrities), image-set-based object recognition data set (ETH-80), and video-based hand gesture recognition data set (Kinect Hand Gestures) demonstrate that our proposed method consistently outperforms all existing ones, and the improvement is very significant on the YouTube Celebrities and Kinect Hand Gesture data sets. Moreover, we also quantitatively show the robustness of our method to misalignment on the Mutli-PIE data set.
Persistent Identifierhttp://hdl.handle.net/10722/344979
ISSN
2023 Impact Factor: 8.3
2023 SCImago Journal Rankings: 2.299

 

DC FieldValueLanguage
dc.contributor.authorGao, Shenghua-
dc.contributor.authorZeng, Zinan-
dc.contributor.authorJia, Kui-
dc.contributor.authorChan, Tsung Han-
dc.contributor.authorTang, Jinhui-
dc.date.accessioned2024-08-15T09:24:28Z-
dc.date.available2024-08-15T09:24:28Z-
dc.date.issued2016-
dc.identifier.citationIEEE Transactions on Circuits and Systems for Video Technology, 2016, v. 26, n. 9, p. 1646-1658-
dc.identifier.issn1051-8215-
dc.identifier.urihttp://hdl.handle.net/10722/344979-
dc.description.abstractThis paper presents a patch-set-based sparse representation for image set classification. Compared with image-based image set representation, our patch-set-based representation is alignment free and thus has an advantage for tasks like video-based face recognition, image-set-based object recognition, and video-based hand gesture recognition, where precious alignment is usually difficult or even impossible due to large variance in view angle or pose. Specifically, to bypass the alignment issue, we propose to adopt the patch-based image set representation by dividing each image within each set into patches, then we cluster all the training patches into multiple clusters and classify the test patches based on the cluster centers of training patches. The labels of test patches within each cluster are inferred from a patch-set-based sparse representation for classification, and the labels of all test patches from all the clusters are then aggregated to predict a single label for the test set. Experimental results on video-based face recognition data sets (CMU-MoBo and YouTube Celebrities), image-set-based object recognition data set (ETH-80), and video-based hand gesture recognition data set (Kinect Hand Gestures) demonstrate that our proposed method consistently outperforms all existing ones, and the improvement is very significant on the YouTube Celebrities and Kinect Hand Gesture data sets. Moreover, we also quantitatively show the robustness of our method to misalignment on the Mutli-PIE data set.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Circuits and Systems for Video Technology-
dc.subjectAlignment free-
dc.subjectimage set classification-
dc.subjectpatch-set-based representation-
dc.subjectVideo-Based face recognition-
dc.titlePatch-Set-Based Representation for Alignment-Free Image Set Classification-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TCSVT.2015.2469571-
dc.identifier.scopuseid_2-s2.0-84986588622-
dc.identifier.volume26-
dc.identifier.issue9-
dc.identifier.spage1646-
dc.identifier.epage1658-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats