Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TCSVT.2015.2469571
- Scopus: eid_2-s2.0-84986588622
Article: Patch-Set-Based Representation for Alignment-Free Image Set Classification
Title | Patch-Set-Based Representation for Alignment-Free Image Set Classification |
---|---|
Authors | Gao, Shenghua; Zeng, Zinan; Jia, Kui; Chan, Tsung Han; Tang, Jinhui |
Keywords | Alignment free; image set classification; patch-set-based representation; Video-Based face recognition |
Issue Date | 2016 |
Citation | IEEE Transactions on Circuits and Systems for Video Technology, 2016, v. 26, n. 9, p. 1646-1658 |
Abstract | This paper presents a patch-set-based sparse representation for image set classification. Compared with image-based image set representation, our patch-set-based representation is alignment free and thus has an advantage for tasks like video-based face recognition, image-set-based object recognition, and video-based hand gesture recognition, where precise alignment is usually difficult or even impossible due to large variance in view angle or pose. Specifically, to bypass the alignment issue, we propose to adopt the patch-based image set representation by dividing each image within each set into patches, then we cluster all the training patches into multiple clusters and classify the test patches based on the cluster centers of training patches. The labels of test patches within each cluster are inferred from a patch-set-based sparse representation for classification, and the labels of all test patches from all the clusters are then aggregated to predict a single label for the test set. Experimental results on video-based face recognition data sets (CMU-MoBo and YouTube Celebrities), image-set-based object recognition data set (ETH-80), and video-based hand gesture recognition data set (Kinect Hand Gestures) demonstrate that our proposed method consistently outperforms all existing ones, and the improvement is very significant on the YouTube Celebrities and Kinect Hand Gesture data sets. Moreover, we also quantitatively show the robustness of our method to misalignment on the Multi-PIE data set. |
Persistent Identifier | http://hdl.handle.net/10722/344979 |
ISSN | 1051-8215 (2023 Impact Factor: 8.3; 2023 SCImago Journal Rankings: 2.299) |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Gao, Shenghua | - |
dc.contributor.author | Zeng, Zinan | - |
dc.contributor.author | Jia, Kui | - |
dc.contributor.author | Chan, Tsung Han | - |
dc.contributor.author | Tang, Jinhui | - |
dc.date.accessioned | 2024-08-15T09:24:28Z | - |
dc.date.available | 2024-08-15T09:24:28Z | - |
dc.date.issued | 2016 | - |
dc.identifier.citation | IEEE Transactions on Circuits and Systems for Video Technology, 2016, v. 26, n. 9, p. 1646-1658 | - |
dc.identifier.issn | 1051-8215 | - |
dc.identifier.uri | http://hdl.handle.net/10722/344979 | - |
dc.description.abstract | This paper presents a patch-set-based sparse representation for image set classification. Compared with image-based image set representation, our patch-set-based representation is alignment free and thus has an advantage for tasks like video-based face recognition, image-set-based object recognition, and video-based hand gesture recognition, where precise alignment is usually difficult or even impossible due to large variance in view angle or pose. Specifically, to bypass the alignment issue, we propose to adopt the patch-based image set representation by dividing each image within each set into patches, then we cluster all the training patches into multiple clusters and classify the test patches based on the cluster centers of training patches. The labels of test patches within each cluster are inferred from a patch-set-based sparse representation for classification, and the labels of all test patches from all the clusters are then aggregated to predict a single label for the test set. Experimental results on video-based face recognition data sets (CMU-MoBo and YouTube Celebrities), image-set-based object recognition data set (ETH-80), and video-based hand gesture recognition data set (Kinect Hand Gestures) demonstrate that our proposed method consistently outperforms all existing ones, and the improvement is very significant on the YouTube Celebrities and Kinect Hand Gesture data sets. Moreover, we also quantitatively show the robustness of our method to misalignment on the Multi-PIE data set. | - |
dc.language | eng | - |
dc.relation.ispartof | IEEE Transactions on Circuits and Systems for Video Technology | - |
dc.subject | Alignment free | - |
dc.subject | image set classification | - |
dc.subject | patch-set-based representation | - |
dc.subject | Video-Based face recognition | - |
dc.title | Patch-Set-Based Representation for Alignment-Free Image Set Classification | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/TCSVT.2015.2469571 | - |
dc.identifier.scopus | eid_2-s2.0-84986588622 | - |
dc.identifier.volume | 26 | - |
dc.identifier.issue | 9 | - |
dc.identifier.spage | 1646 | - |
dc.identifier.epage | 1658 | - |
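
The pipeline outlined in the abstract (split images into patches, cluster the training patches, route each test patch to its nearest cluster center, classify it inside that cluster, then aggregate the patch labels into one set label) can be illustrated with a rough sketch. This is not the authors' implementation. The function names, the plain k-means step, the non-overlapping patch grid, and all parameter values are assumptions made for illustration, and the paper's patch-set-based sparse representation is replaced here by a simpler per-class ridge reconstruction residual.

```python
import numpy as np

def extract_patches(image, size=4, stride=4):
    """Cut a 2-D image into flattened size x size patches."""
    image = np.asarray(image, dtype=float)
    h, w = image.shape
    return np.array([image[r:r + size, c:c + size].ravel()
                     for r in range(0, h - size + 1, stride)
                     for c in range(0, w - size + 1, stride)])

def nearest_center(x, centers):
    """Index of the closest center (squared Euclidean) for each row of x."""
    d = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd's k-means (an assumed choice); returns the centers."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        assign = nearest_center(x, centers)
        for j in range(k):
            if np.any(assign == j):
                centers[j] = x[assign == j].mean(axis=0)
    return centers

def residual(patch, atoms, lam=0.1):
    """Ridge reconstruction error of `patch` from the rows of `atoms`.

    Stand-in for the sparse-representation step described in the abstract.
    """
    gram = atoms @ atoms.T + lam * np.eye(len(atoms))
    coef = np.linalg.solve(gram, atoms @ patch)
    return np.linalg.norm(patch - atoms.T @ coef)

def classify_set(test_images, train_sets, k=4, size=4, lam=0.1):
    """Predict one label for a set of test images.

    train_sets maps each label to a list of 2-D training images.
    """
    names = list(train_sets)
    patches, ids = [], []
    for i, name in enumerate(names):
        for img in train_sets[name]:
            p = extract_patches(img, size, size)
            patches.append(p)
            ids.extend([i] * len(p))
    patches, ids = np.vstack(patches), np.array(ids)

    centers = kmeans(patches, k)
    train_assign = nearest_center(patches, centers)
    test = np.vstack([extract_patches(img, size, size) for img in test_images])
    test_assign = nearest_center(test, centers)

    votes = np.zeros(len(names), dtype=int)
    for patch, cluster in zip(test, test_assign):
        in_cluster = train_assign == cluster
        best, best_err = None, np.inf
        # Compare only against classes that have training patches in this cluster.
        for i in np.unique(ids[in_cluster]):
            err = residual(patch, patches[in_cluster & (ids == i)], lam)
            if err < best_err:
                best, best_err = i, err
        if best is not None:  # skip patches routed to an empty cluster
            votes[best] += 1
    return names[int(votes.argmax())]  # majority vote over patch labels
```

Because every decision is made per patch and only the final vote is taken per set, nothing in this sketch requires the images in a set to be registered against each other, which is the property the abstract refers to as alignment free. The vote here is an unweighted majority; how the paper aggregates patch labels in detail is not stated in this record.
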