Beyond spatial pyramids: A new feature extraction framework with dense spatial sampling for image classification

Yan, Shengye; Xu, Xinxing; Xu, Dong; Lin, Stephen; Li, Xuelong

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/978-3-642-33765-9_34
Scopus: eid_2-s2.0-84867882770
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Conference papers

Conference Paper: Beyond spatial pyramids: A new feature extraction framework with dense spatial sampling for image classification

Title	Beyond spatial pyramids: A new feature extraction framework with dense spatial sampling for image classification
Authors	Yan, Shengye Xu, Xinxing Xu, Dong Lin, Stephen Li, Xuelong
Keywords	Adapted Classifier Image Classification Multiple Kernel Learning Sliding Window Spatial Pyramid
Issue Date	2012
Publisher	Springer
Citation	12th European Conference on Computer Vision (ECCV 2012), Florence, Italy, 7-13 October 2012. In Fitzgibbon, A, Lazebnik, S, Perona, P, et al. (Eds.), Computer Vision - ECCV 2012: 12th European Conference on Computer Vision, Florence, Italy, October 7-13, 2012. Proceedings, Part IV, p. 473-487. Berlin: Springer, 2012 How to Cite? DOI: http://dx.doi.org/10.1007/978-3-642-33765-9_34
Abstract	We introduce a new framework for image classification that extends beyond the window sampling of fixed spatial pyramids to include a comprehensive set of windows densely sampled over location, size and aspect ratio. To effectively deal with this large set of windows, we derive a concise high-level image feature using a two-level extraction method. At the first level, window-based features are computed from local descriptors (e.g., SIFT, spatial HOG, LBP) in a process similar to standard feature extractors. Then at the second level, the new image feature is determined from the window-based features in a manner analogous to the first level. This higher level of abstraction offers both efficient handling of dense samples and reduced sensitivity to misalignment. More importantly, our simple yet effective framework can readily accommodate a large number of existing pooling/coding methods, allowing them to extract features beyond the spatial pyramid representation. To effectively fuse the second level feature with a standard first level image feature for classification, we additionally propose a new learning algorithm, called Generalized Adaptive ℓ p -norm Multiple Kernel Learning (GA-MKL), to learn an adapted robust classifier based on multiple base kernels constructed from image features and multiple sets of pre-learned classifiers of all the classes. Extensive evaluation on the object recognition (Caltech256) and scene recognition (15Scenes) benchmark datasets demonstrates that the proposed method outperforms state-of-the-art image classification algorithms under a broad range of settings. © 2012 Springer-Verlag.
Persistent Identifier	http://hdl.handle.net/10722/321493
ISBN	9783642337642
ISSN	0302-9743 2023 SCImago Journal Rankings: 0.606
Series/Report no.	Lecture Notes in Computer Science ; 7575 LNCS Sublibrary. SL 6, Image Processing, Computer Vision, Pattern Recognition, and Graphics

DC Field	Value	Language
dc.contributor.author	Yan, Shengye	-
dc.contributor.author	Xu, Xinxing	-
dc.contributor.author	Xu, Dong	-
dc.contributor.author	Lin, Stephen	-
dc.contributor.author	Li, Xuelong	-
dc.date.accessioned	2022-11-03T02:19:16Z	-
dc.date.available	2022-11-03T02:19:16Z	-
dc.date.issued	2012	-
dc.identifier.citation	12th European Conference on Computer Vision (ECCV 2012), Florence, Italy, 7-13 October 2012. In Fitzgibbon, A, Lazebnik, S, Perona, P, et al. (Eds.), Computer Vision - ECCV 2012: 12th European Conference on Computer Vision, Florence, Italy, October 7-13, 2012. Proceedings, Part IV, p. 473-487. Berlin: Springer, 2012	-
dc.identifier.isbn	9783642337642	-
dc.identifier.issn	0302-9743	-
dc.identifier.uri	http://hdl.handle.net/10722/321493	-
dc.description.abstract	We introduce a new framework for image classification that extends beyond the window sampling of fixed spatial pyramids to include a comprehensive set of windows densely sampled over location, size and aspect ratio. To effectively deal with this large set of windows, we derive a concise high-level image feature using a two-level extraction method. At the first level, window-based features are computed from local descriptors (e.g., SIFT, spatial HOG, LBP) in a process similar to standard feature extractors. Then at the second level, the new image feature is determined from the window-based features in a manner analogous to the first level. This higher level of abstraction offers both efficient handling of dense samples and reduced sensitivity to misalignment. More importantly, our simple yet effective framework can readily accommodate a large number of existing pooling/coding methods, allowing them to extract features beyond the spatial pyramid representation. To effectively fuse the second level feature with a standard first level image feature for classification, we additionally propose a new learning algorithm, called Generalized Adaptive ℓ p -norm Multiple Kernel Learning (GA-MKL), to learn an adapted robust classifier based on multiple base kernels constructed from image features and multiple sets of pre-learned classifiers of all the classes. Extensive evaluation on the object recognition (Caltech256) and scene recognition (15Scenes) benchmark datasets demonstrates that the proposed method outperforms state-of-the-art image classification algorithms under a broad range of settings. © 2012 Springer-Verlag.	-
dc.language	eng	-
dc.publisher	Springer	-
dc.relation.ispartof	Computer Vision - ECCV 2012: 12th European Conference on Computer Vision, Florence, Italy, October 7-13, 2012. Proceedings, Part IV	-
dc.relation.ispartofseries	Lecture Notes in Computer Science ; 7575	-
dc.relation.ispartofseries	LNCS Sublibrary. SL 6, Image Processing, Computer Vision, Pattern Recognition, and Graphics	-
dc.subject	Adapted Classifier	-
dc.subject	Image Classification	-
dc.subject	Multiple Kernel Learning	-
dc.subject	Sliding Window	-
dc.subject	Spatial Pyramid	-
dc.title	Beyond spatial pyramids: A new feature extraction framework with dense spatial sampling for image classification	-
dc.type	Conference_Paper	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1007/978-3-642-33765-9_34	-
dc.identifier.scopus	eid_2-s2.0-84867882770	-
dc.identifier.spage	473	-
dc.identifier.epage	487	-
dc.identifier.eissn	1611-3349	-
dc.publisher.place	Berlin	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Conference Paper: Beyond spatial pyramids: A new feature extraction framework with dense spatial sampling for image classification

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats