File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICCV48922.2021.00334
- WOS: WOS:000797698903054
Supplementary
-
Citations:
- Web of Science: 0
- Appears in Collections:
Conference Paper: Multi-scale Matching Networks for Semantic Correspondence
Title | Multi-scale Matching Networks for Semantic Correspondence |
---|---|
Authors | |
Keywords | Computer vision Codes Fuses Semantics Buildings |
Issue Date | 2021 |
Publisher | IEEE Computer Society. |
Citation | ICCV Workshop on Deep Multi-Task Learning in Computer Vision (Virtual), Montreal, QC, Canada, October 11-17, 2021. In Proceedings: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW 2021), p. 3334-3344 How to Cite? |
Abstract | Deep features have been proven powerful in building accurate dense semantic correspondences in various previous works. However, the multi-scale and pyramidal hierarchy of convolutional neural networks has not been well studied to learn discriminative pixel-level features for semantic correspondence. In this paper, we propose a multi-scale matching network that is sensitive to tiny semantic differences between neighboring pixels. We follow the coarse-to-fine matching strategy and build a top-down feature and matching enhancement scheme that is coupled with the multi-scale hierarchy of deep convolutional neural networks. During feature enhancement, intra-scale enhancement fuses same-resolution feature maps from multiple layers together via local self-attention and cross-scale enhancement hallucinates higher-resolution feature maps along the top-down pathway. Besides, we learn complementary matching details at different scales thus the overall matching score is refined by features of different semantic levels gradually. Our multi-scale matching network can be trained end-to-end easily with few additional learnable parameters. Experimental results demonstrate that the proposed method achieves state-of-the-art performance on three popular benchmarks with high computational efficiency. The code has been released at https://github.com/wintersun661/MMNet. |
Persistent Identifier | http://hdl.handle.net/10722/316359 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zhao, D | - |
dc.contributor.author | Song, Z | - |
dc.contributor.author | Ji, Z | - |
dc.contributor.author | Zhao, G | - |
dc.contributor.author | Ge, W | - |
dc.contributor.author | Yu, Y | - |
dc.date.accessioned | 2022-09-02T06:10:06Z | - |
dc.date.available | 2022-09-02T06:10:06Z | - |
dc.date.issued | 2021 | - |
dc.identifier.citation | ICCV Workshop on Deep Multi-Task Learning in Computer Vision (Virtual), Montreal, QC, Canada, October 11-17, 2021. In Proceedings: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW 2021), p. 3334-3344 | - |
dc.identifier.uri | http://hdl.handle.net/10722/316359 | - |
dc.description.abstract | Deep features have been proven powerful in building accurate dense semantic correspondences in various previous works. However, the multi-scale and pyramidal hierarchy of convolutional neural networks has not been well studied to learn discriminative pixel-level features for semantic correspondence. In this paper, we propose a multi-scale matching network that is sensitive to tiny semantic differences between neighboring pixels. We follow the coarse-to-fine matching strategy and build a top-down feature and matching enhancement scheme that is coupled with the multi-scale hierarchy of deep convolutional neural networks. During feature enhancement, intra-scale enhancement fuses same-resolution feature maps from multiple layers together via local self-attention and cross-scale enhancement hallucinates higher-resolution feature maps along the top-down pathway. Besides, we learn complementary matching details at different scales thus the overall matching score is refined by features of different semantic levels gradually. Our multi-scale matching network can be trained end-to-end easily with few additional learnable parameters. Experimental results demonstrate that the proposed method achieves state-of-the-art performance on three popular benchmarks with high computational efficiency. The code has been released at https://github.com/wintersun661/MMNet. | - |
dc.language | eng | - |
dc.publisher | IEEE Computer Society. | - |
dc.relation.ispartof | Proceedings: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW 2021) | - |
dc.rights | Proceedings: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW 2021). Copyright © IEEE Computer Society. | - |
dc.subject | Computer vision | - |
dc.subject | Codes | - |
dc.subject | Fuses | - |
dc.subject | Semantics | - |
dc.subject | Buildings | - |
dc.title | Multi-scale Matching Networks for Semantic Correspondence | - |
dc.type | Conference_Paper | - |
dc.identifier.email | Yu, Y: yzyu@cs.hku.hk | - |
dc.identifier.authority | Yu, Y=rp01415 | - |
dc.identifier.doi | 10.1109/ICCV48922.2021.00334 | - |
dc.identifier.hkuros | 336342 | - |
dc.identifier.spage | 3334 | - |
dc.identifier.epage | 3344 | - |
dc.identifier.isi | WOS:000797698903054 | - |
dc.publisher.place | United States | - |