File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Semi-supervised Cycle-GAN for face photo-sketch translation in the wild

TitleSemi-supervised Cycle-GAN for face photo-sketch translation in the wild
Authors
KeywordsCycle-GAN
Face photo-sketch translation
Semi-supervised
Steganography
Issue Date1-Oct-2023
PublisherElsevier
Citation
Computer Vision and Image Understanding, 2023, v. 235 How to Cite?
Abstract

The performance of face photo-sketch translation has improved a lot thanks to deep neural networks. GAN based methods trained on paired images can produce high-quality results under laboratory settings. Such paired datasets are, however, often very small and lack diversity. Meanwhile, Cycle-GANs trained with unpaired photo-sketch datasets suffer from the steganography phenomenon, which makes them not effective to face photos in the wild. In this paper, we introduce a semi-supervised approach with a noise-injection strategy, named Semi-Cycle-GAN (SCG), to tackle these problems. For the first problem, we propose a pseudo sketch feature representation for each input photo composed from a small reference set of photo-sketch pairs, and use the resulting pseudo pairs to supervise a photo-to-sketch generator Gp2s. The outputs of Gp2s can in turn help to train a sketch-to-photo generator Gs2p in a self-supervised manner. This allows us to train Gp2s and Gs2p using a small reference set of photo-sketch pairs together with a large face photo dataset (without ground-truth sketches). For the second problem, we show that the simple noise-injection strategy works well to alleviate the steganography effect in SCG and helps to produce more reasonable sketch-to-photo results with less overfitting than fully supervised approaches. Experiments show that SCG achieves competitive performance on public benchmarks and superior results on photos in the wild.


Persistent Identifierhttp://hdl.handle.net/10722/331394
ISSN
2021 Impact Factor: 4.886
2020 SCImago Journal Rankings: 0.854

 

DC FieldValueLanguage
dc.contributor.authorChen, Chaofeng-
dc.contributor.authorLiu, Wei-
dc.contributor.authorTan, Xiao-
dc.contributor.authorWong, Kwan-Yee-
dc.date.accessioned2023-09-21T06:55:20Z-
dc.date.available2023-09-21T06:55:20Z-
dc.date.issued2023-10-01-
dc.identifier.citationComputer Vision and Image Understanding, 2023, v. 235-
dc.identifier.issn1077-3142-
dc.identifier.urihttp://hdl.handle.net/10722/331394-
dc.description.abstract<p>The performance of face photo-sketch translation has improved a lot thanks to deep neural networks. GAN based methods trained on paired images can produce high-quality results under laboratory settings. Such paired datasets are, however, often very small and lack diversity. Meanwhile, Cycle-GANs trained with unpaired photo-sketch datasets suffer from the steganography phenomenon, which makes them not effective to face photos in the wild. In this paper, we introduce a semi-supervised approach with a noise-injection strategy, named Semi-Cycle-GAN (SCG), to tackle these problems. For the first problem, we propose a pseudo sketch feature representation for each input photo composed from a small reference set of photo-sketch pairs, and use the resulting pseudo pairs to supervise a photo-to-sketch generator G<sub>p2s</sub>. The outputs of G<sub>p2s</sub> can in turn help to train a sketch-to-photo generator G<sub>s2p</sub> in a self-supervised manner. This allows us to train G<sub>p2s</sub> and G<sub>s2p</sub> using a small reference set of photo-sketch pairs together with a large face photo dataset (without ground-truth sketches). For the second problem, we show that the simple noise-injection strategy works well to alleviate the steganography effect in SCG and helps to produce more reasonable sketch-to-photo results with less overfitting than fully supervised approaches. Experiments show that SCG achieves competitive performance on public benchmarks and superior results on photos in the wild.<br></p>-
dc.languageeng-
dc.publisherElsevier-
dc.relation.ispartofComputer Vision and Image Understanding-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectCycle-GAN-
dc.subjectFace photo-sketch translation-
dc.subjectSemi-supervised-
dc.subjectSteganography-
dc.titleSemi-supervised Cycle-GAN for face photo-sketch translation in the wild-
dc.typeArticle-
dc.identifier.doi10.1016/j.cviu.2023.103775-
dc.identifier.scopuseid_2-s2.0-85165227080-
dc.identifier.volume235-
dc.identifier.eissn1090-235X-
dc.identifier.issnl1077-3142-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats