Semi-supervised Cycle-GAN for face photo-sketch translation in the wild

Chen, Chaofeng; Liu, Wei; Tan, Xiao; Wong, Kwan-Yee

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.cviu.2023.103775
Scopus: eid_2-s2.0-85165227080
WOS: WOS:001043786200001
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: Semi-supervised Cycle-GAN for face photo-sketch translation in the wild

Title	Semi-supervised Cycle-GAN for face photo-sketch translation in the wild
Authors	Chen, Chaofeng Liu, Wei Tan, Xiao Wong, Kwan-Yee
Keywords	Cycle-GAN Face photo-sketch translation Semi-supervised Steganography
Issue Date	1-Oct-2023
Publisher	Elsevier
Citation	Computer Vision and Image Understanding, 2023, v. 235 How to Cite? DOI: http://dx.doi.org/10.1016/j.cviu.2023.103775
Abstract	The performance of face photo-sketch translation has improved a lot thanks to deep neural networks. GAN based methods trained on paired images can produce high-quality results under laboratory settings. Such paired datasets are, however, often very small and lack diversity. Meanwhile, Cycle-GANs trained with unpaired photo-sketch datasets suffer from the steganography phenomenon, which makes them not effective to face photos in the wild. In this paper, we introduce a semi-supervised approach with a noise-injection strategy, named Semi-Cycle-GAN (SCG), to tackle these problems. For the first problem, we propose a pseudo sketch feature representation for each input photo composed from a small reference set of photo-sketch pairs, and use the resulting pseudo pairs to supervise a photo-to-sketch generator G_p2s. The outputs of G_p2s can in turn help to train a sketch-to-photo generator G_s2p in a self-supervised manner. This allows us to train G_p2s and G_s2p using a small reference set of photo-sketch pairs together with a large face photo dataset (without ground-truth sketches). For the second problem, we show that the simple noise-injection strategy works well to alleviate the steganography effect in SCG and helps to produce more reasonable sketch-to-photo results with less overfitting than fully supervised approaches. Experiments show that SCG achieves competitive performance on public benchmarks and superior results on photos in the wild.
Persistent Identifier	http://hdl.handle.net/10722/331394
ISSN	1077-3142 2023 Impact Factor: 4.3 2023 SCImago Journal Rankings: 1.420
ISI Accession Number ID	WOS:001043786200001

DC Field	Value	Language
dc.contributor.author	Chen, Chaofeng	-
dc.contributor.author	Liu, Wei	-
dc.contributor.author	Tan, Xiao	-
dc.contributor.author	Wong, Kwan-Yee	-
dc.date.accessioned	2023-09-21T06:55:20Z	-
dc.date.available	2023-09-21T06:55:20Z	-
dc.date.issued	2023-10-01	-
dc.identifier.citation	Computer Vision and Image Understanding, 2023, v. 235	-
dc.identifier.issn	1077-3142	-
dc.identifier.uri	http://hdl.handle.net/10722/331394	-
dc.description.abstract	<p>The performance of face photo-sketch translation has improved a lot thanks to deep neural networks. GAN based methods trained on paired images can produce high-quality results under laboratory settings. Such paired datasets are, however, often very small and lack diversity. Meanwhile, Cycle-GANs trained with unpaired photo-sketch datasets suffer from the steganography phenomenon, which makes them not effective to face photos in the wild. In this paper, we introduce a semi-supervised approach with a noise-injection strategy, named Semi-Cycle-GAN (SCG), to tackle these problems. For the first problem, we propose a pseudo sketch feature representation for each input photo composed from a small reference set of photo-sketch pairs, and use the resulting pseudo pairs to supervise a photo-to-sketch generator G<sub>p2s</sub>. The outputs of G<sub>p2s</sub> can in turn help to train a sketch-to-photo generator G<sub>s2p</sub> in a self-supervised manner. This allows us to train G<sub>p2s</sub> and G<sub>s2p</sub> using a small reference set of photo-sketch pairs together with a large face photo dataset (without ground-truth sketches). For the second problem, we show that the simple noise-injection strategy works well to alleviate the steganography effect in SCG and helps to produce more reasonable sketch-to-photo results with less overfitting than fully supervised approaches. Experiments show that SCG achieves competitive performance on public benchmarks and superior results on photos in the wild.<br></p>	-
dc.language	eng	-
dc.publisher	Elsevier	-
dc.relation.ispartof	Computer Vision and Image Understanding	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject	Cycle-GAN	-
dc.subject	Face photo-sketch translation	-
dc.subject	Semi-supervised	-
dc.subject	Steganography	-
dc.title	Semi-supervised Cycle-GAN for face photo-sketch translation in the wild	-
dc.type	Article	-
dc.identifier.doi	10.1016/j.cviu.2023.103775	-
dc.identifier.scopus	eid_2-s2.0-85165227080	-
dc.identifier.volume	235	-
dc.identifier.eissn	1090-235X	-
dc.identifier.isi	WOS:001043786200001	-
dc.identifier.issnl	1077-3142	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Semi-supervised Cycle-GAN for face photo-sketch translation in the wild

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats