Conference Paper: Disentangled representation learning for controllable image synthesis: An information-theoretic perspective

Title: Disentangled representation learning for controllable image synthesis: An information-theoretic perspective
Authors: Tang, Shichang; Zhou, Xu; He, Xuming; Ma, Yi
Issue Date: 2020
Citation: Proceedings - International Conference on Pattern Recognition, 2020, p. 10042-10049
Abstract: In this paper, we look into the problem of disentangled representation learning and controllable image synthesis in a deep generative model. We develop an encoder-decoder architecture for a variant of the Variational Auto-Encoder (VAE) with two latent codes z1 and z2. Our framework uses z2 to capture specified factors of variation while z1 captures the complementary factors of variation. To this end, we analyze the learning problem from the perspective of multivariate mutual information, derive optimizable lower bounds of the conditional mutual information in the image synthesis processes and incorporate them into the training objective. We validate our method empirically on the Color MNIST dataset and the CelebA dataset by showing controllable image syntheses. Our proposed paradigm is simple yet effective and is applicable to many situations, including those where no explicit factorization of features is available, or where the features are non-categorical.
Persistent Identifier: http://hdl.handle.net/10722/327775
ISSN: 1051-4651
2023 SCImago Journal Rankings: 0.584
ISI Accession Number ID: WOS:000681331402073
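
The abstract describes an encoder-decoder VAE variant with two latent codes. Below is a minimal, hypothetical sketch of that two-code factorization in PyTorch. It is not the authors' released code: all module names, dimensions, and the TwoCodeVAE class are illustrative assumptions, and only the standard VAE terms are implemented; the paper's conditional mutual-information lower bounds are omitted.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TwoCodeVAE(nn.Module):
    """Sketch of a VAE with two latent codes: z2 for the specified factors
    of variation, z1 for the complementary factors (hypothetical layout)."""

    def __init__(self, x_dim=784, z1_dim=16, z2_dim=10, h_dim=400):
        super().__init__()
        self.enc = nn.Linear(x_dim, h_dim)
        # One Gaussian head per code: mean and log-variance.
        self.mu1 = nn.Linear(h_dim, z1_dim)
        self.lv1 = nn.Linear(h_dim, z1_dim)
        self.mu2 = nn.Linear(h_dim, z2_dim)
        self.lv2 = nn.Linear(h_dim, z2_dim)
        self.dec = nn.Sequential(
            nn.Linear(z1_dim + z2_dim, h_dim), nn.ReLU(),
            nn.Linear(h_dim, x_dim), nn.Sigmoid())

    @staticmethod
    def reparam(mu, logvar):
        # Reparameterization trick: z = mu + sigma * eps, eps ~ N(0, I).
        return mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)

    def forward(self, x):
        h = F.relu(self.enc(x))
        q1 = (self.mu1(h), self.lv1(h))
        q2 = (self.mu2(h), self.lv2(h))
        z1 = self.reparam(*q1)
        z2 = self.reparam(*q2)
        # The decoder conditions on both codes jointly.
        x_hat = self.dec(torch.cat([z1, z2], dim=-1))
        return x_hat, q1, q2

def kl_std_normal(mu, logvar):
    # Closed-form KL(q(z|x) || N(0, I)) for a diagonal Gaussian posterior.
    return -0.5 * torch.sum(1.0 + logvar - mu.pow(2) - logvar.exp(), dim=-1)

def vae_loss(x, x_hat, q1, q2):
    # Standard ELBO terms only; the paper's conditional mutual-information
    # lower bounds would enter as additional regularizers here.
    rec = F.binary_cross_entropy(x_hat, x, reduction='sum') / x.size(0)
    kl = (kl_std_normal(*q1) + kl_std_normal(*q2)).mean()
    return rec + kl

With a model of this shape, controllable synthesis amounts to encoding an image, holding z2 fixed while resampling z1 (or vice versa), and decoding the concatenated codes. The paper additionally ties z2 to the specified factors and optimizes the derived conditional mutual-information bounds, which this sketch leaves out.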

 

DC Field / Value
dc.contributor.author: Tang, Shichang
dc.contributor.author: Zhou, Xu
dc.contributor.author: He, Xuming
dc.contributor.author: Ma, Yi
dc.date.accessioned: 2023-05-08T02:26:43Z
dc.date.available: 2023-05-08T02:26:43Z
dc.date.issued: 2020
dc.identifier.citation: Proceedings - International Conference on Pattern Recognition, 2020, p. 10042-10049
dc.identifier.issn: 1051-4651
dc.identifier.uri: http://hdl.handle.net/10722/327775
dc.description.abstract: In this paper, we look into the problem of disentangled representation learning and controllable image synthesis in a deep generative model. We develop an encoder-decoder architecture for a variant of the Variational Auto-Encoder (VAE) with two latent codes z1 and z2. Our framework uses z2 to capture specified factors of variation while z1 captures the complementary factors of variation. To this end, we analyze the learning problem from the perspective of multivariate mutual information, derive optimizable lower bounds of the conditional mutual information in the image synthesis processes and incorporate them into the training objective. We validate our method empirically on the Color MNIST dataset and the CelebA dataset by showing controllable image syntheses. Our proposed paradigm is simple yet effective and is applicable to many situations, including those where no explicit factorization of features is available, or where the features are non-categorical.
dc.language: eng
dc.relation.ispartof: Proceedings - International Conference on Pattern Recognition
dc.title: Disentangled representation learning for controllable image synthesis: An information-theoretic perspective
dc.type: Conference_Paper
dc.description.nature: link_to_subscribed_fulltext
dc.identifier.doi: 10.1109/ICPR48806.2021.9411925
dc.identifier.scopus: eid_2-s2.0-85110472024
dc.identifier.spage: 10042
dc.identifier.epage: 10049
dc.identifier.isi: WOS:000681331402073
