Conference Paper: Learning Modulated Transformation in GANs
Field | Value |
---|---|
Title | Learning Modulated Transformation in GANs |
Authors | Yang, Ceyuan; Zhang, Qihang; Xu, Yinghao; Zhu, Jiapeng; Shen, Yujun; Dai, Bo |
Issue Date | 2023 |
Citation | Advances in Neural Information Processing Systems, 2023, v. 36 |
Abstract | The success of style-based generators largely benefits from style modulation, which helps account for the cross-instance variation within data. However, the instance-wise stochasticity is typically introduced via regular convolution, where kernels interact with features at fixed locations, limiting its capacity for modeling geometric variation. To alleviate this problem, we equip the generator in generative adversarial networks (GANs) with a plug-and-play module, termed the modulated transformation module (MTM). This module predicts spatial offsets under the control of latent codes, based on which the convolution operation can be applied at variable locations for different instances, hence offering the model an additional degree of freedom to handle geometry deformation. Extensive experiments suggest that our approach generalizes faithfully to various generative tasks, including image generation, 3D-aware image synthesis, and video generation, and is compatible with state-of-the-art frameworks without any hyper-parameter tuning. Notably, for human generation on the challenging TaiChi dataset, we improve the FID of StyleGAN3 from 21.36 to 13.60, demonstrating the efficacy of learning modulated geometry transformation. Code and models are available at https://github.com/limbo0000/mtm. |
Persistent Identifier | http://hdl.handle.net/10722/352429 |
ISSN | 1049-5258 (2020 SCImago Journal Rankings: 1.399) |
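For illustration, the mechanism described in the abstract (a latent code predicting spatial offsets so that the convolution samples features at instance-specific locations) can be sketched with PyTorch's built-in deformable convolution. The class name, layer sizes, and offset-prediction scheme below are assumptions for a minimal sketch, not the authors' implementation; see https://github.com/limbo0000/mtm for the official code and models.

```python
# Minimal sketch of a latent-modulated deformable convolution, loosely
# following the MTM description in the abstract. All names and shapes here
# are illustrative assumptions, not the paper's actual architecture.
import torch
import torch.nn as nn
from torchvision.ops import deform_conv2d


class ModulatedTransformationModule(nn.Module):
    def __init__(self, in_channels, out_channels, latent_dim, kernel_size=3):
        super().__init__()
        self.padding = kernel_size // 2
        # Main convolution weights, applied at the predicted offset locations.
        self.weight = nn.Parameter(
            torch.randn(out_channels, in_channels, kernel_size, kernel_size) * 0.02
        )
        self.bias = nn.Parameter(torch.zeros(out_channels))
        # Maps the latent code to a per-channel modulation of the offset branch.
        self.affine = nn.Linear(latent_dim, in_channels)
        # Predicts 2 offsets (x, y) per kernel tap from the modulated features.
        self.offset_conv = nn.Conv2d(
            in_channels, 2 * kernel_size * kernel_size,
            kernel_size, padding=self.padding,
        )
        # Zero-init so the module starts as a plain, undeformed convolution.
        nn.init.zeros_(self.offset_conv.weight)
        nn.init.zeros_(self.offset_conv.bias)

    def forward(self, x, w):
        # Latent control: scale features channel-wise by a style vector
        # derived from w, then predict spatial offsets from the result.
        style = self.affine(w).unsqueeze(-1).unsqueeze(-1)   # (N, C, 1, 1)
        offsets = self.offset_conv(x * style)                # (N, 2*k*k, H, W)
        # Deformable convolution samples the input at kernel positions
        # shifted by the predicted, instance-specific offsets.
        return deform_conv2d(
            x, offsets, self.weight, self.bias,
            padding=(self.padding, self.padding),
        )


# Usage: one latent code per instance controls where the kernel samples.
x = torch.randn(2, 64, 32, 32)   # feature maps
w = torch.randn(2, 512)          # latent codes
out = ModulatedTransformationModule(64, 128, latent_dim=512)(x, w)
print(out.shape)                 # torch.Size([2, 128, 32, 32])
```

Zero-initializing the offset branch means the module initially behaves as an ordinary convolution and learns geometric deformation gradually, which is one plausible way to realize the plug-and-play property the abstract claims.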
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yang, Ceyuan | - |
dc.contributor.author | Zhang, Qihang | - |
dc.contributor.author | Xu, Yinghao | - |
dc.contributor.author | Zhu, Jiapeng | - |
dc.contributor.author | Shen, Yujun | - |
dc.contributor.author | Dai, Bo | - |
dc.date.accessioned | 2024-12-16T03:58:53Z | - |
dc.date.available | 2024-12-16T03:58:53Z | - |
dc.date.issued | 2023 | - |
dc.identifier.citation | Advances in Neural Information Processing Systems, 2023, v. 36 | - |
dc.identifier.issn | 1049-5258 | - |
dc.identifier.uri | http://hdl.handle.net/10722/352429 | - |
dc.description.abstract | The success of style-based generators largely benefits from style modulation, which helps account for the cross-instance variation within data. However, the instance-wise stochasticity is typically introduced via regular convolution, where kernels interact with features at fixed locations, limiting its capacity for modeling geometric variation. To alleviate this problem, we equip the generator in generative adversarial networks (GANs) with a plug-and-play module, termed the modulated transformation module (MTM). This module predicts spatial offsets under the control of latent codes, based on which the convolution operation can be applied at variable locations for different instances, hence offering the model an additional degree of freedom to handle geometry deformation. Extensive experiments suggest that our approach generalizes faithfully to various generative tasks, including image generation, 3D-aware image synthesis, and video generation, and is compatible with state-of-the-art frameworks without any hyper-parameter tuning. Notably, for human generation on the challenging TaiChi dataset, we improve the FID of StyleGAN3 from 21.36 to 13.60, demonstrating the efficacy of learning modulated geometry transformation. Code and models are available at https://github.com/limbo0000/mtm. | -
dc.language | eng | - |
dc.relation.ispartof | Advances in Neural Information Processing Systems | - |
dc.title | Learning Modulated Transformation in GANs | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.scopus | eid_2-s2.0-85191164604 | - |
dc.identifier.volume | 36 | - |