File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Instant Gaussian Splatting Generation for High-Quality and Real-Time Facial Asset Rendering

TitleInstant Gaussian Splatting Generation for High-Quality and Real-Time Facial Asset Rendering
Authors
Keywords3D gaussian splatting
animation
diffusion
digital avatar
generative model
transformer
Issue Date1-Jan-2025
PublisherInstitute of Electrical and Electronics Engineers
Citation
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025 How to Cite?
Abstract

Traditional and AI-driven modeling techniques enable high-fidelity 3D asset generation from scans, videos, or text prompts. However, editing and rendering these assets often involves a trade-off between quality and speed. In this paper, we propose GauFace, a novel Gaussian Splatting representation, tailored for efficient rendering of facial mesh with textures. Then, we introduce TransGS, a diffusion transformer that instantly generates the GauFace assets from mesh, textures and lightning conditions. Specifically, we adopt a patch-based pipeline to handle the vast number of Gaussian Points, a novel texel-aligned sampling scheme with UV positional encoding to enhance the throughput of generating GauFace assets. Once trained, TransGS can generate GauFace assets in 5 seconds, delivering high fidelity and real-time facial interaction of 30fps@1440p to a Snapdragon 8 Gen 2 mobile platform. The rich conditional modalities further enable editing and animation capabilities reminiscent of traditional CG pipelines. We conduct extensive evaluations and user studies, compared to traditional renderers, as well as recent neural rendering methods. They demonstrate the superior performance of our approach for facial asset rendering. We also showcase diverse applications of facial assets using our TransGS approach and GauFace representation, across various platforms like PCs, phones, and VR headsets.


Persistent Identifierhttp://hdl.handle.net/10722/361933
ISSN
2023 Impact Factor: 20.8
2023 SCImago Journal Rankings: 6.158

 

DC FieldValueLanguage
dc.contributor.authorQin, Dafei-
dc.contributor.authorLin, Hongyang-
dc.contributor.authorZhang, Qixuan-
dc.contributor.authorQiao, Kaichun-
dc.contributor.authorZhang, Longwen-
dc.contributor.authorSaito, Jun-
dc.contributor.authorZhao, Zijun-
dc.contributor.authorYu, Jingyi-
dc.contributor.authorXu, Lan-
dc.contributor.authorKomura, Taku-
dc.date.accessioned2025-09-17T00:32:09Z-
dc.date.available2025-09-17T00:32:09Z-
dc.date.issued2025-01-01-
dc.identifier.citationIEEE Transactions on Pattern Analysis and Machine Intelligence, 2025-
dc.identifier.issn0162-8828-
dc.identifier.urihttp://hdl.handle.net/10722/361933-
dc.description.abstract<p>Traditional and AI-driven modeling techniques enable high-fidelity 3D asset generation from scans, videos, or text prompts. However, editing and rendering these assets often involves a trade-off between quality and speed. In this paper, we propose GauFace, a novel Gaussian Splatting representation, tailored for efficient rendering of facial mesh with textures. Then, we introduce TransGS, a diffusion transformer that instantly generates the GauFace assets from mesh, textures and lightning conditions. Specifically, we adopt a patch-based pipeline to handle the vast number of Gaussian Points, a novel texel-aligned sampling scheme with UV positional encoding to enhance the throughput of generating GauFace assets. Once trained, TransGS can generate GauFace assets in 5 seconds, delivering high fidelity and real-time facial interaction of 30fps@1440p to a Snapdragon 8 Gen 2 mobile platform. The rich conditional modalities further enable editing and animation capabilities reminiscent of traditional CG pipelines. We conduct extensive evaluations and user studies, compared to traditional renderers, as well as recent neural rendering methods. They demonstrate the superior performance of our approach for facial asset rendering. We also showcase diverse applications of facial assets using our TransGS approach and GauFace representation, across various platforms like PCs, phones, and VR headsets.</p>-
dc.languageeng-
dc.publisherInstitute of Electrical and Electronics Engineers-
dc.relation.ispartofIEEE Transactions on Pattern Analysis and Machine Intelligence-
dc.subject3D gaussian splatting-
dc.subjectanimation-
dc.subjectdiffusion-
dc.subjectdigital avatar-
dc.subjectgenerative model-
dc.subjecttransformer-
dc.titleInstant Gaussian Splatting Generation for High-Quality and Real-Time Facial Asset Rendering-
dc.typeArticle-
dc.identifier.doi10.1109/TPAMI.2025.3550195-
dc.identifier.scopuseid_2-s2.0-105000382833-
dc.identifier.eissn1939-3539-
dc.identifier.issnl0162-8828-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats