Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TPAMI.2025.3550195
- Scopus: eid_2-s2.0-105000382833
Article: Instant Gaussian Splatting Generation for High-Quality and Real-Time Facial Asset Rendering
| Title | Instant Gaussian Splatting Generation for High-Quality and Real-Time Facial Asset Rendering |
|---|---|
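| Authors | Qin, Dafei; Lin, Hongyang; Zhang, Qixuan; Qiao, Kaichun; Zhang, Longwen; Saito, Jun; Zhao, Zijun; Yu, Jingyi; Xu, Lan; Komura, Taku |
| Keywords | 3D gaussian splatting; animation; diffusion; digital avatar; generative model; transformer |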
| Issue Date | 1-Jan-2025 |
| Publisher | Institute of Electrical and Electronics Engineers |
| Citation | IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025 |
| Abstract | Traditional and AI-driven modeling techniques enable high-fidelity 3D asset generation from scans, videos, or text prompts. However, editing and rendering these assets often involves a trade-off between quality and speed. In this paper, we propose GauFace, a novel Gaussian Splatting representation, tailored for efficient rendering of facial mesh with textures. Then, we introduce TransGS, a diffusion transformer that instantly generates the GauFace assets from mesh, textures and lighting conditions. Specifically, we adopt a patch-based pipeline to handle the vast number of Gaussian Points, and a novel texel-aligned sampling scheme with UV positional encoding to enhance the throughput of generating GauFace assets. Once trained, TransGS can generate GauFace assets in 5 seconds, delivering high fidelity and real-time facial interaction of 30fps@1440p to a Snapdragon 8 Gen 2 mobile platform. The rich conditional modalities further enable editing and animation capabilities reminiscent of traditional CG pipelines. We conduct extensive evaluations and user studies, comparing against traditional renderers as well as recent neural rendering methods. These demonstrate the superior performance of our approach for facial asset rendering. We also showcase diverse applications of facial assets using our TransGS approach and GauFace representation, across various platforms like PCs, phones, and VR headsets. |
| Persistent Identifier | http://hdl.handle.net/10722/361933 |
| ISSN | 0162-8828 (2023 Impact Factor: 20.8; 2023 SCImago Journal Rankings: 6.158) |

| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Qin, Dafei | - |
| dc.contributor.author | Lin, Hongyang | - |
| dc.contributor.author | Zhang, Qixuan | - |
| dc.contributor.author | Qiao, Kaichun | - |
| dc.contributor.author | Zhang, Longwen | - |
| dc.contributor.author | Saito, Jun | - |
| dc.contributor.author | Zhao, Zijun | - |
| dc.contributor.author | Yu, Jingyi | - |
| dc.contributor.author | Xu, Lan | - |
| dc.contributor.author | Komura, Taku | - |
| dc.date.accessioned | 2025-09-17T00:32:09Z | - |
| dc.date.available | 2025-09-17T00:32:09Z | - |
| dc.date.issued | 2025-01-01 | - |
| dc.identifier.citation | IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025 | - |
| dc.identifier.issn | 0162-8828 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/361933 | - |
| dc.description.abstract | Traditional and AI-driven modeling techniques enable high-fidelity 3D asset generation from scans, videos, or text prompts. However, editing and rendering these assets often involves a trade-off between quality and speed. In this paper, we propose GauFace, a novel Gaussian Splatting representation, tailored for efficient rendering of facial mesh with textures. Then, we introduce TransGS, a diffusion transformer that instantly generates the GauFace assets from mesh, textures and lighting conditions. Specifically, we adopt a patch-based pipeline to handle the vast number of Gaussian Points, and a novel texel-aligned sampling scheme with UV positional encoding to enhance the throughput of generating GauFace assets. Once trained, TransGS can generate GauFace assets in 5 seconds, delivering high fidelity and real-time facial interaction of 30fps@1440p to a Snapdragon 8 Gen 2 mobile platform. The rich conditional modalities further enable editing and animation capabilities reminiscent of traditional CG pipelines. We conduct extensive evaluations and user studies, comparing against traditional renderers as well as recent neural rendering methods. These demonstrate the superior performance of our approach for facial asset rendering. We also showcase diverse applications of facial assets using our TransGS approach and GauFace representation, across various platforms like PCs, phones, and VR headsets. | - |
| dc.language | eng | - |
| dc.publisher | Institute of Electrical and Electronics Engineers | - |
| dc.relation.ispartof | IEEE Transactions on Pattern Analysis and Machine Intelligence | - |
| dc.subject | 3D gaussian splatting | - |
| dc.subject | animation | - |
| dc.subject | diffusion | - |
| dc.subject | digital avatar | - |
| dc.subject | generative model | - |
| dc.subject | transformer | - |
| dc.title | Instant Gaussian Splatting Generation for High-Quality and Real-Time Facial Asset Rendering | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1109/TPAMI.2025.3550195 | - |
| dc.identifier.scopus | eid_2-s2.0-105000382833 | - |
| dc.identifier.eissn | 1939-3539 | - |
| dc.identifier.issnl | 0162-8828 | - |
