虚拟人形象合成技术综述

邓钇敏; 张旭龙; 司世景; 王健宗; 肖京

doi:10.11959/j.issn.2096-0271.2022081

您当前的位置：

首页 >

文章列表页 >

虚拟人形象合成技术综述

研究 | 更新时间：2024-06-03

- 虚拟人形象合成技术综述
- Human avatars synthesis technologies: a survey
- 大数据 2023年9卷第3期页码：114-139
- 作者机构：
  
  1. 平安科技（深圳）有限公司，广东深圳 518063
  2. 中国科学技术大学，安徽合肥 230026
  3. 上海外国语大学国际金融贸易学院，上海 200083
- 作者简介：
  
  [ "邓钇敏（1999- ），女，中国科学技术大学硕士生，中国计算机学会会员，主要研究方向为深度学习、计算机视觉、元宇宙等。" ]
  [ "张旭龙（1988- ），男，博士，平安科技（深圳）有限公司高级算法研究员，主要研究方向为语音合成、语音转换、音乐信息检索、机器学习和深度学习方法在人工智能领域应用。" ]
  [ "司世景（1988- ），男，博士，平安科技（深圳）有限公司资深算法研究员，深圳市海外高层次人才。美国杜克大学人工智能博士后，中国计算机学会会员，主要研究方向为机器学习和及其在人工智能领域应用。" ]
  [ "王健宗（1983- ），男，博士，平安科技（深圳）有限公司副总工程师，资深人工智能总监，联邦学习技术部总经理。美国佛罗里达大学人工智能博士后，中国计算机学会高级会员，中国计算机学会大数据专家委员会委员，曾任美国莱斯大学电子与计算机工程系研究员，主要研究方向为联邦学习和人工智能等。" ]
  [ "肖京（1972- ），男，博士，中国平安集团首席科学家，2019年吴文俊人工智能杰出贡献奖获得者，中国计算机学会深圳分部副主席，主要研究方向为计算机图形学学科、自动驾驶、3D显示、医疗诊断、联邦学习等。" ]
- 基金信息：
  
  广东省重点领域研发计划“新一代人工智能”重大专项;The Key Research and Development Program of Guangdong Province(2021B0101400003)
- DOI：10.11959/j.issn.2096-0271.2022081
  中图分类号： TP391
- 网络首发：2023-05，
  
  纸质出版：2023-05-15
- 稿件说明：
移动端阅览
邓钇敏, 张旭龙, 司世景, 等. 虚拟人形象合成技术综述[J]. 大数据, 2023,9(3):114-139.

Yimin DENG, Xulong ZHANG, Shijing SI, et al. Human avatars synthesis technologies: a survey[J]. Big data research, 2023, 9(3): 114-139.
邓钇敏, 张旭龙, 司世景, 等. 虚拟人形象合成技术综述[J]. 大数据, 2023,9(3):114-139. DOI： 10.11959/j.issn.2096-0271.2022081.

Yimin DENG, Xulong ZHANG, Shijing SI, et al. Human avatars synthesis technologies: a survey[J]. Big data research, 2023, 9(3): 114-139. DOI： 10.11959/j.issn.2096-0271.2022081.

摘要

随着元宇宙兴起，针对虚拟人形象化高效建模的需求日益迫切。从人类图像数据集中构建人类模型一直是计算机视觉的热门话题，其中3D虚拟人合成可以视作三维重建的子模块，重点在于对复杂的人体结构和表面细节的还原。对近年来虚拟人形象构建相关文献进行了全面调研，研究范围覆盖了全身形象、头部形象以及衣物建模等领域。分析归纳构建工作的基本原理，从各自技术路线层面出发将虚拟人合成方法分为基于网格、基于图像、基于体素、基于隐式表示、混合表示5类。首先介绍各类方法的基本原理，然后结合现有工作讨论具体技术，并指出各类方法的优缺点。此外还介绍了部分常见的模型质量评估的数据集和评价指标，简要介绍了虚拟人的常见应用。最后对虚拟人合成技术未来发展方向进行了展望，以合成高质量、高保真度、低延迟的虚拟人形象。

Abstract

Nowadays

the demand for efficient human avatars modeling is becoming increasingly urgent since metaverse has attracted more and more attention.Creating human avatars from human image datasets has always been a popular topic in the field of computer vision.3D human avatars synthesis can be regarded as a sub-module of 3D reconstruction focusing on reproducing the complex articulated body and surface details of human.A comprehensive survey of the literature related to the human reconstruction in recent years was conducted

including the work of full-body avatars

talking-head and clothing modeling.By analyzing and summarizing existing work

human avatars synthesis technologies were divided into five categories: mesh-based methods

image-based methods

voxel-based methods

implicit methods and hybrid methods due to the features of their pipelines.Firstly

the basic principles of them were introduced respectively.Secondly

the realization based on related work was discussed and then the advantages and disadvantages of methods respectively were pointed out.Thirdly

the datasets and metrics for model quality evaluation were introduced.Besides

an overview of various applications was given.Finally

the future directions of human avatars synthesis technology were prospected to synthesize high-quality

high-fidelity and low-latency human avatars.

关键词

Keywords

references

中国人工智能产业发展联盟总体组 , 中关村数智人工智能产业联盟数字人工作委员会 . 2020年虚拟数字人发展白皮书 [R ] . 2020 .

Artificial Intelligence Industry Alliance , Digital Human Work Committee of Zhongguancun Shuzhi Artificial Intelligence Industry Alliance . 2020 virtual digital human development white paper [R ] . 2020 .

FU K , PENG J S , HE Q W , et al . Single image 3D object reconstruction based on deep learning:a review [J ] . Multimedia Tools and Applications , 2021 , 80 ( 1 ): 463 - 498 .

SHA T , ZHANG W , SHEN T , et al . Deep person generation:a survey from the perspective of face,pose and cloth synthesis [J ] . arXiv preprint , 2021 ,arXiv:2109.02081.

CHEN L , PENG S D , ZHOU X W . Towards efficient and photorealistic 3D human reconstruction:a brief survey [J ] . Visual Informatics , 2021 , 5 ( 4 ): 11 - 19 .

JOEYDEVRIES . Textures [Z ] . 2022 .

ZENG W , OUYANG W L , LUO P , et al . 3D human mesh regression with dense correspondence [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 7052 - 7061 .

GATYS L A , ECKER A S , BETHGE M . Texture synthesis using convolutional neural networks [C ] // Proceedings of the 28th International Conference on Neural Information Processing Systems New York:ACM Press , 2015 : 262 - 270 .

RISSER E , WILMOT P , BARNES C . Stable and controllable neural texture synthesis and style transfer using histogram losses [J ] . arXiv preprint , 2017 ,arXiv:1701.08893.

OECHSLE M , MESCHEDER L , NIEMEYER M , et al . Texture fields:learning texture representations in function space [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 4530 - 4539 .

阿娣提·玛珠德 , M.戈皮 . 视觉计算基础:计算机视觉、图形学和图像处理的核心概念 [M ] . 赵启军,涂欢,梁洁,译. . 北京 : 机械工业出版社 , 2019 .

MAJUMDER A , GOPI M . Introduction to visual computing:core concepts in computer vision,graphics,and image processing [M ] . Translated by ZHAO Q J,XU H,LIANG J . Beijing : China Machine Press , 2019 .

JONES A , GARDNER A , BOLAS M , et al . Simulating spatially varying lighting on a live performance [C ] // Proceedings of 3rd European Conference on Visual Media Production and the 2nd Multimedia Conference 2006 .[S.l.:s.n. ] , 2006 : 127 - 133 .

PHONG B T . Illumination for computer generated pictures [J ] . Communications of the ACM , 1975 , 18 ( 6 ): 311 - 317 .

JOEYDEVRIES . Normal mapping [Z ] . 2022 .

JOEYDEVRIES . PBR:theory [Z ] . 2022 .

洪锋 , 梅炯 , 李明禄 . 医学图象三维重建技术综述 [J ] . 中国图象图形学报(A辑) , 2003 , 8 ( z1 ): 784 - 791 .

HONG F , MEI J , LI M L . Study on the techniques for 3D reconstruction of medical images [J ] . Journal of Image and Graphics , 2003 , 8 ( z1 ): 784 - 791 .

MILDENHALL B , SRINIVASAN P P , TANCIK M , et al . NeRF:representing scenes as neural radiance fields for view synthesis [C ] // Proceedings of 2020 European Conference on Computer Vision . Cham:Springer , 2020 : 405 - 421 .

XU Q G , XU Z X , PHILIP J , et al . PointNeRF:point-based neural radiance fields [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 5428 - 5438 .

ANGUELOV D , SRINIVASAN P , KOLLER D , et al . SCAPE:shape completion and animation of people [J ] . ACM Transactions on Graphics , 2005 , 24 ( 3 ): 408 - 416 .

KAVAN L , COLLINS S , ŽÁRA J , , et al . Geometric skinning with approximate dual quaternion blending [J ] . ACM Transactions on Graphics , 2008 , 27 ( 4 ): 1 - 23 .

JACOBSON A , BARAN I , POPOVIĆ J , , et al . Bounded biharmonic weights for realtime deformation [J ] . ACM Transactions on Graphics , 2011 , 30 ( 4 ): 1 - 8 .

LOPER M , MAHMOOD N , ROMERO J , et al . SMPL:a skinned multi-person linear model [J ] . ACM Transactions on Graphics , 2015 , 34 ( 6 ): 1 - 16 .

PAVLAKOS G , CHOUTAS V , GHORBANI N , et al . Expressive body capture:3D hands,face,and body from a single image [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 10967 - 10977 .

WU S Z , JIN S , LIU W T , et al . Graphbased 3D multi-person pose estimation using multi-view images [C ] // Proceedings of 2021 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2022 : 11128 - 11137 .

JIANG B Y , ZHANG Y D , WEI X K , et al . H4D:human 4D modeling by learning neural compositional representation [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 19333 - 19343 .

OSMAN A A A , BOLKART T , BLACK M J . STAR:sparse trained articulated human body regressor [C ] // Proceedings of European Conference on Computer Vision . Cham:Springer , 2020 : 598 - 613 .

XU H Y , BAZAVAN E G , ZANFIR A , et al . GHUM ＆ GHUML:generative 3D human shape and articulated pose models [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 6183 - 6192 .

KINGMA D P , WELLING M . Autoencoding variational bayes [J ] . arXiv preprint , 2013 ,arXiv:1312.6114.

REZENDE D J , MOHAMED S . Variational inference with normalizing flows [J ] . arXiv preprint , 2015 ,arXiv:1505.05770.

BHATNAGAR B , TIWARI G , THEOBALT C , et al . Multi-garment Net:learning to dress 3D people from images [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 5419 - 5429 .

ALLDIECK T , PONS-MOLL G , THEOBALT C , et al . Tex2Shape:detailed full human body geometry from a single image [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 2293 - 2303 .

WENG C Y , CURLESS B , KEMELMACHER-SHLIZERMAN I . Photo wake-up:3D character animation from a single photo [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 5901 - 5910 .

ALLDIECK T , MAGNOR M , XU W P , et al . Detailed human avatars from monocular video [C ] // Proceedings of 2018 International Conference on 3D Vision . Piscataway:IEEE Press , 2018 : 98 - 109 .

MA Q L , YANG J L , RANJAN A , et al . Learning to dress 3D people in generative clothing [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 6468 - 6477 .

ALLDIECK T , MAGNOR M , BHATNAGAR B L , et al . Learning to reconstruct people in clothing from a single RGB camera [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 1175 - 1186 .

JIANG B Y , ZHANG J Y , HONG Y , et al . BCNet:learning body and cloth shape from a single image [C ] // Proceedings of European Conference on Computer Vision . Cham:Springer , 2020 : 18 - 35 .

WEI W L , LIN J C , LIU T L , et al . Capturing humans in motion:temporalattentive 3D human pose and shape estimation from monocular video [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 13201 - 13210 .

BLANZ V , VETTER T . A morphable model for the synthesis of 3D faces [C ] // Proceedings of the 26th annual conference on Computer graphics and interactive techniques . New York:ACM Press , 1999 : 187 - 194 .

LATTAS A , MOSCHOGLOU S , GECER B , et al . AvatarMe:realistically renderable 3D facial reconstruction “In-thewild” [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 757 - 766 .

ZHENG M W , YANG H Y , HUANG D , et al . ImFace:a nonlinear 3D morphable face model with implicit neural representations [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 20311 - 20320 .

ZHENG Y F , ABREVAYA V F , BÜHLER M C , et al . I M avatar:implicit morphable head avatars from videos [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 13535 - 13545 .

GECER B , PLOUMPIS S , KOTSIA I , et al . GANFIT:generative adversarial network fitting for high fidelity 3D face reconstruction [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 1155 - 1164 .

KARRAS T , LAINE S , AILA T M . A style-based generator architecture for generative adversarial networks [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 4396 - 4405 .

TEWARI A , ELGHARIB M , BHARAJ G , et al . StyleRig:rigging StyleGAN for 3D control over portrait images [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 6141 - 6150 .

KARRAS T , LAINE S , AITTALA M , et al . Analyzing and improving the image quality of StyleGAN [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 8107 - 8116 .

LUO H W , NAGANO K , KUNG H W , et al . Normalized avatar synthesis using StyleGAN and perceptual refinement [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 11657 - 11667 .

SHEN Y , LIANG J B , LIN M C . GANbased garment generation using sewing pattern images [C ] // Proceedings of European Conference on Computer Vision . Cham:Springer , 2020 : 225 - 247 .

RAFFIEE A H , SOLLAMI M . GarmentGAN:photo-realistic adversarial fashion transfer [C ] // Proceedings of 2020 25th International Conference on Pattern Recognition . Piscataway:IEEE Press , 2021 : 3923 - 3930 .

CURLESS B , LEVOY M . A volumetric method for building complex models from ange images [C ] // Proceedings of the 23rd Annual Conference on Computer graphics and Interactive Techniques . New York:ACM Press , 1996 : 303 - 312 .

IZADI S , KIM D , HILLIGES O , et al . KinectFusion:real-time 3D reconstruction and interaction using a moving depth camera [C ] // Proceedings of the 24th annual ACM symposium on User Interface Software and Technology . New York:ACM Press , 2011 : 559 - 568 .

DAI A , NIEßNER M , ZOLLHÖFER M , et al . BundleFusion:real-time globally consistent 3D reconstruction using onthe-fly surface reintegration [J ] . ACM Transactions on Graphics , 2017 , 36 ( 4 ): 76a .

SITZMANN V , THIES J , HEIDE F , et al . DeepVoxels:learning persistent 3D feature embeddings [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 2432 - 2441 .

MA X X , SU J J , WANG C Y , et al . Context modeling in 3D human pose estimation:a unified perspective [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 6234 - 6243 .

ZHENG Z R , YU T , WEI Y X , et al . DeepHuman:3D human reconstruction from a single image [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 7738 - 7748 .

LOMBARDI S , SIMON T , SARAGIH J , et al . Neural volumes:learning dynamic renderable volumes from images [J ] . ACM Transactions on Graphics , 2019 , 38 ( 4 ): 1 - 14 .

MESCHEDER L , OECHSLE M , NIEMEYER M , et al . Occupancy networks:learning 3D reconstruction in function space [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 4455 - 4465 .

PARK J J , FLORENCE P , STRAUB J , et al . DeepSDF:learning continuous signed distance functions for shape representation [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 165 - 174 .

CHEN Z Q , ZHANG H . Learning implicit fields for generative shape modeling [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 5932 - 5941 .

SITZMANN V , ZOLLHÖFER M , WETZSTEIN G . Scene representation networks:continuous 3D-structureaware neural scene representations [J ] . arXiv preprint , 2019 ,arXiv:1906.01618.

YANG G S , VO M , NEVEROVA N , et al . BANMo:building animatable 3D neural models from many casual videos [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 2853 - 2863 .

NEVEROVA N , NOVOTNY D , KHALIDOV V , et al . Continuous surface embeddings [J ] . arXiv preprint , 2020 ,arXiv:2011.12438.

BOŽIČ A , PALAFOX P , ZOLLHÖFER M , et al . Neural deformation graphs for globally-consistent non-rigid reconstruction [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 1450 - 1459 .

MITTAL P , CHENG Y C , SINGH M , et al . AutoSDF:shape priors for 3D completion,reconstruction and generation [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 306 - 315 .

SAITO S , HUANG Z , NATSUME R , et al . PIFu:pixel-aligned implicit function for high-resolution clothed human digitization [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 2304 - 2314 .

SAITO S , SIMON T , SARAGIH J , et al . PIFuHD:multi-level pixel-aligned implicit function for high-resolution 3D human digitization [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 81 - 90 .

JADHAV O , PATIL A , SAM J , et al . Virtual dressing using augmented reality [J ] . ITM Web of Conferences , 2021 ,40.

ZHU X , LIAO T , LYU J , et al . MVPhuman dataset for 3D human avatar reconstruction from unconstrained frames [J ] . arXiv preprint , 2022 ,arXiv:2204.11184.

DENG B Y , LEWIS J P , JERUZALSKI T , et al . NASA Neural articulated shape approximation [C ] // Proceedings of European Conference on Computer Vision . Cham:Springer , 2020 : 612 - 628 .

CAO Y K , CHEN G Y , HAN K , et al . JIFF:jointly-aligned implicit face function for high quality single view clothed human reconstruction [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 2719 - 2729 .

BHATNAGAR B L , SMINCHISESCU C , THEOBALT C , et al . Combining implicit function learning and parametric models for 3D human reconstruction [C ] // Proceedings of European Conference on Computer Vision . Cham:Springer , 2020 : 311 - 329 .

SAITO S , YANG J L , MA Q L , et al . SCANimate:weakly supervised learning of skinned clothed avatar networks [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 2885 - 2896 .

XIU Y L , YANG J L , TZIONAS D , et al . ICON:implicit clothed humans obtained from normals [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 13286 - 13296 .

ZHENG Z R , HUANG H , YU T , et al . Structured local radiance fields for human avatar modeling [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 15872 - 15882 .

XU T H , FUJITA Y , MATSUMOTO E . Surface-aligned neural radiance fields for controllable 3D human synthesis [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 15862 - 15871 .

LIU W , PIAO Z X , MIN J , et al . Liquid warping GAN:a unified framework for human motion imitation,appearance transfer and novel view synthesis [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 5903 - 5912 .

GRIGOREV A , ISKAKOV K , IANINA A , et al . StylePeople:a generative model of fullbody human avatars [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 5147 - 5156 .

RAJ A , ZOLLHÖFER M , SIMON T , et al . Pixel-aligned volumetric avatars [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 11728 - 11737 .

GAFNI G , THIES J , ZOLLHÖFER M , et al . Dynamic neural radiance fields for monocular 4D facial avatar reconstruction [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 8645 - 8654 .

ZHENG Z R , YU T , LIU Y B , et al . PaMIR:parametric model-conditioned implicit representation for imagebased human reconstruction [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2022 , 44 ( 6 ): 31703184 .

ZHENG Y , SHAO R Z , ZHANG Y X , et al . DeepMultiCap:performance capture of multiple characters using sparse multiview cameras [C ] // Proceedings of 2021 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2022 : 6219 - 6229 .

YANG Z , WANG S L , MANIVASAGAM S , et al . S3:neural shape,skeleton,and skinning fields for 3D human modeling [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 13279 - 13288 .

PENG S D , ZHANG Y Q , XU Y H , et al . Neural body:implicit neural representations with structured latent codes for novel view synthesis of dynamic humans [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 9050 - 9059 .

ZHENG L , SHEN L Y , TIAN L , et al . Scalable person re-identification:a benchmark [C ] // Proceedings of 2015 IEEE International Conference on Computer Vision . Piscataway:IEEE Press , 2016 : 1116 - 1124 .

IONESCU C , PAPAVA D , OLARU V , et al . Human3.6M:large scale datasets and predictive methods for 3D human sensing in natural environments [J ] . IEEE Transactions on Pattern Analysis and Machine Intelligence , 2014 , 36 ( 7 ): 1325 - 1339 .

JING X Y , FENG Q , LAI Y K , et al . STATE:learning structure and texture representations for novel view synthesis [C ] // Proceedings of IEEE International Conference on Computer Vision .[S.l.:s.n. ] , 2022 .

PATEL P , HUANG C H P , TESCH J , et al . AGORA:avatars in geography optimized for regression analysis [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 13463 - 13473 .

ALLDIECK T , MAGNOR M , XU W P , et al . Video based reconstruction of 3D people models [C ] // Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2018 : 8387 - 8397 .

ZHANG R , ISOLA P , EFROS A A , et al . The unreasonable effectiveness of deep features as a perceptual metric [C ] // Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2018 : 586 - 595 .

BROWNLEE J How to implement the frechet inception distance (FID) for evaluating GANs [Z ] . 2019 .

SETIADI D R I M . PSNR vs SSIM:imperceptibility quality assessment for image steganography [J ] . Multimedia Tools and Applications , 2021 , 80 ( 6 ): 84238444 .

LAZOVA V , INSAFUTDINOV E , PONSMOLL G . 360-degree textures of people in clothing from a single image [C ] // Proceedings of 2019 International Conference on 3D Vision . Piscataway:IEEE Press , 2019 : 643 - 653 .

SIAROHIN A , LATHUILIÈRE S , TULYAKOV S , et al . First order motion model for image animation [J ] . Advances in Neural Information Processing Systems , 2019 ,32.

SIAROHIN A , LATHUILIÈRE S , TULYAKOV S , et al . First order motion model for image animation [J ] . arXiv preprint , 2020 ,arXiv:2003.00196.

KIM H , GARRIDO P , TEWARI A , et al . Deep video portraits [J ] . ACM Transactions on Graphics , 2018 , 37 ( 4 ): 1 - 14 .

ZHAO F Q , YANG W , ZHANG J K , et al . HumanNeRF:efficiently generated human radiance field from sparse inputs [C ] // Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2022 : 7733 - 7743 .

ESSER P , SUTTER E . A variational U-net for conditional appearance and shape generation [C ] // Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2018 : 8857 - 8866 .

REN Y R , YU X M , CHEN J M , et al . Deep image spatial transformation for person image generation [C ] // Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 7687 - 7696 .

SIAROHIN A , SANGINETO E , LATHUILIÈRE S , et al . Deformable GANs for pose-based human image generation [C ] // Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2018 : 3408 - 3416 .

LIU M C , WANG K J , JI R H , et al . Pose transfer generation with semantic parsing attention network for person re-identification [J ] . Knowledge-Based Systems , 2021 ,223.

ZHU Z , HUANG T T , SHI B G , et al . Progressive pose attention transfer for person image generation [C ] // Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2020 : 2342 - 2351 .

OLSZEWSKI K , TULYAKOV S , WOODFORD O , et al . Transformable bottleneck networks [C ] // Proceedings of 2019 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2020 : 7647 - 7656 .

YU A , YE V , TANCIK M , et al . pixelNeRF:neural radiance fields from one or few images [C ] // Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway:IEEE Press , 2021 : 4576 - 4585 .

FANG Z X , CAI L B , WANG G . MetaHuman creator the starting point of the metaverse [C ] // Proceedings of 2021 International Symposium on Computer Technology and Information Science . Piscataway:IEEE Press , 2021 : 154 - 157 .

PATARANUTAPORN P , DANRY V , LEONG J , et al . AI-generated characters for supporting personalized learning and well-being [J ] . Nature Machine Intelligence , 2021 , 3 ( 12 ): 1013 - 1022 .

PATARANUTAPORN P , DANRY V , MAES P . Machinoia,machine of multiple me:integrating with past,future and alternative selves [C ] // Proceedings of Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems . New York:ACM Press , 2021 : 1 - 7 .

KATO R , KIKUCHI Y , YEM V , et al . Reality Avatar for Customer Conversation in the Metaverse [C ] // Proceedings of International Conference on HumanComputer Interaction . Cham:Springer , 2022 : 131 - 145 .

CONTI M , GATHANI J , TRICOMI P P . Virtual influencers in online social media [J ] . IEEE Communications Magazine , 2022 , 60 ( 8 ): 86 - 91 .

SILVA E S , BONETTI F . Digital humans in fashion:will consumers interact? [J ] . Journal of Retailing and Consumer Services , 2021 ,60.

KÁDEKOVÁ I Z , HOLIENČINOVÁ I M . Influencer marketing as a modern phenomenon creating a new frontier of virtual opportunities [J ] . Communication Today , 2018 , 9 ( 2 ): 90 - 105 .

沈浩 , 刘亭利 . 虚实共融，若即若离：全面进击的虚拟数字人 [J ] . 视听界 , 2022 ( 3 ): 5 - 10 .

SHEN H , LIU T L . Integration of reality and reality,at arm’s length:an all-round attack on virtual digital people [J ] . Broadcasting Realm , 2022 ( 3 ): 5 - 10 .

PARK I , SAH Y J , LEE S , et al . Avatarmediated communication in video conferencing:effect of self-affirmation on debating participation focusing on moderation effect of avatar [J ] . International Journal of Human-Computer Interaction , 2023 , 39 ( 3 ): 464 - 475 .

TAKANO M , YOKOTANI K . Online social support via avatar communication buffers harmful effects of offline bullying victimization [J ] . Proceedings of the International AAAI Conference on Web and Social Media , 2022 , 16 : 980 - 992 .

CHEONG B C . Avatars in the metaverse:potential legal issues and remedies [J ] . International Cybersecurity Law Review , 2022 , 3 ( 2 ): 467 - 494 .

LI J X , FENG Z J , SHE Q , et al . MINE:towards continuous depth MPI with NeRF for novel view synthesis [C ] // Proceedings of 2021 IEEE/CVF International Conference on Computer Vision . Piscataway:IEEE Press , 2022 : 12558 - 12568 .

浏览量

569

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

沙尘图像视觉增强技术综述

节奏舞者：基于关键动作转换图和有条件姿态插值网络的3D舞蹈生成方法研究

基于改进YOLOv8的高分辨率遥感图像目标检测算法

情感语音合成综述

基于生成对抗网络的多特征融合去雾技术