Exploring the Fusion of Animation and Computer Vision for Enhanced Realism in Virtual Character Interaction

被引:0
|
作者
Zheng, Fei [1 ]
Zhu, Yu [1 ]
机构
[1] Xi'an Technological University, School of Art and Media, Xi'an,710000, China
关键词
Adversarial machine learning - Animation - Damage detection - Failure analysis - Image enhancement - Image quality - Image texture - Interactive computer graphics - Multilayer neural networks - Negative bias temperature instability - Reliability analysis - Virtual reality;
D O I
10.1109/ACCESS.2024.3519292
中图分类号
学科分类号
摘要
This research investigates the enhancement of virtual character animations and interactions through the integration of Generative Adversarial Networks (GANs) and Particle Swarm Optimization (PSO). The main objective was to improve the quality of generated animations by optimizing GAN hyperparameters to achieve better texture fidelity, animation consistency, and computational efficiency. Hyperparameters are the settings used to control the learning process of an algorithm (e.g., learning rate, number of layers in a neural network). Unlike model parameters, which are learned from the data, hyperparameters are predefined and must be tuned to improve model performance. GANs were utilized to generate realistic textures and animations, with the model architecture including a generator and discriminator. Training was performed using a comprehensive dataset of real-world animations. To optimize the GAN's performance, PSO was applied to adjust key hyperparameters such as learning rates, batch sizes, and network layers. Notable improvements were observed, including an increase in SSIM (Structural Similarity Index) scores from 0.75 to 0.88 and a rise in PSNR (Peak Signal-to-Noise Ratio) values from 24.5 dB to 28.7 dB. SSIM is a metric that evaluates the similarity between two images by comparing their structural information, luminance, and contrast. A higher SSIM score means the generated image is more similar to the reference image, indicating better visual quality. PSNR is a metric used to measure the quality of an image compared to a reference image. It compares the maximum possible signal (image quality) to the noise (errors or distortions) introduced in the generated image, with higher PSNR values indicating better image quality. The FID score was reduced from 50.3 to 30.2, indicating enhanced visual realism. Expert review scores for overall impression and user interaction quality also showed significant increases, demonstrating the effectiveness of the optimization. Additionally, computational efficiency was improved, with training time reduced from 50 hours to 35 hours and inference time decreased from 0.80 to 0.60 seconds per frame. The implications of these findings are considerable for virtual character design across gaming, film, and virtual reality industries. The research provides a solid framework for achieving more immersive and realistic virtual environments. The approach of combining PSO with GANs presents a promising method for advancing animation and computer vision technologies in future developments. © 2013 IEEE.
引用
收藏
页码:194816 / 194828
相关论文
共 22 条
  • [21] Enhanced breast cancer diagnosis through integration of computer vision with fusion based joint transfer learning using multi modality medical images
    S. Iniyan
    M. Senthil Raja
    R. Poonguzhali
    A. Vikram
    Janjhyam Venkata Naga Ramesh
    Sachi Nandan Mohanty
    Khasim Vali Dudekula
    Scientific Reports, 14 (1)