Fine-tuning Pipeline for Hand Image Generation Using Diffusion Model

Authors
Bai, Bingyuan [1]
Xie, Haoran [1]
Miyata, Kazunori [1]
Affiliations
[1] Japan Advanced Institute of Science and Technology (JAIST), Nomi, Japan
Keywords
text-to-image; hand inpainting; Stable Diffusion; ControlNet; LoRA
DOI
10.1109/NICOInt62634.2024.00020
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Hand images produced by image generative models such as Stable Diffusion may suffer from distortions. To address this issue, we introduce a hand image fine-tuning pipeline consisting of three stages: hand detection, object masking, and image inpainting. First, a hand detection model is trained to identify flawed hands and localize them with bounding boxes (Bbox). Then, these Bbox regions are masked in conjunction with MediaPipe landmarks. Finally, a ControlNet model is trained to inpaint the masked areas, and a targeted LoRA is trained to minimize boundary fragmentation. The results indicate that our method achieves better anatomical accuracy in hand reconstruction than the original diffusion model. Furthermore, introducing the directional LoRA model further improves the evaluation results.
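The masking stage can be illustrated with a minimal sketch. The abstract does not specify how the bounding box and the landmarks are combined, so the rule used here is an assumption: a pixel is masked if it lies inside the Bbox and within `radius` pixels of some landmark, and the whole Bbox is masked when no landmarks are available. The function name `hand_mask`, its parameters, and the default radius are hypothetical; landmarks are taken as (x, y) pairs normalized to [0, 1], the convention MediaPipe uses for hand landmarks.

```python
def hand_mask(height, width, bbox, landmarks=None, radius=12):
    """Build a binary inpainting mask (255 = repaint, 0 = keep) for one hand.

    bbox      -- (x0, y0, x1, y1) in pixels; hypothetical detector output.
    landmarks -- optional (x, y) pairs normalized to [0, 1].
    radius    -- assumed pixel radius around each landmark.
    """
    x0, y0, x1, y1 = bbox
    # Clip the box to the image so indexing stays in range.
    x0, y0 = max(0, int(x0)), max(0, int(y0))
    x1, y1 = min(width, int(x1)), min(height, int(y1))

    # Convert normalized landmarks to pixel coordinates.
    points = [(u * width, v * height) for u, v in (landmarks or [])]
    r2 = radius * radius

    mask = [[0] * width for _ in range(height)]
    for y in range(y0, y1):
        for x in range(x0, x1):
            # Assumed rule: without landmarks mask the whole box,
            # otherwise only pixels close to some landmark.
            if not points or any(
                (x - px) ** 2 + (y - py) ** 2 <= r2 for px, py in points
            ):
                mask[y][x] = 255
    return mask
```

A mask of this form is what an inpainting model would typically take alongside the original image. A real pipeline would more likely use array operations than nested lists, and the combination rule should be replaced by whatever the paper actually uses.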
Pages: 58-63
Number of pages: 6