Generative AI-based style recommendation using fashion item detection and classification

被引:0
|
作者
Kalinin, Aleksandr [1 ]
Jafari, Akbar Anbar [1 ]
Avots, Egils [1 ]
Ozcinar, Cagri [1 ]
Anbarjafari, Gholamreza [2 ,3 ,4 ]
机构
[1] Univ Tartu, iCV Lab, Tartu, Estonia
[2] 3S Holding OU, Uus Veeriku Tee 1, EE-62220 Tartu, Estonia
[3] PwC Advisory, Helsinki, Finland
[4] iVCV OU, Purpuri 12-2, EE-51011 Tartu, Estonia
关键词
Generative AI; Deep learning; Object detection; Image processing; Fashion;
D O I
10.1007/s11760-024-03538-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This research work focuses on the creation of a cutting-edge style recommendation system that uses generative AI and deep learning approaches to analyse fashion photos. The system is intended to process input images, such as selfies or studio-quality photos, and output a text file with extensive feedback on the individual's style and suggestions for improvement. The system consists of two main components: the YOLOv8 convolutional neural network trained on DeepFashion2 dataset, which detects and crops clothing items, and the GPT-4.0 large language model, which generates informative style commentary and recommendations. YOLOv8 is briefly trained on a specific dataset to improve its performance in recognising 10 different types of clothes, while GPT-4.0, which is accessible via the OpenAI API, is charged with giving cohesive and short style suggestions. To evaluate the success of the suggested solution, real experimental trials were conducted at many events in Madrid and Tallinn. Three well-known AI models were used for comparison: OpenAI's GPT-4.0 Vision, Google's Gemini 1.5 Pro as reported by Reid et al. (Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context, 2024), and Anthropic's Claude 3 (Templeton et al. in Tansform Circuits Thread, 2024)-Opus. Participants judged the quality of each model's fashion recommendations. The results showed that GPT-4.0 Vision and Gemini 1.5 Pro had comparable average ratings, indicating higher perceived quality than Claude 3-Opus. This research work demonstrates how cutting-edge computer vision and natural language processing technology may transform personalised fashion advising services, improving accuracy and relevance of style recommendations.
引用
收藏
页码:9179 / 9189
页数:11
相关论文
共 50 条
  • [1] AN AI-BASED CLASSIFICATION AND RECOMMENDATION SYSTEM FOR DIGITAL LIBRARIES
    Alomran, Abdulaziz I.
    Basha, Imtiaz
    [J]. Scalable Computing, 2024, 25 (04): : 3181 - 3199
  • [2] An Enhanced AI-Based Network Intrusion Detection System Using Generative Adversarial Networks
    Park, Cheolhee
    Lee, Jonghoon
    Kim, Youngsoo
    Park, Jong-Geun
    Kim, Hyunjin
    Hong, Dowon
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) : 2330 - 2345
  • [3] Item-region-based style classification network (IRSN): a fashion style classifier based on domain knowledge of fashion experts
    Choi, Jinyoung
    Kwon, Youngchae
    Kim, Injung
    [J]. APPLIED INTELLIGENCE, 2024, 54 (20) : 9579 - 9593
  • [4] Prediction and Detection of Ventricular Fibrillation Using Complex Features and AI-Based Classification
    Fira, Monica
    Costin, Hariton-Nicolae
    Goras, Liviu
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [5] AI-based carcinoma detection and classification using histopathological images: A systematic review
    Prabhu, Swathi
    Prasad, Keerthana
    Robels-Kelly, Antonio
    Lu, Xuequan
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 142
  • [6] Toward Intelligent Design: An AI-Based Fashion Designer Using Generative Adversarial Networks Aided by Sketch and Rendering Generators
    Yan, Han
    Zhang, Haijun
    Liu, Linlin
    Zhou, Dongliang
    Xu, Xiaofei
    Zhang, Zhao
    Yan, Shuicheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2323 - 2338
  • [7] An AI-Based Exercise Prescription Recommendation System
    Chen, Hung-Kai
    Chen, Fueng-Ho
    Lin, Shien-Fong
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (06):
  • [8] Fashion recommendation based on style and social events
    Federico Becattini
    Lavinia De Divitiis
    Claudio Baecchi
    Alberto Del Bimbo
    [J]. Multimedia Tools and Applications, 2023, 82 : 38217 - 38232
  • [9] Fashion recommendation based on style and social events
    Becattini, Federico
    De Divitiis, Lavinia
    Baecchi, Claudio
    Bimbo, Alberto Del
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (24) : 38217 - 38232
  • [10] A BONE FRACTURE DETECTION USING AI-BASED TECHNIQUES
    Mehta, Rushabh
    Pareek, Preksha
    Jayaswal, Ruchi
    Patil, Shruti
    Vyas, Kishan
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2023, 24 (02): : 161 - 171