Knowledge expansion of metadata using script mining analysis in multimedia recommendation

被引:0
|
作者
Joo-Chang Kim
Kyung-Yong Chung
机构
[1] Kyonggi University,Data Mining Lab., Department of Computer Science
[2] Kyonggi University,Division of Computer Science and Engineering
来源
关键词
Multimedia; Data mining; Knowledge discovery; Recommendation; Metadata;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a method for knowledge expansion of metadata using script mining analysis for multimedia recommendation systems is proposed. The method allows the extraction of new metadata and knowledge expansion through the mining analysis of multimedia scripts, which include a large amount of information. The scripts are collected by a Web crawler based on Python. From the collected scripts, hidden information is extracted through keyword analysis and sentiment analysis. In keyword analysis, scripts, unlike general documents, show a high frequency of names of characters or proper nouns. Such names or proper nouns are not frequently used in other media content, and therefore, their importance is high. Frequently, they are already offered in the conventional metadata, and consequently cause information duplication. Accordingly, term frequency–inverse document and metadata frequency (TF–IDMF), which considers the frequency of metadata in general term frequency–inverse document frequency (TF–IDF), is used. Thus, the importance of the names of characters or proper nouns in scripts can be decreased. Because the keywords for the extracted scripts are in fact included in the scripts, they can be used for precise multimedia search and recommendation. In sentiment analysis, the AFINN lexicon and the Bing lexicon are utilized to scan words in a script. The Bing lexicon is used to examine whether the words in the entire script are positive or negative. Then, the total numbers of positive words and negative words are used to calculate the representative sentiment of the script. The AFINN lexicon includes approximately 170 sentiment words, the negative or positive sentiment of which is presented in the range − 5 to +5. One script is divided into 100 sentences, and then, the representative sentiment in each sentence is evaluated as either positive or negative. Through script scanning, the flow of sentiment in multimedia streams can be discovered. The Bing lexicon categorizes words into positive, negative, and neutral sentiments. Through script scanning, the words included in each category can be quantified. Depending on the result of the script sentiment analysis, a different sentence embedding method based on inter-sentence similarity is used to cluster similar media. The results of the keyword analysis and sentiment analysis of a script are added to the metadata in a new column in a knowledge base to expand knowledge. To evaluate the significance of multimedia recommendations, keywords and sentiment information are used, and then, the similarity and clustering of the extracted media are assessed. As a result, script mining analysis based on the attributes that include actual information of media is considerably better than that based on types or a range of metadata attributes. Therefore, the proposed knowledge expansion method achieves significant results and shows an excellent performance in multimedia recommendation.
引用
收藏
页码:34679 / 34695
页数:16
相关论文
共 50 条
  • [1] Knowledge expansion of metadata using script mining analysis in multimedia recommendation
    Kim, Joo-Chang
    Chung, Kyung-Yong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (26-27) : 34679 - 34695
  • [2] Mining Latent Structures for Multimedia Recommendation
    Zhang, Jinghao
    Zhu, Yanqiao
    Liu, Qiang
    Wu, Shu
    Wang, Shuhui
    Wang, Liang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3872 - 3880
  • [3] Mining knowledge from text collections using automatically generated metadata
    Pierre, JM
    PRACTICAL ASPECTS OF KNOWLEDGE MANAGEMENT, 2002, 2569 : 537 - 548
  • [4] Domain discovery and expansion in multimedia metadata for digital TV broadcast
    Federal University of Ceará, Ceará, Brazil
    不详
    Int. J. Metadata Semant. Ontol., 3 (212-218):
  • [5] Metadata management for web mining and knowledge management
    Laware, GW
    IC'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, VOLS 1 AND 2, 2003, : 39 - 45
  • [6] Multimedia Semantics: Metadata, Analysis and Interaction
    Isfandyari-Moghaddam, Alireza
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2012, 32 (05) : 495 - 496
  • [7] Multimedia recommendation using Word2Vec-based social relationship mining
    Baek, Ji-Won
    Chung, Kyung-Yong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (26-27) : 34499 - 34515
  • [8] Multimedia recommendation using Word2Vec-based social relationship mining
    Ji-Won Baek
    Kyung-Yong Chung
    Multimedia Tools and Applications, 2021, 80 : 34499 - 34515
  • [9] Olympus:: Personal knowledge recommendation using agents, ontologies and web mining
    De Rezende, Juliana Lucas
    Pereira, Vinicios Batista
    Xexeo, Geraldo
    De Souza, Jano Moreira
    COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN III, 2007, 4402 : 53 - +
  • [10] Metadata Retrieval Using RTCP for Multimedia Streaming
    Kum, Seung-woo
    Lim, Tae Beom
    Lee, Seok Pil
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 815 - 820