Scene Classification, Data Cleaning, and Comment Summarization for Large-Scale Location Databases

被引:0
|
作者
Cheng, Hsu-Yung [1 ]
Yu, Chih-Chang [2 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 320, Taiwan
[2] Chun Yuan Christian Univ, Dept Informat & Comp Engn, Taoyuan 320, Taiwan
关键词
image analysis; image classification; deep learning; natural language processing;
D O I
10.3390/electronics11131947
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a framework that can automatically analyze the images and comments in user-uploaded location databases. The proposed framework integrates image processing and natural language processing techniques to perform scene classification, data cleaning, and comment summarization so that the cluttered information in user-uploaded databases can be presented in an organized way to users. For scene classification, RGB image features, segmentation features, and the features of discriminative objects are fused with an attention module to improve classification accuracy. For data cleaning, incorrect images are detected using a multilevel feature extractor and a multiresolution distance calculation scheme. Finally, a comment summarization scheme is proposed to overcome the problems of unstructured sentences and the improper usage of punctuation marks, which are commonly found in customer reviews. To validate the proposed framework, a system that can classify and organize scenes and comments for hotels is implemented and evaluated. Comparisons with existing related studies are also performed. The experimental results validate the effectiveness and superiority of the proposed framework.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Missing data imputation and corrected statistics for large-scale behavioral databases
    Courrieu, Pierre
    Rey, Arnaud
    BEHAVIOR RESEARCH METHODS, 2011, 43 (02) : 310 - 330
  • [22] Emotional modelling and classification of a large-scale collection of scene images in a cluster environment
    Cao, Jianfang
    Li, Yanfei
    Tian, Yun
    PLOS ONE, 2018, 13 (01):
  • [23] Scalable Tabular Metadata Location and Classification in Large-Scale Structured Datasets
    Islam, Kazi
    Gubanov, Michael
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT I, 2021, 12923 : 35 - 50
  • [24] Preserving Location Privacy on the Release of Large-scale Mobility Data
    Hu, Xueheng
    Striegel, Aaron
    2013 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2013, : 838 - 843
  • [25] An effective and efficient data cleaning technique in large databases
    Ji, Z
    Han, L
    IKE '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGNINEERING, 2004, : 501 - 504
  • [26] The automatic construction of large-scale corpora for summarization research
    Marcu, D
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 137 - 144
  • [27] BIGPATENT: A Large-Scale Dataset for Abstractive and Coherent Summarization
    Sharma, Eva
    Li, Chen
    Wang, Lu
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2204 - 2213
  • [28] Efficient Online Summarization of Large-Scale Dynamic Networks
    Qu, Qiang
    Liu, Siyuan
    Zhu, Feida
    Jensen, Christian S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (12) : 3231 - 3245
  • [29] LANS: Large-scale Arabic News Summarization Corpus
    Alhamadani, Abdulaziz
    Zhang, Xuchao
    He, Jianfeng
    Khatri, Aadyant
    Lu, Chang-Tien
    ArabicNLP 2023 - 1st Arabic Natural Language Processing Conference, Proceedings, 2023, : 89 - 100
  • [30] A large-scale digital library system to integrate heterogeneous data of distributed databases
    Di Giacomo, M
    Martinez, M
    Scott, J
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 391 - 397