Scene Classification, Data Cleaning, and Comment Summarization for Large-Scale Location Databases

被引:0
|
作者
Cheng, Hsu-Yung [1 ]
Yu, Chih-Chang [2 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 320, Taiwan
[2] Chun Yuan Christian Univ, Dept Informat & Comp Engn, Taoyuan 320, Taiwan
关键词
image analysis; image classification; deep learning; natural language processing;
D O I
10.3390/electronics11131947
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a framework that can automatically analyze the images and comments in user-uploaded location databases. The proposed framework integrates image processing and natural language processing techniques to perform scene classification, data cleaning, and comment summarization so that the cluttered information in user-uploaded databases can be presented in an organized way to users. For scene classification, RGB image features, segmentation features, and the features of discriminative objects are fused with an attention module to improve classification accuracy. For data cleaning, incorrect images are detected using a multilevel feature extractor and a multiresolution distance calculation scheme. Finally, a comment summarization scheme is proposed to overcome the problems of unstructured sentences and the improper usage of punctuation marks, which are commonly found in customer reviews. To validate the proposed framework, a system that can classify and organize scenes and comments for hotels is implemented and evaluated. Comparisons with existing related studies are also performed. The experimental results validate the effectiveness and superiority of the proposed framework.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Multiclass Classification Problem of Large-Scale Biomedical Meta Data
    Student, Sebastian
    Pieter, Justyna
    Fujarewicz, Krzysztof
    9TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING, INTER-ENG 2015, 2016, 22 : 938 - 945
  • [42] Kernel Logistic Regression Algorithm for Large-Scale Data Classification
    Elbashir, Murtada
    Wang, Jianxin
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (05) : 465 - 472
  • [43] A Fast Distributed Classification Algorithm for Large-scale Imbalanced Data
    Wang, Huihui
    Gao, Yang
    Shi, Yinghuan
    Wang, Hao
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1251 - 1256
  • [44] Detecting fraud transactions in large-scale databases
    Pabarskaite, Zidrina
    Long, James Allen
    Proceedings of the ISAT International Scientific School, 2000, : 223 - 231
  • [45] Large-scale legal reasoning with rules and databases
    Antoniou, Grigoris
    Baryannis, George
    Batsakis, Sotiris
    Tachmazidis, Ilias
    Governatori, Guido
    Islam, Mohammad Badiul
    Liu, Qing
    Robaldo, Livio
    Siragusa, Giovanni
    Journal of Applied Logics, 2021, 8 (04): : 911 - 939
  • [46] Efficient name disambiguation for large-scale databases
    Huang, Jian
    Ertekin, Seyda
    Giles, C. Lee
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 536 - 544
  • [47] RESEARCH ON THE INCOMPLETE POINT CLOUD DATA REPAIRING OF THE LARGE-SCALE SCENE BUILDINGS
    Li, Yongqiang
    Li, Lixue
    Niu, Lubiao
    Huang, Tengda
    Li, Youpeng
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 6726 - 6729
  • [48] View synthesizing for a large-scale object in a scene
    Thein, T.L.L. (tllt55@gmail.com), 1600, Universitas Ahmad Dahlan (10):
  • [49] Architecture of a large-scale location service
    Leonhardi, A
    Rothermel, K
    22ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2002, : 465 - 466
  • [50] Facility location for large-scale emergencies
    Huang, Rongbing
    Kim, Seokjin
    Menezes, Mozart B. C.
    ANNALS OF OPERATIONS RESEARCH, 2010, 181 (01) : 271 - 286