Extracting loosely structured data records through mining strict patterns

Cited by: 4
Authors
Wu, Yipu [1]
Chen, Jing [1]
Li, Qing [1]
Affiliations
[1] City Univ Hong Kong, Dept Comp Sci, 83 Tat Chee Avenue, Kowloon, Hong Kong, Peoples R China
DOI
10.1109/ICDE.2008.4497543
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Extracting loosely structured data records (DRs) has wide applications in many domains, such as forum pattern recognition, blog data analysis, and book and news review analysis. Existing methods work well only for strongly structured DRs. In this paper, we address the problem of extracting loosely structured DRs by mining strict patterns. Our method uses both content features and tag-tree features to recognize loosely structured DRs, and we propose a new approach that extracts the DRs automatically. An experimental study demonstrates that the method is both effective and robust in practice.
Pages: 1322 / +
Page count: 2
Related papers
50 in total
  • [31] Mining truck platooning patterns through massive trajectory data
    Ma, Xiaolei
    Huo, Enze
    Yu, Haiyang
    Li, Honghai
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 221
  • [32] An approach to extracting complex knowledge patterns among concepts belonging to structured, semi-structured and unstructured sources in a data lake
    Lo Giudice, Paolo
    Musarella, Lorenzo
    Sofo, Giuseppe
    Ursino, Domenico
    [J]. INFORMATION SCIENCES, 2019, 478 : 606 - 626
  • [33] Extracting cyber communities through patterns
    Argyros, T
    Ermopoulos, C
    Pavlaki, V
    Al-Said, N
    [J]. PROCEEDINGS OF THE THIRD SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2003, : 259 - 263
  • [34] Mining GPS Data for Extracting Significant Places
    Agamennoni, Gabriel
    Nieto, Juan
    Nebot, Eduardo
    [J]. ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 1860 - 1867
  • [35] Warehousing structured and unstructured data for data mining
    Miller, LL
    Honavar, V
    Barta, T
    [J]. PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1997, 34 : 215 - 224
  • [36] Warehousing structured and unstructured data for data mining
    Miller, LL
    Honavar, V
    Barta, T
    [J]. ASIS '97 - PROCEEDINGS OF THE 60TH ASIS ANNUAL MEETING, VOL 34 1997, 1997, 34 : 215 - 224
  • [37] Extracting agent specifications by using data mining
    Miyaharai, Tetsuhiro
    Matsumoto, Kazunori
    Nagai, Yasuo
    Takahashi, Kenichi
    Ueda, Hiroaki
    [J]. WMSCI 2005: 9TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL 1, 2005, : 387 - 391
  • [38] Scalable Crowd Ideation Support through Data Visualization, Mining, and Structured Workflows
    Girotto, Victor
    Walker, Erin
    Burleson, Winslow
    [J]. CSCW'17: COMPANION OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, : 183 - 186
  • [39] A polynomial time matching algorithm of structured ordered tree patterns for data mining from semistructured data
    Suzuki, Y
    Inomae, K
    Shoudai, T
    Miyahara, T
    Uchida, T
    [J]. INDUCTIVE LOGIC PROGRAMMING, 2003, 2583 : 270 - 284
  • [40] Efficient algorithms for mining frequent and closed patterns from semi-structured data
    Arimura, Hiroki
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 2 - +