Mining sequential patterns with flexible constraints from MOOC data

被引:8
|
作者
Song, Wei [1 ]
Ye, Wei [1 ]
Fournier-Viger, Philippe [2 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Sequential pattern; MOOC; Support with flexible constraints; Downward closure property;
D O I
10.1007/s10489-021-03122-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online learning is playing an increasingly important role in education. Massive open online course (MOOC) platforms are among the most important tools in online learning, and record historical learning data from an extremely large number of learners. To enhance the learning experience, a promising approach is to apply sequential pattern mining (SPM) to discover useful knowledge in these data. In this paper, mining sequential patterns (SPs) with flexible constraints in MOOC enrollment data is proposed, which follows that research approach. Three constraints are proposed: the length constraint, discreteness constraint, and validity constraint. They are used to describe the effect of the length of enrollment sequences, variance of enrollment dates, and enrollment moments, respectively. To improve the mining efficiency, the three constraints are pushed into the support, which is the most typical parameter in SPM, to form a new parameter called support with flexible constraints (SFC). SFC is proved to satisfy the downward closure property, and two algorithms are proposed to discover SPs with flexible constraints. They traverse the search space in a breadth-first and depth-first manner. The experimental results demonstrate that the proposed algorithms effectively reduce the number of patterns, with comparable performance to classical SPM algorithms.
引用
收藏
页码:16458 / 16474
页数:17
相关论文
共 50 条
  • [31] Collaboratively mining sequential patterns over private data
    Zhan, Justin
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 3820 - 3823
  • [32] Mining Negative Sequential Rules from Negative Sequential Patterns
    Sun, Chuanhou
    Jiang, Xiaoqi
    Dong, Xiangjun
    Xu, Tiantian
    Zhao, Long
    Li, Zhao
    Zhao, Yuhai
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 459 - 475
  • [33] Mining sequential patterns from probabilistic databases
    Muzammal, Muhammad
    Raman, Rajeev
    KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (02) : 325 - 358
  • [34] Mining sequential patterns from probabilistic databases
    Muhammad Muzammal
    Rajeev Raman
    Knowledge and Information Systems, 2015, 44 : 325 - 358
  • [35] Mining Sequential Patterns from Probabilistic Databases
    Muzammal, Muhammad
    Raman, Rajeev
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT II: 15TH PACIFIC-ASIA CONFERENCE, PAKDD 2011, 2011, 6635 : 210 - 221
  • [36] Privacy preserving data mining of sequential patterns for network traffic data
    Kim, Seung-Woo
    Park, Sanghyun
    Won, Jung-Im
    Kim, Sang-Wook
    ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 201 - +
  • [37] Privacy preserving data mining of sequential patterns for network traffic data
    Kim, Seung-Woo
    Park, Sanghyun
    Won, Jung-Im
    Kim, Sang-Wook
    INFORMATION SCIENCES, 2008, 178 (03) : 694 - 713
  • [38] Mining Probabilistic Frequent Spatio-Temporal Sequential Patterns with Gap Constraints from Uncertain Databases
    Li, Yuxuan
    Bailey, James
    Kulik, Lars
    Pei, Jian
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 448 - 457
  • [39] A framework for mining sequential patterns from spatio-temporal event data sets
    Huang, Yan
    Zhang, Liqin
    Zhang, Pusheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (04) : 433 - 448
  • [40] Sequential patterns mining and gene sequence visualization to discover novelty from microarray data
    Sallaberry, A.
    Pecheur, N.
    Bringay, S.
    Roche, M.
    Teisseire, M.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (05) : 760 - 774