An Approach to Applying Multiple Linear Regression Models by Interlacing Data in Classifying Similar Software

被引:0
|
作者
Lim, Hyun-il [1 ]
机构
[1] Kyungnam Univ, Dept Comp Engn, Chang Won, South Korea
来源
关键词
Machine Learning; Linear Regression; Similar Software Classification; Software Analysis;
D O I
10.3745/JIPS.04.0241
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The development of information technology is bringing many changes to everyday life, and machine learning can be used as a technique to solve a wide range of real-world problems. Analysis and utilization of data are essential processes in applying machine learning to real-world problems. As a method of processing data in machine learning, we propose an approach based on applying multiple linear regression models by interlacing data to the task of classifying similar software. Linear regression is widely used in estimation problems to model the relationship between input and output data. In our approach, multiple linear regression models are generated by training on interlaced feature data. A combination of these multiple models is then used as the prediction model for classifying similar software. Experiments are performed to evaluate the proposed approach as compared to conventional linear regression, and the experimental results show that the proposed method classifies similar software more accurately than the conventional model. We anticipate the proposed approach to be applied to various kinds of classification problems to improve the accuracy of conventional linear regression.
引用
收藏
页码:268 / 281
页数:14
相关论文
共 50 条
  • [1] A Linear Regression Approach to Modeling Software Characteristics for Classifying Similar Software
    Lim, Hyun-il
    2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2019, : 942 - 943
  • [2] Data depth approach in fitting linear regression models
    Muthukrishnan, R.
    Kalaivani, S.
    MATERIALS TODAY-PROCEEDINGS, 2022, 57 : 2212 - 2215
  • [3] Applying constrained linear regression models to predict interval-valued data
    Neto, EDL
    de Carvalho, FDT
    Freire, ES
    KI2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3698 : 92 - 106
  • [4] Multiple criteria linear programming approach to data mining: Models, algorithm designs and software development
    Kou, G
    Liu, XT
    Peng, Y
    Shi, Y
    Wise, M
    Xu, WX
    OPTIMIZATION METHODS & SOFTWARE, 2003, 18 (04): : 453 - 473
  • [5] A multiple imputation approach to linear regression with clustered censored data
    Pan, W
    Connett, JE
    LIFETIME DATA ANALYSIS, 2001, 7 (02) : 111 - 123
  • [6] A Multiple Imputation Approach to Linear Regression with Clustered Censored Data
    Wei Pan
    John E. Connett
    Lifetime Data Analysis, 2001, 7 : 111 - 123
  • [7] Multiple linear regression models for random intervals: a set arithmetic approach
    Garcia-Barzana, Marta
    Belen Ramos-Guajardo, Ana
    Colubi, Ana
    Kontoghiorghes, Erricos J.
    COMPUTATIONAL STATISTICS, 2020, 35 (02) : 755 - 773
  • [8] Multiple linear regression models for random intervals: a set arithmetic approach
    Marta García-Bárzana
    Ana Belén Ramos-Guajardo
    Ana Colubi
    Erricos J. Kontoghiorghes
    Computational Statistics, 2020, 35 : 755 - 773
  • [9] DATA ENVELOPMENT ANALYSIS WITH MISSING DATA: A MULTIPLE LINEAR REGRESSION ANALYSIS APPROACH
    Chen, Ya
    Li, Yongjun
    Wu, Huaqing
    Liang, Liang
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2014, 13 (01) : 137 - 153
  • [10] Comparative Assessment of Multiple Linear Regression and Fuzzy Linear Regression Models
    Pandit P.
    Dey P.
    Krishnamurthy K.N.
    SN Computer Science, 2021, 2 (2)