An Approach to Applying Multiple Linear Regression Models by Interlacing Data in Classifying Similar Software

被引:0
|
作者
Lim, Hyun-il [1 ]
机构
[1] Kyungnam Univ, Dept Comp Engn, Chang Won, South Korea
来源
关键词
Machine Learning; Linear Regression; Similar Software Classification; Software Analysis;
D O I
10.3745/JIPS.04.0241
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The development of information technology is bringing many changes to everyday life, and machine learning can be used as a technique to solve a wide range of real-world problems. Analysis and utilization of data are essential processes in applying machine learning to real-world problems. As a method of processing data in machine learning, we propose an approach based on applying multiple linear regression models by interlacing data to the task of classifying similar software. Linear regression is widely used in estimation problems to model the relationship between input and output data. In our approach, multiple linear regression models are generated by training on interlaced feature data. A combination of these multiple models is then used as the prediction model for classifying similar software. Experiments are performed to evaluate the proposed approach as compared to conventional linear regression, and the experimental results show that the proposed method classifies similar software more accurately than the conventional model. We anticipate the proposed approach to be applied to various kinds of classification problems to improve the accuracy of conventional linear regression.
引用
收藏
页码:268 / 281
页数:14
相关论文
共 50 条
  • [41] Regression models for exceedance data: a new approach
    Bourguignon, Marcelo
    do Nascimento, Fernando Ferraz
    STATISTICAL METHODS AND APPLICATIONS, 2021, 30 (01): : 157 - 173
  • [42] An Impact of Linear Regression Models for Improving the Software Quality with Estimated Cost
    Marandi, Arun Kumar
    Khan, Danish Ali
    ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 335 - 342
  • [43] Using linear regression models to analyse the effect of software process improvement
    Schalken, Joost
    Brinkkemper, Sjaak
    van Vliet, Hans
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROCEEDINGS, 2006, 4034 : 234 - 248
  • [44] Software Effort Estimation with Multiple Linear Regression: Review and Practical Application
    Fedotovai, Olga
    Teixeira, Leonor
    Alvelos, Helena
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (05) : 925 - 945
  • [45] Applying Mechanisms of Data Profiling for Assuring Data Quality in the software: a first approach
    Guerra-Garcia, Cesar
    Perez-Gonzalez, Hector G.
    Martinez-Perez, Francisco
    Juarez-Ramirez, Reyes
    Jimenez, Samantha
    2023 11TH INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION, CONISOFT 2023, 2023, : 108 - 115
  • [46] Improved multiple linear regression based models for solar collectors
    Kicsiny, Richard
    RENEWABLE ENERGY, 2016, 91 : 224 - 232
  • [47] Parallel maximum likelihood estimator for multiple linear regression models
    Guo, Guangbao
    You, Wenjie
    Qian, Guoqi
    Shao, Wei
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2015, 273 : 251 - 263
  • [48] Multiple change-point analysis for linear regression models
    Loschi, Rosangela H.
    Pontel, Jeanne G.
    Cruz, Frederico R. B.
    CHILEAN JOURNAL OF STATISTICS, 2010, 1 (02): : 93 - 112
  • [49] Link between orthogonal and standard multiple linear regression models
    Soskic, M
    Plavsic, D
    Trinajstic, N
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (04): : 829 - 832
  • [50] A study of partial F tests for multiple linear regression models
    Jamshidian, Mortaza
    Jennrich, Robert I.
    Liu, Wei
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (12) : 6269 - 6284