Two-level clustering of UML class diagrams based on semantics and structure

被引:5
|
作者
Ma, Zongmin [1 ]
Yuan, Zhongchen [2 ,3 ]
Yan, Li [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Shenyang Univ Technol, Sch Chem Proc Automat, Liaoyang 111004, Peoples R China
[3] Northeastern Univ, Sch Software, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Reuse; UML class diagram; Retrieval; Clustering; Similarity measure; Feature similarity; SOFTWARE COMPONENT; ALGORITHM; CLASSIFICATION; RETRIEVAL; REUSE;
D O I
10.1016/j.infsof.2020.106456
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: The reuse of software design has been an important issue of software reuse. UML class diagrams are widely applied in software design and has become DE factor standard. As a result, the reuse of UML class diagrams has received more attention. With the increasing number of class diagrams stored in reuse repository, their retrieval becomes a time-consuming job. The clustering can narrow down retrieval range and improve the retrieval efficiency. But few efforts have been done in clustering UML class diagrams. This paper tries to propose a clustering approach for UML class diagrams. Objective: This paper proposes a two-level clustering of UML class diagrams, namely, semantic clustering and structural clustering. The UML class diagrams stored in reuse repository are clustered into a few domains based on semantics in the first level and a few categories based on structure in the second level. Method: We propose a clustering algorithm named CUFS, in which the idea of partitioning and hierarchical clustering is combined and feature similarity is proposed for the similarity measure between two clusters in order to merge clusters. A better feature representation of a cluster, namely, feature class diagram, is proposed in this paper. In order to form each sub-cluster, the semantic and structural similarities between UML class diagrams are defined, respectively. Results: A series of experimental results show that, the proposed feature similarity measure not only speeds up the clustering process, but also expresses the closeness degree between clusters for merging clusters. The proposed algorithm shows a good clustering quality and efficiency under the condition of different size and distribution of UML class diagrams. Conclusion: It is concluded that the proposed two-level clustering method considers both semantics and structure contained in a class diagram, which can flexibly adapt to different clustering requirements. Also, the proposed clustering algorithm performs better than other related algorithms, regardless of in semantic, structural and hybrid clustering.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Integration of UML Class Diagrams Based on Semantics and Structure
    Yuan, Zhongchen
    Hu, Xingda
    Zhang, Gang
    Ma, Zongmin
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2024, 34 (08) : 1281 - 1312
  • [2] Retrieval of UML Class Diagrams Based on Semantics and Structure
    Yuan, Zhong-Chen
    Ma, Zong-Min
    [J]. Dongbei Daxue Xuebao/Journal of Northeastern University, 2020, 41 (01): : 23 - 28
  • [3] Towards the Mechanized Semantics and Refinement of UML Class Diagrams
    Sheng, Feng
    Zhu, Huibiao
    Yang, Zongyuan
    [J]. 2019 26TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC), 2019, : 47 - 54
  • [4] Semantics-based weaving of UML sequence diagrams
    Gronmo, Roy
    Sorensen, Fredrik
    Moller-Pedersen, Birger
    Krogdahl, Stein
    [J]. THEORY AND PRACTICE OF MODEL TRANSFORMATIONS, 2008, 5063 : 122 - 136
  • [5] MEASURING STRUCTURE COMPLEXITY OF UML CLASS DIAGRAMS
    Zhou Yuming Xu Baowen (Dept. of Computer Science & Eng.
    [J]. Journal of Electronics(China), 2003, (03) : 227 - 231
  • [6] Substation Load Clustering Based on Substation-consumer Two-level Structure
    He, Zhenan
    Wu, Hao
    Cheng, Xiang
    Zhan, Zhenbin
    Sun, Weizhen
    [J]. Dianwang Jishu/Power System Technology, 2019, 43 (08): : 2983 - 2990
  • [7] A NEW CLASS OF LINEAR CODES WITH TWO-LEVEL STRUCTURE
    Wang Haiyan (Dept. of Educational Communication and Technology
    [J]. Journal of Electronics(China), 2000, (01) : 90 - 93
  • [8] Evaluating structure complexity metric for UML class diagrams
    Xu Shenghua
    Yi Tong
    Wu Fangjun
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03) : 389 - 392
  • [9] A two-level method for clustering DTDs
    Qian, WN
    Zhang, L
    Liang, YQ
    Qian, HL
    Jin, W
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 41 - 52
  • [10] Multi-View Clustering Based on Two-Level Weights
    Du, Guowang
    Zhou, Lihua
    Wang, Lizhen
    Du, Jingwei
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (04): : 907 - 921