Automatic Classification of Scientific Groups as Productive: An Approach based on Motif Analysis

被引:0
|
作者
Chakraborty, Tanmoy [1 ]
Ganguly, Niloy [1 ]
Mukherjee, Animesh [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Kharagpur 721302, W Bengal, India
关键词
NETWORK MOTIFS; PATTERNS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the key aspects instrumental in the advancement of science relates to "team science," or in other words "group" collaborations. There have been extensive studies analyzing various statistical properties of collaborations of individual or pairs of authors. However, the number of studies pertaining to groups/teams of scientists working together is limited in number. In this paper, we set an objective to study the productivity of group collaborations where groups are represented as small substructures usually termed as network motifs in the literature. A preliminary observation is that star-like motifs have the largest productivity (defined as a function of citation count) followed by 4-cliques. We then introduce a bunch of features and study their individual relations with the productivity of a team. Building on these observations, we develop a supervised classification model that can automatically distinguish the highly productive teams from the low productive ones based on the set of identified features. The accuracy of the classification is 82% on an average for all the motifs with the accuracy reaching as high as 95% for 4-cliques. Finally, we present a detailed analysis of the time-transition behavior of different motifs along with some of the real world highly productive motifs found in our dataset. This empirical study is a first step toward the development of a full-fledged recommendation system that can predict how productive a team would be in the future.
引用
收藏
页码:130 / 137
页数:8
相关论文
共 50 条
  • [1] The automatic classification methods to help scientific data analysis
    Bataille, N
    Bourret, P
    Bussenot, JL
    Rousselot, JY
    Carrou, A
    [J]. DASIA '97 - DATA SYSTEMS IN AEROSPACE, 1997, 409 : 97 - 101
  • [2] Automatic classification of sunspot groups for space weather analysis
    [J]. 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (08):
  • [3] An unsupervised approach to automatic classification of scientific literature utilizing bibliographic metadata
    Joorabchi, Arash
    Mahdi, Abdulhussain E.
    [J]. JOURNAL OF INFORMATION SCIENCE, 2011, 37 (05) : 499 - 514
  • [4] A tree-based approach for motif discovery and sequence classification
    Yan, Rui
    Boutros, Paul C.
    Jurisica, Igor
    [J]. BIOINFORMATICS, 2011, 27 (15) : 2054 - 2061
  • [5] Analysis of scientific research groups with greater productive applicability in Brazil: capacities and interactions with firms
    Caliari, Thiago
    Chiarini, Tulio
    [J]. APUNTES-REVISTA DE CIENCIAS SOCIALES, 2018, 45 (82): : 71 - 98
  • [6] SYSTEM FOR AUTOMATIC CLASSIFICATION OF SCIENTIFIC LITERATURE
    GARFIELD, E
    MALIN, MV
    SMALL, H
    [J]. JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE, 1975, 57 (02): : 61 - 74
  • [7] Pen ink discrimination in handwritten documents using statistical and motif texture analysis : A classification based approach
    Dansena, Prabhat
    Bag, Soumen
    Pal, Rajarshi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (21) : 30881 - 30909
  • [8] Classification of Scientific Workflows Based on Reproducibility Analysis
    Banati, A.
    Kacsuk, P.
    Kozlovszky, M.
    [J]. 2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 327 - 331
  • [9] AI-BASED APPROACH TO AUTOMATIC SLEEP CLASSIFICATION
    KUBAT, M
    PFURTSCHELLER, G
    FLOTZINGER, D
    [J]. BIOLOGICAL CYBERNETICS, 1994, 70 (05) : 443 - 448
  • [10] A multilingual approach to the classification of questions based on automatic learning
    Tomas, David
    Vicedo, Jose. L.
    Suarez, Armando
    Bisbal, Empar
    Moreno, Lidia
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 391 - 398