Graph Clustering for Large-Scale Text-Mining of Brain Imaging Studies

被引:0
|
作者
Chawla, Manisha [1 ]
Mesa, Mounika [2 ]
Miyapuram, Krishna P. [3 ]
机构
[1] Indian Inst Technol, Ctr Cognit Sci, Ahmadabad, Gujarat, India
[2] Rajiv Gandhi Univ Knowledge Technol, Dept Comp Sci & Engn, Basar, India
[3] Indian Inst Technol, Cognit Sci & Comp Sci, Ahmadabad, Gujarat, India
关键词
NeuroSynth; Meta-Analysis; functional MRI; Jaccard metric; clustering; graph-theory; METAANALYSIS; EMOTION;
D O I
10.1145/2791405.2791490
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a Graph clustering method for identifying functionally similar key concepts for meta-analysis of brain imaging studies. We use an existing database of key concepts created by a large-scale automated text mining of brain imaging studies. The key concepts here refer to specific psychological terms of interest (for instance, 'decision', 'memory' etc) identified based on their frequency of occurrence (>1 in 1,000 words) in an individual article text of 5809 studies. The pair-wise distance between all 525 nodes was calculated using the Jaccard metric. A graph was created with 525 nodes representing the key concepts. An undirected edge was drawn from every node to the node with minimum distance. We present a clustering approach using a simple graph traversal to identify connected components so that every node belongs to exactly one cluster. The results from our clustering method reveal semantically related concepts confirming potential for further use in text-mining approaches for meta analysis of brain imaging studies.
引用
收藏
页码:163 / 168
页数:6
相关论文
共 50 条
  • [1] iTextMine: integrated text-mining system for large-scale knowledge extraction from the literature
    Ren, Jia
    Li, Gang
    Ross, Karen
    Arighi, Cecilia
    McGarvey, Peter
    Rao, Shruti
    Cowart, Julie
    Madhavan, Subha
    Vijay-Shanker, K.
    Wu, Cathy H.
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2018,
  • [2] RDBridge: a knowledge graph of rare diseases based on large-scale text mining
    Xing, Huadong
    Zhang, Dachuan
    Cai, Pengli
    Zhang, Rui
    Hu, Qian-Nan
    [J]. BIOINFORMATICS, 2023, 39 (07)
  • [3] Large-Scale Text Mining of Biomedical Literature
    Ginter, Filip
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2013, (116): : 43 - 44
  • [4] Weighted Hybrid Clustering by Combining Text Mining and Bibliometrics on a Large-Scale Journal Database
    Liu, Xinhai
    Yu, Shi
    Janssens, Frizo
    Glanzel, Wolfgang
    Moreau, Yves
    De Moor, Bart
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (06): : 1105 - 1119
  • [5] Large-Scale Clustering With Structured Optimal Bipartite Graph
    Zhang, Han
    Nie, Feiping
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9950 - 9963
  • [6] A distributed and incremental algorithm for large-scale graph clustering
    Inoubli, Wissem
    Aridhi, Sabeur
    Mezni, Haithem
    Maddouri, Mondher
    Nguifo, Engelbert Mephu
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 134 : 334 - 347
  • [7] A Novel Clustering Algorithm for Large-Scale Graph Processing
    Qu, Zhaoyang
    Ding, Wei
    Qu, Nan
    Yan, Jia
    Wang, Ling
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 349 - 358
  • [8] A Novel Clustering Algorithm on Large-Scale Graph Data
    Zhang, Hao
    Zhou, Wei
    Wan, Xiaoyu
    Fu, Ge
    Xu, Zhiyong
    Han, Jizhong
    [J]. 2014 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2014, : 47 - 54
  • [9] Exploiting and Evaluating MapReduce for Large-Scale Graph Mining
    Lai, Hung-Che
    Li, Cheng-Te
    Lo, Yi-Chen
    Lin, Shou-De
    [J]. 2012 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2012, : 434 - 441
  • [10] Large-Scale Graph Mining and Learning for Information Retrieval
    Gao, Bin
    Wang, Taifeng
    Liu, Tie-Yan
    [J]. SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1194 - 1195