Text Mining: Finding Hot Topics TF*PDF vs. LSI

被引:0
|
作者
Katyayani, J. [1 ]
Sriharsha, A. V. [2 ]
Sudhir, B. [3 ]
机构
[1] Sri Padmavathi Mahila Visva Vidyalayam, Tirupati, Andhra Pradesh, India
[2] Sree Vidyanikethan Engn Coll, Tirupati, Andhra Pradesh, India
[3] Sri Venkateswara Univ, Tirupati, Andhra Pradesh, India
关键词
Text mining; dimensionality reduction; latent-semantic indexing; IR;
D O I
10.1109/IDAACS.2009.5342925
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the vast amount of digital text materials available on the Net, it is almost impractical for people to absorb all related information in a timely manner. This problem has been overcome by erstwhile researchers and scientists of data mining. The efficiency in the methods and exploratory analysis has to be ascertained yet. Document wise term frequencies and inverted frequencies are available to calculate the statistical importance among the documents. Determining the time line importance of the documents plays very essential role than just finding the document's importance. LSI is a basic PCA approach, which is proposed with time-line approach and has been discussed comparatively in this paper.
引用
收藏
页码:526 / +
页数:2
相关论文
共 9 条
  • [1] Analysis of protein/protein interactions through biomedical literature: Text mining of abstracts vs. text mining of full text articles
    Martin, EPG
    Bremer, EG
    Guerin, MC
    DeSesa, C
    Jouve, O
    KNOWLEDGE EXPLORATION IN LIFE SCIENCE INFORMATICS, PROCEEDINGS, 2004, 3303 : 96 - 108
  • [2] How Uncanny Are Virtual vs. Human Influencers: A Text Mining Approach
    Enzig, Joshua
    Guerreiro, Joao
    Loureiro, Sandra
    MARKETING IN A MULTICULTURAL AND VIBRANT WORLD, 2024 AMS WORLD MARKETING CONGRESS, 2024, : 45 - 58
  • [3] Python vs. R: a text mining approach for analyzing the research trends in scopus database
    Bhanot, Neeraj
    Singh, Harwinder
    Sharma, Divyansu
    Jain, Harshit
    Jain, Shreyansh
    arXiv, 2019,
  • [4] Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining
    Hettne, Kristina M.
    Williams, Antony J.
    van Mulligen, Erik M.
    Kleinjans, Jos
    Tkachenko, Valery
    Kors, Jan A.
    JOURNAL OF CHEMINFORMATICS, 2010, 2
  • [5] Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining
    Kristina M Hettne
    Antony J Williams
    Erik M van Mulligen
    Jos Kleinjans
    Valery Tkachenko
    Jan A Kors
    Journal of Cheminformatics, 2
  • [6] Erratum to: Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining
    Kristina M Hettne
    Antony J Williams
    Erik M van Mulligen
    Jos Kleinjans
    Valery Tkachenko
    Jan A Kors
    Journal of Cheminformatics, 2 (1)
  • [7] Life priorities in the HIV-positive Asians: a text-mining analysis in young vs. old generation
    Chen, Wei-Ti
    Barbour, Russell
    AIDS CARE-PSYCHOLOGICAL AND SOCIO-MEDICAL ASPECTS OF AIDS/HIV, 2017, 29 (04): : 507 - 510
  • [8] Investigating Consumer Values of Secondhand Fashion Consumption in the Mass Market vs. Luxury Market: A Text-Mining Approach
    ul Hasan, H. M. Rakib
    Lang, Chunmin
    Xia, Sibei
    SUSTAINABILITY, 2023, 15 (01)
  • [9] A Comparison Between two Approaches to Identify Opioid Use Problems: ICD-9 vs. Text-Mining Approach
    Alzeer, Abdullah H.
    Patel, Jay
    Dixon, Brian E.
    Jones, Josette F.
    Bair, Matthew J.
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 455 - 456