Research of News Text with Word Frequency Statistics and User Information

被引:0
|
作者
Liu, Shan [1 ]
Huang, Kun [1 ]
Chai, Jianping [1 ]
机构
[1] Commun Univ China, Sch Informat Engn, Beijing, Peoples R China
关键词
word frequency; user information; tag model; TF-IDF algorithm;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper focuses on the news collection of a single topic, analyzes the large number of keywords and phrases in the news, uses the traditional TF-IDF algorithm based on the word frequency, adds the weight to the tags, extracts the keywords which have certain use value in the news, and uses the tag selection formula to tag the news. In addition, collect user reviews on the Internet news portal to create user comments information tags library for the news tag group to make the news tags more fit user needs. In this paper, the value of the tag is measured by the recall rate and the F1 score, and the tag based on the news content is analyzed according to the intermediate variable in the model operation. It also summarizes the general content of the topic by analyzing the tags distribution of the news collection.
引用
收藏
页码:2633 / 2637
页数:5
相关论文
共 50 条
  • [21] EMERGING USER INTENTIONS: MATCHING USER QUERIES WITH TOPIC EVOLUTION IN NEWS TEXT STREAMS
    Valencia, Maria
    Lauth, Codrina
    Menasalvas, Ernestina
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2009, 17 : 59 - 80
  • [22] Semantics of text and information research
    Valette, Mathieu
    Slodzian, Monique
    [J]. REVUE FRANCAISE DE LINGUISTIQUE APPLIQUEE, 2008, 13 (01): : 119 - 133
  • [23] Analysis of Native and Non-native Speakers' English Compositions based on Word-frequency Distribution and Text Statistics
    Tsubaki, Hajime
    [J]. NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 57 - 61
  • [24] Analysis of native and non-native speakers' English compositions based on word-frequency distribution and text statistics
    Tsubaki, Hajime
    [J]. ACM International Conference Proceeding Series, 2019, : 57 - 61
  • [25] WORD RECOGNITION MEMORY AND FREQUENCY INFORMATION
    UNDERWOOD, BJ
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1972, 94 (03): : 276 - +
  • [26] A Topic Recognition Method of News Text Based on Word Embedding Enhancement
    Du, Qiming
    Li, Nan
    Liu, Wenfu
    Sun, Daozhu
    Yang, Shudan
    Yue, Feng
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [27] Word length and frequency distributions in different text genres
    Antic, G
    Stadlober, E
    Grzybek, P
    Kelih, E
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 310 - +
  • [28] WORD AND LOW FREQUENCY VOCABULARY IN DICTIONARIES OF THE TEXT EDITOR
    Lavoshnikova, Elina K.
    [J]. TOMSK STATE UNIVERSITY JOURNAL, 2018, (435): : 40 - 47
  • [29] Text Summarization of Indonesian Folklore with Word Frequency Concept
    Kartika, Luh Gede Surya
    Rinartha, Komang
    Siahaan, Daniel
    [J]. 2020 10TH ELECTRICAL POWER, ELECTRONICS, COMMUNICATIONS, CONTROLS AND INFORMATICS SEMINAR (EECCIS), 2020, : 259 - 262
  • [30] Text Data Augmentation Techniques for Word Embeddings in Fake News Classification
    Kapusta, Jozef
    Drzik, David
    Steflovic, Kirsten
    Nagy, Kitti Szabo
    [J]. IEEE ACCESS, 2024, 12 : 31538 - 31550