Previous research on detecting risky online behavior has been rather scattered, typically identifying single risks in online samples. To our knowledge, the presented research is the first that presents a process of building models that can efficiently detect the following four online risky behavior: (1) aggression, harassment, hate; (2) mental health; (3) use of alcohol, and drugs; and (4) sexting. Furthermore, the corpora in this research are unique because of the usage of private instant messaging conversations in the Czech language provided by adolescents. The combination of publicly unavailable and unique data with high-quality annotations of specific psychological phenomena allowed us for precise detection using transformer machine learning models that can handle sequential data and involve the context of utterances. The impact of the context length and text augmentation on model efficiency is discussed in detail. The final model provides promising results with an acceptable F1 score. Therefore, we believe that the model could be used in various applications, e.g., parental applications, chatbots, or services provided by Internet providers. Future research could investigate the usage of the model in other languages.
机构:
Sun Yat Sen Univ, Sch Informat Management, Guangzhou 510006, Guangdong, Peoples R China
Guangzhou Higher Educ Mega Ctr, 132,Waihuan East Rd, Guangzhou, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Informat Management, Guangzhou 510006, Guangdong, Peoples R China
Li, Jing
Zhang, Shiqi
论文数: 0引用数: 0
h-index: 0
机构:
Wuhan Inst Technol, Sch Management, Wuhan 430205, Hubei, Peoples R ChinaSun Yat Sen Univ, Sch Informat Management, Guangzhou 510006, Guangdong, Peoples R China
Zhang, Shiqi
Ao, Wenting
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Univ, Sch Hist & Arch, Kunming 650091, Yunnan, Peoples R China
2,Cuihu North Rd,Huashan St, Kunming, Yunnan, Peoples R ChinaSun Yat Sen Univ, Sch Informat Management, Guangzhou 510006, Guangdong, Peoples R China
机构:
Rutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Grover, Karan
Pecor, Keith
论文数: 0引用数: 0
h-index: 0
机构:
Coll New Jersey, Dept Biol, Ewing, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Pecor, Keith
Malkowski, Michael
论文数: 0引用数: 0
h-index: 0
机构:
Union City High Sch, Union, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Malkowski, Michael
Kang, Lilia
论文数: 0引用数: 0
h-index: 0
机构:
Commun High Sch, Wall Township, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Kang, Lilia
Machado, Sasha
论文数: 0引用数: 0
h-index: 0
机构:
Emerson Jr Sr High Sch, Emerson, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Machado, Sasha
Lulla, Roshni
论文数: 0引用数: 0
h-index: 0
机构:
Montgomery High Sch, Skillman, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Lulla, Roshni
Heisey, David
论文数: 0引用数: 0
h-index: 0
机构:
Scotch Plains Fanwood High Sch, Scotch Plains, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Heisey, David
Ming, Xue
论文数: 0引用数: 0
h-index: 0
机构:
Rutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
Seton Hall Univ, JFK Med Ctr, New Jersey Neurosci Inst, Sleep Med Div, Edison, NJ USARutgers Biomed & Hlth Sci, New Jersey Med Sch, Dept Neurosci, Newark, NJ USA
机构:
Carnegie Mellon Univ, Human Comp Interact Inst, Pittsburgh, PA 15213 USACarnegie Mellon Univ, Human Comp Interact Inst, Pittsburgh, PA 15213 USA
Nguyen, Duyen T.
Fussell, Susan R.
论文数: 0引用数: 0
h-index: 0
机构:
Cornell Univ, Dept Commun, Ithaca, NY USA
Cornell Univ, Dept Informat Sci, Ithaca, NY USACarnegie Mellon Univ, Human Comp Interact Inst, Pittsburgh, PA 15213 USA