On Mining Crowd-based Speech Documentation

被引:0
|
作者
Moslehi, Parisa [1 ]
Adams, Bram [2 ]
Rilling, Juergen [1 ]
机构
[1] Concordia Univ, Montreal, PQ, Canada
[2] Ecole Polytech Montreal, Montreal, PQ, Canada
关键词
Crowd-based documentation; mining video content; speech analysis; Information Extraction; software documentation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the globalization of software development, relevant documentation of a project, such as requirements and design documents, often still is missing, incomplete or outdated. However, parts of that documentation can be found outside the project, where it is fragmented across hundreds of textual web documents like blog posts, email messages and forum posts, as well as multimedia documents such as screencasts and podcasts. Since dissecting and filtering multimedia information based on its relevancy to a given project is an inherently difficult task, it is necessary to provide an automated approach for mining this crowd-based documentation. In this paper, we are interested in mining the speech part of YouTube screencasts, since this part typically contains the rationale and insights of a screencast. We introduce a methodology that transcribes and analyzes the transcribed text using various Information Extraction (IE) techniques, and present a case study to illustrate the applicability of our mining methodology. In this case study, we extract use case scenarios from WordPress tutorial videos and show how their content can supplement existing documentation. We then evaluate how well existing rankings of video content are able to pinpoint the most relevant videos for a given scenario.
引用
收藏
页码:259 / 268
页数:10
相关论文
共 50 条
  • [1] Crowd-Based Mining of Reusable Process Model Patterns
    Rodriguez, Carlos
    Daniel, Florian
    Casati, Fabio
    [J]. BUSINESS PROCESS MANAGEMENT, BPM 2014, 2014, 8659 : 51 - 66
  • [2] Crowd-Based Deduplication: An Adaptive Approach
    Wang, Sibo
    Xiao, Xiaokui
    Lee, Chun-Hee
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 1263 - 1277
  • [3] Crowd-based ecofriendly trip planning
    Tomaras, Dimitrios
    Kalogeraki, Vana
    Liebig, Thomas
    Gunopulos, Dimitrios
    [J]. 2018 19TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2018), 2018, : 24 - 33
  • [4] Crowd-Based Data Sourcing (Abstract)
    Milo, Tova
    [J]. DATABASES IN NETWORKED INFORMATION SYSTEMS, 2011, 7108 : 64 - 67
  • [5] Crowd-based digital sexual health
    Joseph D. Tucker
    Suzanne Day
    [J]. Nature Reviews Urology, 2020, 17 : 135 - 136
  • [6] Crowd-based velocimetry for surface flows
    Yang, Yao-Yu
    Kang, Shih-Chung
    [J]. ADVANCED ENGINEERING INFORMATICS, 2017, 32 : 275 - 286
  • [7] CPD: Crowd-based Pothole Detection
    Wirthmueller, Florian
    Hipp, Jochen
    Sattler, Kai-Uwe
    Reichert, Manfred
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS (VEHITS 2019), 2019, : 33 - 42
  • [8] Crowd-based digital sexual health
    Tucker, Joseph D.
    Day, Suzanne
    [J]. NATURE REVIEWS UROLOGY, 2020, 17 (03) : 135 - 136
  • [9] Crowd-based enhancement of chemical diversity
    Charlotte Harrison
    [J]. Nature Reviews Drug Discovery, 2012, 11 (1) : 21 - 21
  • [10] Justice for the Crowd: Organizational Justice and Turnover in Crowd-Based Labor
    Song, Xiaochuan
    Lowman, Graham H.
    Harms, Peter
    [J]. ADMINISTRATIVE SCIENCES, 2020, 10 (04)