Domain Specific Language for Web Scraper Development

被引:0
|
作者
Arifanto, Randy [1 ]
Asnar, Yudistira D. W. [1 ]
Liem, M. M. Inggriani [1 ]
机构
[1] Bandung Inst Technol, Sch Elect Engn & Informat, Bandung, Indonesia
关键词
web scraping; scraper; domain specific languages;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The process of web scraping is done by using an application called scraper. Various tools have been built to facilitate the process of scraper development, but these tools can only be used by programmers, while users who need web scraping not only programmers but also non-programmers. Non programmer referred to in this research is someone who is able to run the computer through the shell and less experienced in programming. It takes a suitable approach in the construction of scrapers by non-programmers. Therefore, in this research will be built scrapingDSL, a domain specific languages that can be used by non-programmers to build scraper. Scrapers built using scrapingDSL must be able to do web scraping against all types of websites with different structures and should be easy to understand, write and modify. To build a scrapingDSL that can meet those needs, the researcher analyzed the tools used in web scraping to determine the right tool for the scrapingDSL engine, analyzed the website structure to determine the need for the scrapingDSL and the translator application, and analyzed the the website content retrieval flow to determine the required language scrapingDSL elements. At the end of the study a test was conducted to determine whether scrapingDSL could be used by non-programmers to build scrapers. Testing is done by requesting 12 non-programmer respondents to build scraper using scrapingDSL.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Domain Specific Language Approach on Model-driven Development of Web Services
    Nguyen, Viet-Cuong
    Qafmolla, Xhevi
    Richta, Karel
    [J]. ACTA POLYTECHNICA HUNGARICA, 2014, 11 (08) : 121 - 138
  • [2] WebCaL - a domain specific language for web caching
    Gulwani, S
    Tarachandani, A
    Gupta, D
    Sanghi, D
    Barreto, LP
    Muller, G
    Consel, C
    [J]. COMPUTER COMMUNICATIONS, 2001, 24 (02) : 191 - 201
  • [3] A Web Application Is a Domain-Specific Language
    Lorenz, David H.
    Rosenan, Boaz
    [J]. COMPANION PROCEEDINGS OF THE 2016 ACM SIGPLAN INTERNATIONAL CONFERENCE ON SYSTEMS, PROGRAMMING, LANGUAGES AND APPLICATIONS: SOFTWARE FOR HUMANITY (SPLASH COMPANION'16), 2016, : 35 - 36
  • [4] A Domain Specific Language for Web-based GIS
    Alvarado, Suilen H.
    Cortinas, Alejandro
    Luaces, Miguel R.
    Pedreira, Oscar
    Places, Angeles S.
    [J]. WEBIST: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2019, : 462 - 469
  • [5] A Domain Specific Language for the Development of Collaborative Systems
    Bibbo, Luis Mariano
    Garcia, Diego
    Pons, Claudia
    [J]. SCCC 2008: INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY, PROCEEDINGS, 2008, : 3 - 12
  • [6] Domain Specific Language for Smart Contract Development
    Woehrer, Maximilian
    Zdun, Uwe
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BLOCKCHAIN AND CRYPTOCURRENCY (IEEE ICBC), 2020,
  • [7] A domain-specific language for web APIs and services mashups
    Maximilien, E. Michael
    Wilkinson, Hernan
    Desai, Nirmit
    Tai, Stefan
    [J]. Service-Oriented Computing - ICSOC 2007, Proceedings, 2007, 4749 : 13 - 26
  • [8] Towards a Domain-Specific Language for the Renarration of Web Pages
    Prasad, Gollapudi V. R. J. Sai
    Chimalakonda, Sridhar
    Choppella, Venkatesh
    [J]. ISEC'18: PROCEEDINGS OF THE 11TH INNOVATIONS IN SOFTWARE ENGINEERING CONFERENCE, 2018,
  • [9] On Web-based Domain-Specific Language for Internet of Things
    Sneps-Sneppe, Manfred
    Namiot, Dmitry
    [J]. 2015 7TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2015, : 287 - 292
  • [10] Design of Domain Specific Language for Web Services QoS Constraints Definition
    Sikri, Monika
    [J]. INFORMATION TECHNOLOGY AND MOBILE COMMUNICATION, 2011, 147 : 411 - 416