Natiolectal Variation in Dutch Morphosyntax: A Large-Scale, Data-Driven Perspective

Cited by: 0
Authors
De Troij, Robbert [1, 2]
Grondelaers, Stefan [3, 4, 5]
Speelman, Dirk [1]
Affiliations
[1] Katholieke Univ Leuven, QLVL, Dept Linguist, Blijde Inkomststr 21,POB 3308, B-3000 Leuven, Belgium
[2] Radboud Univ Nijmegen, Nijmegen, Netherlands
[3] Meertens Inst Amsterdam, Amsterdam, Netherlands
[4] Meertens Inst, Oudezijds Achterburgwal 185, NL-1012 DK Amsterdam, Netherlands
[5] Radboud Univ Nijmegen, Ctr Language Studies, NL-6500 HD Nijmegen, Netherlands
Keywords
computational linguistics; corpus linguistics; Dutch; grammatical variation; natiolectal variation; parallel corpus; LANGUAGE; DEFINITENESS; FLEMISH
DOI
10.1017/S1470542722000071
Chinese Library Classification (CLC) number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
In this article, we report a large-scale corpus study aimed at tackling the (controversial) question to what extent the European national varieties of Dutch, that is, Belgian and Netherlandic Dutch, exhibit morphosyntactic differences. Instead of relying on a manual selection of cases of morphosyntactic variation, we first marshal large bilingual parallel corpora and machine translation software to identify semiautomatically, in an extensively data-driven fashion, loci of variation from various "corners" of Dutch grammar. We then gauge the distribution of constructional alternatives in a nationally as well as stylistically stratified corpus for a representative selection of twenty alternation patterns. We find that natiolectal variation in the grammar of Dutch is far more prevalent than often assumed, especially in less edited text types, and that it shows up in inflection phenomena, lexically conditioned syntactic variation, and pure word order permutations. Another key finding is that many cases of synchronic probabilistic asymmetries reflect a diachronic difference between the two varieties: Netherlandic Dutch often tends to be ahead in cases of ongoing grammatical change, with Belgian Dutch holding on somewhat longer to obsolescent features of the grammar.
Pages: 1-68
Number of pages: 68