Generalized model for mapping bicycle ridership with crowdsourced data

被引:31
|
作者
Nelson, Trisalyn [1 ]
Roy, Avipsa [2 ]
Ferster, Colin [3 ]
Fischer, Jaimy [4 ]
Brum-Bastos, Vanessa [5 ]
Laberee, Karen [3 ]
Yu, Hanchen [2 ]
Winters, Meghan [4 ]
机构
[1] Univ Calif Los Angeles, Dept Geog, Los Angeles, CA 90024 USA
[2] Arizona State Univ, Sch Geog Sci & Urban Planning, Tempe, AZ 85287 USA
[3] Univ Victoria, Dept Geog, Victoria, BC, Canada
[4] Simon Fraser Univ, Fac Hlth Sci, Burnaby, BC, Canada
[5] Univ St Andrews, Dept Geog, St Andrews, Fife, Scotland
关键词
Bias-correction; LASSO; Big data; Bicycling ridership; Exposure; Strava; ROUTE CHOICE MODEL; BUILT ENVIRONMENTS; PHYSICAL-ACTIVITY; TRANSPORTATION; INCOME; INFRASTRUCTURE; INTERSECTIONS; EXPOSURE; WALKING; HEALTH;
D O I
10.1016/j.trc.2021.102981
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Fitness apps, such as Strava, are a growing source of data for mapping bicycling ridership, due to large samples and high resolution. To overcome bias introduced by data generated from only fitness app users, researchers build statistical models that predict total bicycling by integrating Strava data with official counts and geographic data. However, studies conducted on single cities provide limited insight on best practices for modeling bicycling with Strava as generalizability is difficult to assess. Our goal is to develop a generalized approach to modeling bicycling ridership using Strava data. In doing so we enable detailed mapping that is more inclusive of all bicyclists and will support more equitable decision-making across cities. We used Strava data, official counts, and geographic data to model Average Annual Daily Bicycling (AADB) in five cities: Boulder, Ottawa, Phoenix, San Francisco, and Victoria. Using a machine learning approach, LASSO, we identify variables important for predicting ridership in all cities, and independently in each city. Using the LASSO-selected variables as predictors in Poisson regression, we built generalized and city-specific models and compared accuracy. Our results indicate generalized prediction of bicycling ridership on a road segment in concert with Strava data should include the following variables: number of Strava riders, percentage of Strava trips categorized as commuting, bicycling safety, and income. Inclusion of city-specific variables increased model performance, as the R2 for generalized and city-specific models ranged from 0.08?0.80 and 0.68?0.92, respectively. However, model accuracy was influenced most by the official count data used for model training. For best results, official count data should capture diverse street conditions, including low ridership areas. Counts collected continuously over a long time period, rather than at peak periods, may also improve modeling. Modeling bicycling from Strava and geographic data enables mapping of bicycling ridership that is more inclusive of all bicyclists and better able to support decision-making.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] CROWDSOURCED MAPPING - LETTING AMATEURS INTO THE TEMPLE?
    McCullagh, Michael
    Jackson, Mike
    ISPRS HANNOVER WORKSHOP 2013, 2013, 40-1 (W-1): : 399 - 432
  • [42] Association between network characteristics and bicycle ridership across a large metropolitan region
    Beck, Ben
    Pettit, Chris
    Winters, Meghan
    Nelson, Trisalyn
    Vu, Hai L.
    Nice, Kerry
    Seneviratne, Sachith
    Saberi, Meead
    INTERNATIONAL JOURNAL OF SUSTAINABLE TRANSPORTATION, 2024, 18 (04) : 344 - 355
  • [43] Shared Bicycle-on-Transit Perspective on Rail Transit Station Accessibility and Ridership
    Chen, Ziyi
    Chen, Xiaohong
    CICTP 2022: INTELLIGENT, GREEN, AND CONNECTED TRANSPORTATION, 2022, : 974 - 983
  • [44] Effects of Trip Generation and Attraction Attributes on Bicycle-Sharing System Ridership
    Hu, Shou-Ren
    Liu, Chao-Tang
    JOURNAL OF URBAN PLANNING AND DEVELOPMENT, 2022, 148 (02)
  • [45] Crowdsourced cartography: mapping experience and knowledge
    Dodge, Martin
    Kitchin, Rob
    ENVIRONMENT AND PLANNING A, 2013, 45 (01) : 19 - 36
  • [46] A generalized model for mapping sunflower areas using Sentinel-1 SAR data
    Qadir, Abdul
    Skakun, Sergii
    Kussul, Nataliia
    Shelestov, Andrii
    Becker-Reshef, Inbal
    REMOTE SENSING OF ENVIRONMENT, 2024, 306
  • [47] On the generalized Poisson regression mixture model for mapping quantitative trait loci with count data
    Cui, Yuehua
    Kim, Dong-Yun
    Zhu, Jun
    GENETICS, 2006, 174 (04) : 2159 - 2172
  • [48] The role of crowdsourced data, participatory decision-making and mapping of flood related events
    Dixon, Barnali
    Johns, Rebecca A.
    Fernandez, Amada
    APPLIED GEOGRAPHY, 2021, 128
  • [49] Position Validation in Crowdsourced Accessibility Mapping
    Rice, Rebecca M.
    Aburizaiza, Ahmad O.
    Rice, Matthew T.
    Qin, Han
    CARTOGRAPHICA, 2016, 51 (02): : 55 - 66
  • [50] Educational Aspects of Crowdsourced Noise Mapping
    Podor, Andrea
    Zentai, Laszlo
    ADVANCES IN CARTOGRAPHY AND GISCIENCE, 2017, : 35 - 46