The use of crowdsourced mobile data in estimating pedestrian and bicycle traffic: A systematic review
Tao Tao
Carnegie Mellon University
Greg Lindsey
University of Minnesota
Raphael Stern
University of Minnesota
Michael Levin
University of Minnesota
DOI: https://doi.org/10.5198/jtlu.2024.2315
Keywords: Strava, non-motorized traffic, direct demand model, traffic estimation
Abstract
To address the need for better non-motorized traffic data, policymakers and researchers are collaborating to develop new approaches and methods for estimating pedestrian and bicyclist traffic volumes. Crowdsourced mobile data, which has higher spatial and temporal coverage and lower collection costs than data collected through traditional approaches, may help improve pedestrian and bicyclist traffic estimation despite their limitations or biases. This systemic literature review documents how researchers have used crowdsourced mobile data to estimate pedestrian and bicyclist traffic volumes. We find that one source of commercial fitness application data (i.e., Strava) has been used much more frequently than other crowdsourced mobile data, and that most studies have used crowdsourced mobile data to estimate bicyclist volumes. Comparatively few studies have estimated pedestrian volumes. The most common approach to the use of crowdsourced counts is as independent variables in direct demand models. Variables constructed from crowdsourced mobile data not only have significant correlations with observed counts in statistical models but also have larger relative importance than other factors in machine learning models. Studies also show that including crowdsourced mobile data can significantly improve estimation performance. Future research directions include application of crowdsourced mobile data in more pedestrian traffic estimations, comparison of the performance of different crowdsourced mobile data, incorporation of multiple data sources, and expansion of the methods using crowdsourced mobile data for non-motorized traffic estimation.
References
Bhowmick, D., Saberi, M., Stevenson, M., Thompson, J., Winters, M., Nelson, T., … & Beck, B. (2022). A systematic scoping review of methods for estimating link-level bicycling volumes. Transport Reviews, 43(4), 622–651. https://doi.org/10.1080/01441647.2022.2147240
Camacho-Torregrosa, F. J., Llopis-Castelló, D., López-Maldonado, G., & García, A. (2021). An examination of the Strava usage rate—A parameter to estimate average annual daily bicycle volumes on rural roadways. Safety, 7(1), 8. https://doi.org/10.3390/safety7010008
Dadashova, B., & Griffin, G. (2020). Random parameter models for estimating statewide daily bicycle counts using crowdsourced data. Transportation Research Part D: Transport and Environ-ment, 84, 102368. https://doi.org/10.1016/j.trd.2020.102368
Dadashova, B., Griffin, G., Das, S., Turner, S., & Sherman, B. (2020). Estimation of average annual daily bicycle counts using crowdsourced Strava data. Transportation Research Record: Journal of the Transportation Research Board, 2674(11), 390–402. https://doi.org/10.1177/0361198120946016
Estellés-Arolas, E., & González-Ladrón-De-Guevara, F. (2012). Towards an integrated crowdsourcing definition. Journal of Information Science, 38(2), 189–200. https://doi.org/10.1177/0165551512437638
Ewing, R., & Cervero, R. (2010). Travel and the built environment. Journal of the American Planning Association, 76(3), 265–294. https://doi.org/10.1080/01944361003766766
FHWA. (2016). The traffic monitoring guide. Retrieved from https://www.fhwa.dot.gov/policyinformation/tmguide/tmg_fhwa_pl_17_003.pdf
FHWA. (2018). Summary of Travel Trends: 2017 National Household Travel Survey. Washington, DC: FHWA.
Gamez, C. (2022). Origins and destinations data export and download. Retrieved from https://stravametro.zendesk.com/hc/en-us/articles/8187514314263
Grant, M. J., & Booth, A. (2009). A typology of reviews: An analysis of 14 review types and associated methodologies. Health Information and Libraries Journal, 26(2), 91–108. https://doi.org/10.1111/j.1471-1842.2009.00848.x
Haworth, J. (2016). Investigating the potential of activity tracking app data to estimate cycle flows in urban areas. ISPRS — International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLI-B2, 515–519. https://doi.org/10.5194/isprsarchives-XLI-B2-515-2016
Huo, J., Fu, X., Liu, Z., & Zhang, Q. (2022). Short-term estimation and prediction of pedestrian density in urban hot spots based on mobile phone data. IEEE Transactions on Intelligent Trans-portation Systems, 23(8), 10827–10838. https://doi.org/10.1109/tits.2021.3096274
Jestico, B., Nelson, T., & Winters, M. (2016). Mapping ridership using crowdsourced cycling data. Journal of Transport Geography, 52, 90–97. https://doi.org/10.1016/j.jtrangeo.2016.03.006
Kothuri, S., Broach, J., McNeil, N., Hyun, K., Mattingly, S., Miah, M. M., … & Proulx, F. (2022). Exploring data fusion techniques to estimate network-wide bicycle volumes. Retrieved from https://pdxscholar.library.pdx.edu/trec_reports/234/
Kwigizile, V., Oh, J.-S., & Kwayu, K. (2019). Integrating crowdsourced data with traditionally collected data to enhance estimation of bicycle exposure measure. Retrieved from https://rosap.ntl.bts.gov/view/dot/44138
Lee, K., & Sener, I. (2020a). Emerging data for pedestrian and bicycle monitoring: Sources and applications. Transportation Research Interdisciplinary Perspectives, 4, 100095. https://doi.org/10.1016/j.trip.2020.100095
Lee, K., & Sener, I. (2020b). Strava metro data for bicycle monitoring: A literature review. Transport Reviews, 41(1), 27–47. https://doi.org/10.1080/01441647.2020.1798558
Lin, Z., & Fan, W. D. (2020). Modeling bicycle volume using crowdsourced data from Strava smartphone application. International Journal of Transportation Science and Technology, 9(4), 334–343. https://doi.org/10.1016/j.ijtst.2020.03.003
Lißner, S., Francke, A., & Becker, T. (2018). Modeling cyclists traffic volume — Can bicycle planning benefit from smartphone-based data. Paper presented at the 7th Transport Research Arena TRA, April 16–19, Vienna, Austria.
Liu, F., Evans, J. E. J., & Rossi, T. (2012). Recent practices in regional modeling of nonmotorized travel. Transportation Research Record: Journal of the Transportation Research Board, 2303(1), 1–8. https://doi.org/10.3141/2303-01
Miah, M. M., Hyun, K. K., Mattingly, S. P., Broach, J., McNeil, N., & Kothuri, S. (2022). Challenges and opportunities of emerging data sources to estimate network-wide bike counts. Journal of Transportation Engineering, Part A: Systems. https://doi.org/10.1061/jtepbs.0000634
Miah, M. M., Hyun, K. K., Mattingly, S. P., & Khan, H. (2022). Estimation of daily bicycle traffic using machine and deep learning techniques. Transportation, 50, 1631–1684. https://doi.org/10.1007/s11116-022-10290-z
Molnar, C. (2020). Interpretable machine learning — A guide for making black box models explainable. lulu.com. Retrieved from https://christophm.github.io/interpretable-ml-book/
Munira, S. (2021). Fusing nonmotorized traffic data: A decision fusion framework. College Station, TX: Texas A&M University.
Munira, S., & Sener, I. N. (2017). Use of direct-demand modeling in estimating nonmotorized activity: A meta-analysis. College Station, TX: Texas A&M University.
Nelson, T., Ferster, C., Laberee, K., Fuller, D., & Winters, M. (2021). Crowdsourced data for bicycling research and practice. Transport Reviews, 41(1), 97–114. https://doi.org/10.1080/01441647.2020.1806943
Nelson, T., Roy, A., Ferster, C., Fischer, J., Brum-Bastos, V., Laberee, K., … & Winters, M. (2021). Generalized model for mapping bicycle ridership with crowdsourced data. Transportation Re-search Part C: Emerging Technologies, 125, 102981. https://doi.org/10.1016/j.trc.2021.102981
Nishi, K., Tsubouchi, K., & Shimosaka, M. (2014). Hourly pedestrian population trends estimation using location data from smartphones dealing with temporal and spatial sparsity. Paper presented at the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Infor-mation Systems, Nov. 1–4, Seattle, WA.
Ohlms, P. B., Dougald, L. E., & Macknight, H. E. (2019). Bicycle and pedestrian count programs: Scan of current U.S. practice. Transportation Research Record: Journal of the Transportation Re-search Board, 2673(3), 74–85. https://doi.org/10.1177/0361198119834924
Pogodzinska, S., Kiec, M., & D’Agostino, C. (2020). Bicycle traffic volume estimation based on GPS data. Transportation Research Procedia, 45, 874–881. https://doi.org/10.1016/j.trpro.2020.02.081
Proulx, F. (2016). Bicyclist exposure estimation using heterogeneous demand data sources. Berekely, CA: University of California.
Roll, J. (2018). Bicycle count data: What is it good for? A study of bicycle travel activity in central lane metropolitan planning organization. Retrieved from https://rosap.ntl.bts.gov/view/dot/36255
Roy, A., Nelson, T. A., Fotheringham, A. S., & Winters, M. (2019). Correcting bias in crowdsourced data to map bicycle ridership of all bicyclists. Urban Science, 3(2), 62. https://doi.org/10.3390/urbansci3020062
Saad, M., Abdel-Aty, M., Lee, J., & Cai, Q. (2019). Bicycle safety analysis at intersections from crowdsourced data. Transportation Research Record: Journal of the Transportation Research Board, 2673(4), 1–14. https://doi.org/10.1177/0361198119836764
Sanders, R. L., Frackelton, A., Gardner, S., Schneider, R., & Hintze, M. (2017). Ballpark method for estimating pedestrian and bicyclist exposure in Seattle, Washington. Transportation Research Record: Journal of the Transportation Research Board, 2605(1), 32–44. https://doi.org/10.3141/2605-03
Schneider, R. J., Schmitz, A., & Qin, X. (2021). Development and validation of a seven-county regional pedestrian volume model. Transportation Research Record: Journal of the Transportation Research Board, 2675(6), 352–368. https://doi.org/10.1177/0361198121992360
Singleton, P. A., Park, K., & Lee, D. H. (2021). Varying influences of the built environment on daily and hourly pedestrian crossing volumes at signalized intersections estimated from traffic signal controller event data. Journal of Transport Geography, 93, 103067. https://doi.org/10.1016/j.jtrangeo.2021.103067
Snyder, H. (2019). Literature review as a research methodology: An overview and guidelines. Journal of Business Research, 104, 333–339. https://doi.org/10.1016/j.jbusres.2019.07.039
Strauss, J., Miranda-Moreno, L. F., & Morency, P. (2015). Mapping cyclist activity and injury risk in a network combining smartphone GPS data and bicycle counts. Accident Analysis & Preven-tion, 83, 132–142. https://doi.org/10.1016/j.aap.2015.07.014
Strava. (2020). Strava announces Strava Metro, the largest active travel dataset on the planet, is now free and available to cities everywhere. Retrieved from https://blog.strava.com/press/metro/
StreetLight Data Inc. (2023a). StreetLight bicycle volume methodology and validation white paper. https://www.streetlightdata.com/whitepapers/
StreetLight Data Inc. (2023b). StreetLight pedestrian volume methodology and validation white paper. https://www.streetlightdata.com/whitepapers/
Turner, S., Hottenstein, A., & Shunk, G. (1997). Bicycle and pedestrian travel demand forecasting: Literature review. Austin, TX: Texas Department of Transportation. https://static.tti.tamu.edu/tti.tamu.edu/documents/1723-1.pdf
Turner, S., Martin, M., Griffin, G., Le, M., Das, S., Wang, R., … & Li, X. (2020). Exploring crowdsourced monitoring data for safety. Retrieved from https://safed.vtti.vt.edu/projects/exploring-crowdsourced-monitoring-data-for-safety/
Turner, S., Sener, I. N., Martin, M. E., Das, S., Hampshire, R. C., Fitzpatrick, K., … & Wijesundera, R. K. (2017). Synthesis of methods for estimating pedestrian and bicyclist exposure to risk at are-awide levels and on specific transportation facilities. Washington, DC: Federal Highway Admin-istration. https://rosap.ntl.bts.gov/view/dot/36098
Xiao, Y., & Watson, M. (2019). Guidance on conducting a systematic literature review. Journal of Planning Education and Research, 39(1), 93–112. https://doi.org/10.1177/0739456x17723971
Yasmin, S., Bhowmik, T., Rahman, M., & Eluru, N. (2021). Enhancing non-motorist safety by sim-ulating trip exposure using a transportation planning approach. Accident Analysis & Prevention, 156, 106128. https://doi.org/10.1016/j.aap.2021.106128