Populated Polygons to Networks: A Population-Centric Approach to Spatial Network Allocation
Gaboardi, James D. (James David) (author)
Folch, David C. (professor directing dissertation)
Brusco, Michael J. (university representative)
Horner, Mark W. (committee member)
Uejio, Christopher K. (committee member)
Florida State University (degree granting institution)
College of Social Sciences and Public Policy (degree granting college)
Department of Geography (degree granting department)
2019
text
doctoral thesis
This dissertation establishes an original solution for allocating populations onto networks, which is demonstrated through empirical examples within the comprising census geographies of a single census tract and the comprising census geographies for an entire county. The novel method, populated polygons to networks (pp2n), is shown to perform as accurately as a current state-of-the-art method, while being less computationally complex. Benchmark datasets are utilized to represent household-level population distributions. Datasets generated from the methods of network allocation are applied to optimal facility location modeling scenarios. Networks are an underlying part of the human experience and, as such, attention must be given to their study in the spatial analysis of anthropocentric phenomena. However, transformations in spatial data must frequently be performed in order to allow for the integration of original disparate data formats. Such is often the case with spatial population data, which are generally available as polygons. As a means to build network-based models for analysis, certain methods have been developed for allocating populations onto networks for the purpose of calculating origin-to-destination cost (distance) matrices. Two of these methods include (1) simply snapping polygon centroids onto the nearest network segment and (2) dividing population values by area and proximity to the network. Here the new method, pp2n, is proposed that incorporates the strengths of both the existing methods, while mitigating their weaknesses. The traditional approach, the state-of-the-art approach, and the pp2n method are tested against a benchmark dataset representing population-weighted estimates for property parcels. It is shown that the pp2n method is less computationally complex in the worst-case scenario than the current state-of-the-art method and more representationally accurate that the traditional method. Further, in an empirical example within one census tract in Leon County, FL, the pp2n method is found to perform with comparable accuracy to the state-of-the-art approach when compared to both the traditional approach and the benchmark dataset. Also, it is shown that the algorithm for generating pp2n population weights runs in significantly less realtime. Extending the empirical example within a single census tract (and comprising geographies), another complete empirical example is performed on the full spatial extent of Leon County, Florida. Here the focus shifts from purely how the population data are being allocated to the network, to validating the spatial data utilized in modeling and understanding the inherent associated uncertainty. Permission to access a highly-restricted address data file, the Master Address File (MAF), was granted by the U.S. Census Bureau. Within this study, the MAF functions as an ultimate gold-standard benchmark to test all the methods used in this dissertation within the context of the 2010 Decennial Census. Disclosure and privacy are discussed and a critique is given for the method used to produce the population-weighted estimates for property parcels. It is then shown, as in the single census tract example, that the pp2n method performs as well as the state-of-the-art method, while doing so in substantially less runtime. Further, the property parcel dataset is validated as an acceptable surrogate for true housing units available from the MAF. Facility location modeling is utilized to determine the effects of the network allocation methods on optimal site selection. Following a review of mathematical programming and the uncertainty involved in spatial optimization, the network allocation methods are tested with four fundamental models within a spatial optimization framework: the location set covering problem, maximal covering location problem, p-median problem, and p-center problem. The linear integer programs are solved for each model, for each method at each spatial extent with 15 sets of parameters. In total, 780 models are solved to optimality. The results of the abstract population representation models are compared again to the population-weighted estimates for property parcels, which act as a surrogate for the benchmark truth of census microdata. Results show that the method of network allocation has a non-negligible effect on the solutions to facility location models. Specifically, optimal facility configurations of the location models are affected within the selected spatio-temporal study area: 2010 Leon County, FL.
geocompuatation, GIS, spatial networks, spatial optimization
July 12, 2019.
A Dissertation submitted to the Department of Geography in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
Includes bibliographical references.
David C. Folch, Professor Directing Dissertation; Michael J. Brusco, University Representative; Mark W. Horner, Committee Member; Chris K. Uejio, Committee Member.
Florida State University
2019_Summer_Gaboardi_fsu_0071E_15307