Article: Spatial regionalization based on optimal information compression

Title: Spatial regionalization based on optimal information compression
Authors: Kirkley, AW
Issue Date: 2022
Citation: Communications Physics, 2022, v. 5, article no. 249
Abstract: Regionalization, spatially contiguous clustering, provides a means to reduce the effect of noise in sampled data and identify homogeneous areas for policy development among many other applications. Existing regionalization methods require user input such as the number of regions or a similarity measure between regions, which does not allow for the extraction of the natural regions defined solely by the data itself. Here we view the problem of regionalization as one of data compression and develop an efficient, parameter-free regionalization algorithm based on the minimum description length principle. We demonstrate that our method is capable of recovering planted spatial clusters in noisy synthetic data, and that it can meaningfully coarse-grain real demographic data. Using our description length formulation, we find that spatial ethnoracial data in U.S. metropolitan areas has become less compressible over the period from 1980 to 2010, reflecting the rising complexity of urban segregation patterns in these metros.
Persistent Identifier: http://hdl.handle.net/10722/319133
ISI Accession Number ID: WOS:000866208400001

 

DC Field | Value | Language
dc.contributor.author | Kirkley, AW | -
dc.date.accessioned | 2022-10-14T05:07:43Z | -
dc.date.available | 2022-10-14T05:07:43Z | -
dc.date.issued | 2022 | -
dc.identifier.citation | Communications Physics, 2022, v. 5, article no. 249 | -
dc.identifier.uri | http://hdl.handle.net/10722/319133 | -
dc.description.abstract | Regionalization, spatially contiguous clustering, provides a means to reduce the effect of noise in sampled data and identify homogeneous areas for policy development among many other applications. Existing regionalization methods require user input such as the number of regions or a similarity measure between regions, which does not allow for the extraction of the natural regions defined solely by the data itself. Here we view the problem of regionalization as one of data compression and develop an efficient, parameter-free regionalization algorithm based on the minimum description length principle. We demonstrate that our method is capable of recovering planted spatial clusters in noisy synthetic data, and that it can meaningfully coarse-grain real demographic data. Using our description length formulation, we find that spatial ethnoracial data in U.S. metropolitan areas has become less compressible over the period from 1980 to 2010, reflecting the rising complexity of urban segregation patterns in these metros. | -
dc.language | eng | -
dc.relation.ispartof | Communications Physics | -
dc.title | Spatial regionalization based on optimal information compression | -
dc.type | Article | -
dc.identifier.email | Kirkley, AW: akirkley@hku.hk | -
dc.identifier.authority | Kirkley, AW=rp03010 | -
dc.identifier.doi | 10.1038/s42005-022-01029-4 | -
dc.identifier.hkuros | 338763 | -
dc.identifier.volume | 5 | -
dc.identifier.spage | article no. 249 | -
dc.identifier.epage | article no. 249 | -
dc.identifier.isi | WOS:000866208400001 | -
