File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Machine-learning algorithms to code public health spending accounts

TitleMachine-learning algorithms to code public health spending accounts
Authors
KeywordsHealth finance
Machine learning
Public health
Issue Date2017
Citation
Public Health Reports, 2017, v. 132, n. 3, p. 350-356 How to Cite?
AbstractObjectives: Government public health expenditure data sets require time- and labor-intensive manipulation to summarize results that public health policy makers can use. Our objective was to compare the performances of machine-learning algorithms with manual classification of public health expenditures to determine if machines could provide a faster, cheaper alternative to manual classification. Methods: We used machine-learning algorithms to replicate the process of manually classifying state public health expenditures, using the standardized public health spending categories from the Foundational Public Health Services model and a large data set from the US Census Bureau. We obtained a data set of 1.9 million individual expenditure items from 2000 to 2013. We collapsed these data into 147 280 summary expenditure records, and we followed a standardized method of manually classifying each expenditure record as public health, maybe public health, or not public health. We then trained 9 machine-learning algorithms to replicate the manual process. We calculated recall, precision, and coverage rates to measure the performance of individual and ensembled algorithms. Results: Compared with manual classification, the machine-learning random forests algorithm produced 84% recall and 91% precision. With algorithm ensembling, we achieved our target criterion of 90% recall by using a consensus ensemble of ≥6 algorithms while still retaining 93% coverage, leaving only 7% of the summary expenditure records unclassified. Conclusions: Machine learning can be a time- and cost-saving tool for estimating public health spending in the United States. It can be used with standardized public health spending categories based on the Foundational Public Health Services model to help parse public health expenditure information from other types of health-related spending, provide data that are more comparable across public health organizations, and evaluate the impact of evidence-based public health resource allocation.
Persistent Identifierhttp://hdl.handle.net/10722/327151
ISSN
2023 Impact Factor: 3.0
2023 SCImago Journal Rankings: 1.292
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorBrady, Eoghan S.-
dc.contributor.authorLeider, Jonathon P.-
dc.contributor.authorResnick, Beth A.-
dc.contributor.authorNatalia Alfonso, Y.-
dc.contributor.authorBishai, David-
dc.date.accessioned2023-03-31T05:29:19Z-
dc.date.available2023-03-31T05:29:19Z-
dc.date.issued2017-
dc.identifier.citationPublic Health Reports, 2017, v. 132, n. 3, p. 350-356-
dc.identifier.issn0033-3549-
dc.identifier.urihttp://hdl.handle.net/10722/327151-
dc.description.abstractObjectives: Government public health expenditure data sets require time- and labor-intensive manipulation to summarize results that public health policy makers can use. Our objective was to compare the performances of machine-learning algorithms with manual classification of public health expenditures to determine if machines could provide a faster, cheaper alternative to manual classification. Methods: We used machine-learning algorithms to replicate the process of manually classifying state public health expenditures, using the standardized public health spending categories from the Foundational Public Health Services model and a large data set from the US Census Bureau. We obtained a data set of 1.9 million individual expenditure items from 2000 to 2013. We collapsed these data into 147 280 summary expenditure records, and we followed a standardized method of manually classifying each expenditure record as public health, maybe public health, or not public health. We then trained 9 machine-learning algorithms to replicate the manual process. We calculated recall, precision, and coverage rates to measure the performance of individual and ensembled algorithms. Results: Compared with manual classification, the machine-learning random forests algorithm produced 84% recall and 91% precision. With algorithm ensembling, we achieved our target criterion of 90% recall by using a consensus ensemble of ≥6 algorithms while still retaining 93% coverage, leaving only 7% of the summary expenditure records unclassified. Conclusions: Machine learning can be a time- and cost-saving tool for estimating public health spending in the United States. It can be used with standardized public health spending categories based on the Foundational Public Health Services model to help parse public health expenditure information from other types of health-related spending, provide data that are more comparable across public health organizations, and evaluate the impact of evidence-based public health resource allocation.-
dc.languageeng-
dc.relation.ispartofPublic Health Reports-
dc.subjectHealth finance-
dc.subjectMachine learning-
dc.subjectPublic health-
dc.titleMachine-learning algorithms to code public health spending accounts-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1177/0033354917700356-
dc.identifier.pmid28363034-
dc.identifier.scopuseid_2-s2.0-85021859548-
dc.identifier.volume132-
dc.identifier.issue3-
dc.identifier.spage350-
dc.identifier.epage356-
dc.identifier.eissn1468-2877-
dc.identifier.isiWOS:000400520100012-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats