File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis

TitleDevelopment and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis
Authors
Keywordscase detection
electronic medical records
fall-related admissions
natural language processing
text mining
Issue Date8-Jul-2025
PublisherJMIR Publications Inc.
Citation
JMIR Aging, 2025, v. 8 How to Cite?
AbstractBackground: In order to address fall underestimation by the International Classification of Diseases (ICD) in clinical settings, information from clinical notes could be incorporated via natural language processing (NLP) as a possible solution. However, its application to inpatient notes has not been fully investigated. Objective: This study aims to develop and validate a rule-based NLP algorithm to identify falls based on inpatient admission notes from older patients. Methods: This retrospective study used 12-year electronic inpatient records of patients aged ≥65 years from public hospitals in Hong Kong. A random sample of 1000 patients was drawn to develop the NLP algorithm. Manual review was the gold standard for assessing the algorithm’s performance, with sensitivity, specificity, precision, and F1-score calculated at the record, episode, and patient levels. In addition, the study compared the number of falls identified by ICD codes and clinical notes independently and combined. Results: Our rule-based NLP algorithm showed excellent performance, with a sensitivity, specificity, precision, and F1-score of 93.3%, 99.0%, 87.5%, and 0.903 at the record and episode levels, and 92.9%, 98.3%, 89.7%, and 0.912 at the patient level. The combined identification strategy using ICD codes and the NLP method provided the most comprehensive capture of fall-related episodes and fallers. Conclusions: The NLP method proved efficient and accurate in detecting falls from clinical notes in inpatient episodes. For comprehensive capture of fall episodes and fallers, we recommend the combined use of the NLP algorithm and ICD codes, which should be applied in future fall epidemiology studies and clinical practice for identifying high-risk groups of fall interventions.
Persistent Identifierhttp://hdl.handle.net/10722/369507

 

DC FieldValueLanguage
dc.contributor.authorQian, Xing Xing-
dc.contributor.authorChau, Pui Hing-
dc.contributor.authorFong, Daniel Y.T.-
dc.contributor.authorHo, Mandy-
dc.contributor.authorWoo, Jean-
dc.date.accessioned2026-01-27T00:36:08Z-
dc.date.available2026-01-27T00:36:08Z-
dc.date.issued2025-07-08-
dc.identifier.citationJMIR Aging, 2025, v. 8-
dc.identifier.urihttp://hdl.handle.net/10722/369507-
dc.description.abstractBackground: In order to address fall underestimation by the International Classification of Diseases (ICD) in clinical settings, information from clinical notes could be incorporated via natural language processing (NLP) as a possible solution. However, its application to inpatient notes has not been fully investigated. Objective: This study aims to develop and validate a rule-based NLP algorithm to identify falls based on inpatient admission notes from older patients. Methods: This retrospective study used 12-year electronic inpatient records of patients aged ≥65 years from public hospitals in Hong Kong. A random sample of 1000 patients was drawn to develop the NLP algorithm. Manual review was the gold standard for assessing the algorithm’s performance, with sensitivity, specificity, precision, and F1-score calculated at the record, episode, and patient levels. In addition, the study compared the number of falls identified by ICD codes and clinical notes independently and combined. Results: Our rule-based NLP algorithm showed excellent performance, with a sensitivity, specificity, precision, and F1-score of 93.3%, 99.0%, 87.5%, and 0.903 at the record and episode levels, and 92.9%, 98.3%, 89.7%, and 0.912 at the patient level. The combined identification strategy using ICD codes and the NLP method provided the most comprehensive capture of fall-related episodes and fallers. Conclusions: The NLP method proved efficient and accurate in detecting falls from clinical notes in inpatient episodes. For comprehensive capture of fall episodes and fallers, we recommend the combined use of the NLP algorithm and ICD codes, which should be applied in future fall epidemiology studies and clinical practice for identifying high-risk groups of fall interventions.-
dc.languageeng-
dc.publisherJMIR Publications Inc.-
dc.relation.ispartofJMIR Aging-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectcase detection-
dc.subjectelectronic medical records-
dc.subjectfall-related admissions-
dc.subjectnatural language processing-
dc.subjecttext mining-
dc.titleDevelopment and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis-
dc.typeArticle-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.2196/65195-
dc.identifier.scopuseid_2-s2.0-105026825386-
dc.identifier.volume8-
dc.identifier.eissn2561-7605-
dc.identifier.issnl2561-7605-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats