File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

TitleModel architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images
Authors
KeywordsArtificial intelligence
Convolutional neural network
Digital pathology
Lung cancer
Issue Date2022
Citation
Informatics in Medicine Unlocked, 2022, v. 28, article no. 100850 How to Cite?
AbstractRecent advancements in Artificial-Intelligence-based computer vision systems demonstrated impressive image and pattern recognition capabilities. A special class of neural networks known as Convolutional Neural Networks are used in a wide variety of computer vision tasks such as image classification, object detection and autonomous driving. There is a huge potential to adopt such technology in the domain of pathology. Image data in pathology are considerably larger in size than in typical image recognition problems. Dissecting the image into smaller bits, known as tiling, are often carried out. This paper aims to compare and contrast common model architectures and input tile sizes systematically and find the optimal configuration in the context of lung cancer classification problems. A dataset composed of 87 annotated whole slide images of lung cancer specimens were collected and annotated by two pathologists. Annotated areas were grouped into four classes (Tumor area, Non-tumor area, Necrosis area, and Immune cells). Annotations were converted into labelled tiles at different tile sizes, from 296 to 10000 pixels, (74–2500 μm). The problem was framed as a supervised 4-class classification problem using deep learning. For each tile size, three models, VGG19, InceptionResnetv2 and EfficientNet b3, were trained. Model performances were measured on holdout dataset using standard quantitative metrics including F1-score and AUC-ROC. Our best model instance with tile size at 500 × 500 pixels (125 × 125 μm) achieved an F1-score at 0.9685 and AUC-ROC score at 0.9627. Our results showed that tile size had a significant impact on model performance. The optimal tile size was between 500 and 1000 pixel (125–250 μm) after both quantitative and qualitative assessments. VGG19 marginally outperformed other model architectures.
Persistent Identifierhttp://hdl.handle.net/10722/343358
ISSN
2023 SCImago Journal Rankings: 0.758

 

DC FieldValueLanguage
dc.contributor.authorLee, Angus Lang Sun-
dc.contributor.authorTo, Curtis Chun Kit-
dc.contributor.authorLee, Alfred Lok Hang-
dc.contributor.authorLi, Joshua Jing Xi-
dc.contributor.authorChan, Ronald Cheong Kin-
dc.date.accessioned2024-05-10T09:07:27Z-
dc.date.available2024-05-10T09:07:27Z-
dc.date.issued2022-
dc.identifier.citationInformatics in Medicine Unlocked, 2022, v. 28, article no. 100850-
dc.identifier.issn2352-9148-
dc.identifier.urihttp://hdl.handle.net/10722/343358-
dc.description.abstractRecent advancements in Artificial-Intelligence-based computer vision systems demonstrated impressive image and pattern recognition capabilities. A special class of neural networks known as Convolutional Neural Networks are used in a wide variety of computer vision tasks such as image classification, object detection and autonomous driving. There is a huge potential to adopt such technology in the domain of pathology. Image data in pathology are considerably larger in size than in typical image recognition problems. Dissecting the image into smaller bits, known as tiling, are often carried out. This paper aims to compare and contrast common model architectures and input tile sizes systematically and find the optimal configuration in the context of lung cancer classification problems. A dataset composed of 87 annotated whole slide images of lung cancer specimens were collected and annotated by two pathologists. Annotated areas were grouped into four classes (Tumor area, Non-tumor area, Necrosis area, and Immune cells). Annotations were converted into labelled tiles at different tile sizes, from 296 to 10000 pixels, (74–2500 μm). The problem was framed as a supervised 4-class classification problem using deep learning. For each tile size, three models, VGG19, InceptionResnetv2 and EfficientNet b3, were trained. Model performances were measured on holdout dataset using standard quantitative metrics including F1-score and AUC-ROC. Our best model instance with tile size at 500 × 500 pixels (125 × 125 μm) achieved an F1-score at 0.9685 and AUC-ROC score at 0.9627. Our results showed that tile size had a significant impact on model performance. The optimal tile size was between 500 and 1000 pixel (125–250 μm) after both quantitative and qualitative assessments. VGG19 marginally outperformed other model architectures.-
dc.languageeng-
dc.relation.ispartofInformatics in Medicine Unlocked-
dc.subjectArtificial intelligence-
dc.subjectConvolutional neural network-
dc.subjectDigital pathology-
dc.subjectLung cancer-
dc.titleModel architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1016/j.imu.2022.100850-
dc.identifier.scopuseid_2-s2.0-85123002833-
dc.identifier.volume28-
dc.identifier.spagearticle no. 100850-
dc.identifier.epagearticle no. 100850-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats