File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.imu.2022.100850
- Scopus: eid_2-s2.0-85123002833
- Find via
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images
Title | Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images |
---|---|
Authors | |
Keywords | Artificial intelligence Convolutional neural network Digital pathology Lung cancer |
Issue Date | 2022 |
Citation | Informatics in Medicine Unlocked, 2022, v. 28, article no. 100850 How to Cite? |
Abstract | Recent advancements in Artificial-Intelligence-based computer vision systems demonstrated impressive image and pattern recognition capabilities. A special class of neural networks known as Convolutional Neural Networks are used in a wide variety of computer vision tasks such as image classification, object detection and autonomous driving. There is a huge potential to adopt such technology in the domain of pathology. Image data in pathology are considerably larger in size than in typical image recognition problems. Dissecting the image into smaller bits, known as tiling, are often carried out. This paper aims to compare and contrast common model architectures and input tile sizes systematically and find the optimal configuration in the context of lung cancer classification problems. A dataset composed of 87 annotated whole slide images of lung cancer specimens were collected and annotated by two pathologists. Annotated areas were grouped into four classes (Tumor area, Non-tumor area, Necrosis area, and Immune cells). Annotations were converted into labelled tiles at different tile sizes, from 296 to 10000 pixels, (74–2500 μm). The problem was framed as a supervised 4-class classification problem using deep learning. For each tile size, three models, VGG19, InceptionResnetv2 and EfficientNet b3, were trained. Model performances were measured on holdout dataset using standard quantitative metrics including F1-score and AUC-ROC. Our best model instance with tile size at 500 × 500 pixels (125 × 125 μm) achieved an F1-score at 0.9685 and AUC-ROC score at 0.9627. Our results showed that tile size had a significant impact on model performance. The optimal tile size was between 500 and 1000 pixel (125–250 μm) after both quantitative and qualitative assessments. VGG19 marginally outperformed other model architectures. |
Persistent Identifier | http://hdl.handle.net/10722/343358 |
ISSN | 2023 SCImago Journal Rankings: 0.758 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Angus Lang Sun | - |
dc.contributor.author | To, Curtis Chun Kit | - |
dc.contributor.author | Lee, Alfred Lok Hang | - |
dc.contributor.author | Li, Joshua Jing Xi | - |
dc.contributor.author | Chan, Ronald Cheong Kin | - |
dc.date.accessioned | 2024-05-10T09:07:27Z | - |
dc.date.available | 2024-05-10T09:07:27Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Informatics in Medicine Unlocked, 2022, v. 28, article no. 100850 | - |
dc.identifier.issn | 2352-9148 | - |
dc.identifier.uri | http://hdl.handle.net/10722/343358 | - |
dc.description.abstract | Recent advancements in Artificial-Intelligence-based computer vision systems demonstrated impressive image and pattern recognition capabilities. A special class of neural networks known as Convolutional Neural Networks are used in a wide variety of computer vision tasks such as image classification, object detection and autonomous driving. There is a huge potential to adopt such technology in the domain of pathology. Image data in pathology are considerably larger in size than in typical image recognition problems. Dissecting the image into smaller bits, known as tiling, are often carried out. This paper aims to compare and contrast common model architectures and input tile sizes systematically and find the optimal configuration in the context of lung cancer classification problems. A dataset composed of 87 annotated whole slide images of lung cancer specimens were collected and annotated by two pathologists. Annotated areas were grouped into four classes (Tumor area, Non-tumor area, Necrosis area, and Immune cells). Annotations were converted into labelled tiles at different tile sizes, from 296 to 10000 pixels, (74–2500 μm). The problem was framed as a supervised 4-class classification problem using deep learning. For each tile size, three models, VGG19, InceptionResnetv2 and EfficientNet b3, were trained. Model performances were measured on holdout dataset using standard quantitative metrics including F1-score and AUC-ROC. Our best model instance with tile size at 500 × 500 pixels (125 × 125 μm) achieved an F1-score at 0.9685 and AUC-ROC score at 0.9627. Our results showed that tile size had a significant impact on model performance. The optimal tile size was between 500 and 1000 pixel (125–250 μm) after both quantitative and qualitative assessments. VGG19 marginally outperformed other model architectures. | - |
dc.language | eng | - |
dc.relation.ispartof | Informatics in Medicine Unlocked | - |
dc.subject | Artificial intelligence | - |
dc.subject | Convolutional neural network | - |
dc.subject | Digital pathology | - |
dc.subject | Lung cancer | - |
dc.title | Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images | - |
dc.type | Article | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.imu.2022.100850 | - |
dc.identifier.scopus | eid_2-s2.0-85123002833 | - |
dc.identifier.volume | 28 | - |
dc.identifier.spage | article no. 100850 | - |
dc.identifier.epage | article no. 100850 | - |