Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

Lee, Angus Lang Sun; To, Curtis Chun Kit; Lee, Alfred Lok Hang; Li, Joshua Jing Xi; Chan, Ronald Cheong Kin

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1016/j.imu.2022.100850
Scopus: eid_2-s2.0-85123002833
Find via

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Pathology: Journal/Magazine Articles

Article: Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

Title	Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images
Authors	Lee, Angus Lang Sun To, Curtis Chun Kit Lee, Alfred Lok Hang Li, Joshua Jing Xi Chan, Ronald Cheong Kin
Keywords	Artificial intelligence Convolutional neural network Digital pathology Lung cancer
Issue Date	2022
Citation	Informatics in Medicine Unlocked, 2022, v. 28, article no. 100850 How to Cite? DOI: http://dx.doi.org/10.1016/j.imu.2022.100850
Abstract	Recent advancements in Artificial-Intelligence-based computer vision systems demonstrated impressive image and pattern recognition capabilities. A special class of neural networks known as Convolutional Neural Networks are used in a wide variety of computer vision tasks such as image classification, object detection and autonomous driving. There is a huge potential to adopt such technology in the domain of pathology. Image data in pathology are considerably larger in size than in typical image recognition problems. Dissecting the image into smaller bits, known as tiling, are often carried out. This paper aims to compare and contrast common model architectures and input tile sizes systematically and find the optimal configuration in the context of lung cancer classification problems. A dataset composed of 87 annotated whole slide images of lung cancer specimens were collected and annotated by two pathologists. Annotated areas were grouped into four classes (Tumor area, Non-tumor area, Necrosis area, and Immune cells). Annotations were converted into labelled tiles at different tile sizes, from 296 to 10000 pixels, (74–2500 μm). The problem was framed as a supervised 4-class classification problem using deep learning. For each tile size, three models, VGG19, InceptionResnetv2 and EfficientNet b3, were trained. Model performances were measured on holdout dataset using standard quantitative metrics including F1-score and AUC-ROC. Our best model instance with tile size at 500 × 500 pixels (125 × 125 μm) achieved an F1-score at 0.9685 and AUC-ROC score at 0.9627. Our results showed that tile size had a significant impact on model performance. The optimal tile size was between 500 and 1000 pixel (125–250 μm) after both quantitative and qualitative assessments. VGG19 marginally outperformed other model architectures.
Persistent Identifier	http://hdl.handle.net/10722/343358
ISSN	2352-9148 2023 SCImago Journal Rankings: 0.758

DC Field	Value	Language
dc.contributor.author	Lee, Angus Lang Sun	-
dc.contributor.author	To, Curtis Chun Kit	-
dc.contributor.author	Lee, Alfred Lok Hang	-
dc.contributor.author	Li, Joshua Jing Xi	-
dc.contributor.author	Chan, Ronald Cheong Kin	-
dc.date.accessioned	2024-05-10T09:07:27Z	-
dc.date.available	2024-05-10T09:07:27Z	-
dc.date.issued	2022	-
dc.identifier.citation	Informatics in Medicine Unlocked, 2022, v. 28, article no. 100850	-
dc.identifier.issn	2352-9148	-
dc.identifier.uri	http://hdl.handle.net/10722/343358	-
dc.description.abstract	Recent advancements in Artificial-Intelligence-based computer vision systems demonstrated impressive image and pattern recognition capabilities. A special class of neural networks known as Convolutional Neural Networks are used in a wide variety of computer vision tasks such as image classification, object detection and autonomous driving. There is a huge potential to adopt such technology in the domain of pathology. Image data in pathology are considerably larger in size than in typical image recognition problems. Dissecting the image into smaller bits, known as tiling, are often carried out. This paper aims to compare and contrast common model architectures and input tile sizes systematically and find the optimal configuration in the context of lung cancer classification problems. A dataset composed of 87 annotated whole slide images of lung cancer specimens were collected and annotated by two pathologists. Annotated areas were grouped into four classes (Tumor area, Non-tumor area, Necrosis area, and Immune cells). Annotations were converted into labelled tiles at different tile sizes, from 296 to 10000 pixels, (74–2500 μm). The problem was framed as a supervised 4-class classification problem using deep learning. For each tile size, three models, VGG19, InceptionResnetv2 and EfficientNet b3, were trained. Model performances were measured on holdout dataset using standard quantitative metrics including F1-score and AUC-ROC. Our best model instance with tile size at 500 × 500 pixels (125 × 125 μm) achieved an F1-score at 0.9685 and AUC-ROC score at 0.9627. Our results showed that tile size had a significant impact on model performance. The optimal tile size was between 500 and 1000 pixel (125–250 μm) after both quantitative and qualitative assessments. VGG19 marginally outperformed other model architectures.	-
dc.language	eng	-
dc.relation.ispartof	Informatics in Medicine Unlocked	-
dc.subject	Artificial intelligence	-
dc.subject	Convolutional neural network	-
dc.subject	Digital pathology	-
dc.subject	Lung cancer	-
dc.title	Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images	-
dc.type	Article	-
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1016/j.imu.2022.100850	-
dc.identifier.scopus	eid_2-s2.0-85123002833	-
dc.identifier.volume	28	-
dc.identifier.spage	article no. 100850	-
dc.identifier.epage	article no. 100850	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Model architecture and tile size selection for convolutional neural network training for non-small cell lung cancer detection on whole slide images

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats