Data-efficient ultrasound imaging analysis with deep learning

Sun, Xiaofei; 孫晓菲

Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: Data-efficient ultrasound imaging analysis with deep learning

Title	Data-efficient ultrasound imaging analysis with deep learning
Authors	Sun, Xiaofei 孫晓菲
Advisors	Lee, W; Lam, EYM
Issue Date	2023
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Sun, X. [孫晓菲]. (2023). Data-efficient ultrasound imaging analysis with deep learning. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Deep learning techniques for ultrasound imaging, a widespread non-invasive diagnostic imaging tool, have advanced recently. These techniques are designed to tackle long-standing challenges in ultrasound imaging, such as spatial resolution, motion analysis, and common image processing tasks (e.g., segmentation). The operator-dependent nature of ultrasound scans leads to inter-operator variability and analysis complexities, which limits the availability of large public ultrasound datasets and data annotations. The cornerstone of this thesis is the concept of data-efficient analysis, which aims to maximize model performance with limited data, thus addressing the aforementioned challenges. The contributions of this thesis are manifold. Chapter 2 introduces the CCycleGAN model, a novel approach to generating ultrasound sector images whose spatial resolution is more uniform throughout the entire sector field of view. As mentioned earlier, the operator dependency of ultrasound scanning makes the acquisition of paired high-resolution and low-resolution ultrasound images impractical. By leveraging the power of CycleGAN with unpaired data, CCycleGAN bridges the gap between unpaired ultrasound configurations (linear array vs. phased array), thus improving spatial resolution. The CCycleGAN model, trained with an efficient strategy and a newly proposed constrained-consistency loss, is tailored to ensure that generated images retain critical anatomical details and speckle patterns. The optimized architecture also ensures a swift inference time (an average of about 200 ms per image) suitable for near real-time clinical applications. Chapter 3 explores the field of 2D (i.e., lateral and axial) motion estimation in ultrasound imaging, with a special focus on the challenging lateral direction.
The proposed TransPWCLite model, a lightweight transformer-encoded optical flow pyramidal network, is introduced for accurate 2D motion estimation. This model not only showcases the potential of transformer architectures in capturing temporal dynamics inherent in ultrasound sequences but also accelerates inference (an average inference time of about 300 ms per image pair). Training with data augmentation ensures data-efficient learning, making the model suitable for limited training samples. In Chapter 4, the proposed WMSEUNet model tackles tissue segmentation in dynamic lung ultrasound images. WMSEUNet is a weakly-supervised mask enhanced multi-scale self-attention efficient UNet. It integrates self-attention mechanisms to focus on critical regions in the ultrasound images, ensuring accurate segmentation. This is particularly beneficial when images exhibit intrinsic speckle noise, low contrast, and obscured tissue boundaries, such as those between cutaneous layers and muscles. The ability to train the model with weakly-supervised annotated data alleviates the need for extensive annotations, making the training more feasible and efficient. Chapter 5 first summarizes the pivotal role of data-efficient deep learning models in ultrasound imaging, not only to address the intrinsic challenges associated with ultrasound imaging but also to provide useful insights into future research, with aspirations of widespread clinical integration. Lastly, the thesis highlights potential research areas, including semi-supervised learning, model optimization, and integration with other modalities. In conclusion, this thesis emphasizes the importance and demonstrates the feasibility of achieving high deep learning model performance while relaxing data requirements in ultrasound imaging and analysis tasks, including spatial resolution improvement, motion estimation, and tissue segmentation.
Degree	Doctor of Philosophy
Subject	Diagnostic ultrasonic imaging Deep learning (Machine learning)
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/352581

DC Field	Value	Language
dc.contributor.advisor	Lee, W	-
dc.contributor.advisor	Lam, EYM	-
dc.contributor.author	Sun, Xiaofei	-
dc.contributor.author	孫晓菲	-
dc.date.accessioned	2024-12-17T08:58:47Z	-
dc.date.available	2024-12-17T08:58:47Z	-
dc.date.issued	2023	-
dc.identifier.citation	Sun, X. [孫晓菲]. (2023). Data-efficient ultrasound imaging analysis with deep learning. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/352581	-
dc.description.abstract	Deep learning techniques for ultrasound imaging, a widespread non-invasive diagnostic imaging tool, have advanced recently. These techniques are designed to tackle long-standing challenges in ultrasound imaging, such as spatial resolution, motion analysis, and common image processing tasks (e.g., segmentation). The operator-dependent nature of ultrasound scans leads to inter-operator variability and analysis complexities, which limits the availability of large public ultrasound datasets and data annotations. The cornerstone of this thesis is the concept of data-efficient analysis, which aims to maximize model performance with limited data, thus addressing the aforementioned challenges. The contributions of this thesis are manifold. Chapter 2 introduces the CCycleGAN model, a novel approach to generating ultrasound sector images whose spatial resolution is more uniform throughout the entire sector field of view. As mentioned earlier, the operator dependency of ultrasound scanning makes the acquisition of paired high-resolution and low-resolution ultrasound images impractical. By leveraging the power of CycleGAN with unpaired data, CCycleGAN bridges the gap between unpaired ultrasound configurations (linear array vs. phased array), thus improving spatial resolution. The CCycleGAN model, trained with an efficient strategy and a newly proposed constrained-consistency loss, is tailored to ensure that generated images retain critical anatomical details and speckle patterns. The optimized architecture also ensures a swift inference time (an average of about 200 ms per image) suitable for near real-time clinical applications. Chapter 3 explores the field of 2D (i.e., lateral and axial) motion estimation in ultrasound imaging, with a special focus on the challenging lateral direction.
The proposed TransPWCLite model, a lightweight transformer-encoded optical flow pyramidal network, is introduced for accurate 2D motion estimation. This model not only showcases the potential of transformer architectures in capturing temporal dynamics inherent in ultrasound sequences but also accelerates inference (an average inference time of about 300 ms per image pair). Training with data augmentation ensures data-efficient learning, making the model suitable for limited training samples. In Chapter 4, the proposed WMSEUNet model tackles tissue segmentation in dynamic lung ultrasound images. WMSEUNet is a weakly-supervised mask enhanced multi-scale self-attention efficient UNet. It integrates self-attention mechanisms to focus on critical regions in the ultrasound images, ensuring accurate segmentation. This is particularly beneficial when images exhibit intrinsic speckle noise, low contrast, and obscured tissue boundaries, such as those between cutaneous layers and muscles. The ability to train the model with weakly-supervised annotated data alleviates the need for extensive annotations, making the training more feasible and efficient. Chapter 5 first summarizes the pivotal role of data-efficient deep learning models in ultrasound imaging, not only to address the intrinsic challenges associated with ultrasound imaging but also to provide useful insights into future research, with aspirations of widespread clinical integration. Lastly, the thesis highlights potential research areas, including semi-supervised learning, model optimization, and integration with other modalities. In conclusion, this thesis emphasizes the importance and demonstrates the feasibility of achieving high deep learning model performance while relaxing data requirements in ultrasound imaging and analysis tasks, including spatial resolution improvement, motion estimation, and tissue segmentation.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Diagnostic ultrasonic imaging	-
dc.subject.lcsh	Deep learning (Machine learning)	-
dc.title	Data-efficient ultrasound imaging analysis with deep learning	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2024	-
dc.identifier.mmsid	991044770608003414	-
