Toward robust, reliable, and ultra-low-latency edge intelligence

Wang, Zhanwei; 王占伟

File Download

FullText.pdf

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: Toward robust, reliable, and ultra-low-latency edge intelligence

Title	Toward robust, reliable, and ultra-low-latency edge intelligence
Authors	Wang, Zhanwei 王占伟
Advisors	Advisor(s):Huang, K
Issue Date	2025
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Wang, Z. [王占伟]. (2025). Toward robust, reliable, and ultra-low-latency edge intelligence. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	The rapid advancement of chip-level computing capabilities, coupled with the exponential proliferation of Internet-of-Things (IoT) devices, has ushered in an era of data explosion in wireless networks. This trend facilitates the migration of artificial intelligence (AI) from centralized cloud infrastructures to the network edge, closer to distributed data, giving rise to a new research area known as edge intelligence. One of the key features of edge intelligence is edge learning, which extracts AI from local data using machine learning algorithms; the other is integrated sensing and edge AI (ISEA), focusing on the efficient deployment of AI models at edge networks for environmental perception. However, the implementation of these functions faces two primary challenges: 1) interference-induced perturbations during data exchange between the edge server and devices, and 2) end-to-end (E2E) latency in task execution, involving data sensing, on-device computation, and communication. To address these challenges, this dissertation explores task-oriented designs to support robust, reliable, and ultra-low-latency edge intelligence. To achieve robust edge intelligence, this dissertation proposes the spectrum breathing framework. This framework integrates data compression and spread-spectrum techniques to suppress interference without bandwidth expansion. The former operation contracts the data bandwidth for the subsequent spectrum spreading. Relevant topics explored in this dissertation include edge learning and distributed sensing. For robust edge learning, the proposed framework cascades the gradient pruning and spread spectrum for interference suppression. The convergence speed under this transceiver is characterized based on the supermartingale process, revealing the performance tradeoff between the pruning error and induced interference under constrained bandwidth. Optimization of the tradeoff yields two schemes that can be either fixed or adaptive to channels and the learning process, verified experimentally to achieve near-optimal performance. As for robust distributed sensing, spectrum breathing cascades feature compression and spread spectrum prior to feature transmission. With the metric of sensing accuracy, the tradeoff between these two operations is uncovered under a bandwidth constraint. The optimal feature compression strategy, adaptive to varying channel conditions, is derived by approximating sensing accuracy with a unimodal surrogate function. Evaluations on real-world datasets demonstrate the superiority of the proposed approach. Targeting mission-critical and time-sensitive tasks, this dissertation achieves reliable and ultra-low-latency edge intelligence. One contribution enhances the reliability of latency-constrained edge inference systems by introducing and mathematically characterizing a goal-oriented reliability metric, termed inference outage (InfOut) probability. Constrained by E2E latency, this framework establishes a fundamental tradeoff between communication overhead and InfOut probability. This dissertation then enables tractable tradeoff optimization by deriving accurate surrogate functions for InfOut probability. Experiments demonstrate the significant performance gain from conventional communication-centric approaches. To meet strict latency requirements, the other contribution proposes an ultra-low-latency (ultra-LoLa) inference framework that integrates short-packet transmission with edge inference in distributed sensing. An efficient optimization approach is developed by characterizing the tradeoff between packet length and sensing observations. Experiments show its superiority over conventional reliability-oriented protocols.
Degree	Doctor of Philosophy
Subject	Edge computing Artificial intelligence
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/367477

DC Field	Value	Language
dc.contributor.advisor	Huang, K	-
dc.contributor.author	Wang, Zhanwei	-
dc.contributor.author	王占伟	-
dc.date.accessioned	2025-12-11T06:42:22Z	-
dc.date.available	2025-12-11T06:42:22Z	-
dc.date.issued	2025	-
dc.identifier.citation	Wang, Z. [王占伟]. (2025). Toward robust, reliable, and ultra-low-latency edge intelligence. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/367477	-
dc.description.abstract	The rapid advancement of chip-level computing capabilities, coupled with the exponential proliferation of Internet-of-Things (IoT) devices, has ushered in an era of data explosion in wireless networks. This trend facilitates the migration of artificial intelligence (AI) from centralized cloud infrastructures to the network edge, closer to distributed data, giving rise to a new research area known as edge intelligence. One of the key features of edge intelligence is edge learning, which extracts AI from local data using machine learning algorithms; the other is integrated sensing and edge AI (ISEA), focusing on the efficient deployment of AI models at edge networks for environmental perception. However, the implementation of these functions faces two primary challenges: 1) interference-induced perturbations during data exchange between the edge server and devices, and 2) end-to-end (E2E) latency in task execution, involving data sensing, on-device computation, and communication. To address these challenges, this dissertation explores task-oriented designs to support robust, reliable, and ultra-low-latency edge intelligence. To achieve robust edge intelligence, this dissertation proposes the spectrum breathing framework. This framework integrates data compression and spread-spectrum techniques to suppress interference without bandwidth expansion. The former operation contracts the data bandwidth for the subsequent spectrum spreading. Relevant topics explored in this dissertation include edge learning and distributed sensing. For robust edge learning, the proposed framework cascades the gradient pruning and spread spectrum for interference suppression. The convergence speed under this transceiver is characterized based on the supermartingale process, revealing the performance tradeoff between the pruning error and induced interference under constrained bandwidth. Optimization of the tradeoff yields two schemes that can be either fixed or adaptive to channels and the learning process, verified experimentally to achieve near-optimal performance. As for robust distributed sensing, spectrum breathing cascades feature compression and spread spectrum prior to feature transmission. With the metric of sensing accuracy, the tradeoff between these two operations is uncovered under a bandwidth constraint. The optimal feature compression strategy, adaptive to varying channel conditions, is derived by approximating sensing accuracy with a unimodal surrogate function. Evaluations on real-world datasets demonstrate the superiority of the proposed approach. Targeting mission-critical and time-sensitive tasks, this dissertation achieves reliable and ultra-low-latency edge intelligence. One contribution enhances the reliability of latency-constrained edge inference systems by introducing and mathematically characterizing a goal-oriented reliability metric, termed inference outage (InfOut) probability. Constrained by E2E latency, this framework establishes a fundamental tradeoff between communication overhead and InfOut probability. This dissertation then enables tractable tradeoff optimization by deriving accurate surrogate functions for InfOut probability. Experiments demonstrate the significant performance gain from conventional communication-centric approaches. To meet strict latency requirements, the other contribution proposes an ultra-low-latency (ultra-LoLa) inference framework that integrates short-packet transmission with edge inference in distributed sensing. An efficient optimization approach is developed by characterizing the tradeoff between packet length and sensing observations. Experiments show its superiority over conventional reliability-oriented protocols.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Edge computing	-
dc.subject.lcsh	Artificial intelligence	-
dc.title	Toward robust, reliable, and ultra-low-latency edge intelligence	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2025	-
dc.identifier.mmsid	991045147153003414	-

File Download

Supplementary

postgraduate thesis: Toward robust, reliable, and ultra-low-latency edge intelligence

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats