File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Adaptive Video Streaming for Massive MIMO Networks via Approximate MDP and Reinforcement Learning

TitleAdaptive Video Streaming for Massive MIMO Networks via Approximate MDP and Reinforcement Learning
Authors
KeywordsStreaming media
Quality of experience
Wireless communication
Bit rate
Massive MIMO
Issue Date2020
PublisherInstitute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7693
Citation
IEEE Transactions on Wireless Communications, 2020, v. 9 n. 19, p. 5716-5731 How to Cite?
AbstractThe scheduling of downlink video streaming in a massive multiple-input multiple-output (MIMO) network is considered in this paper, where active users arrive randomly to request video contents of a finite playback duration via their service base stations (BSs). Each video content consisting of a sequence of segments can be transmitted to the requesting users with variable video bitrates. We formulate the joint control of transmitted segment number, frame allocation and segment bitrate in all the super frames (each comprising multiple frames) as an infinite-horizon Markov decision process (MDP). The maximization objective is a discounted measurement of the average Quality of Experience (QoE). Since there is no efficient method for scheduling design with random user arrivals and departures in the existing literature, a novel approximate MDP method is proposed to obtain a low-complexity scheduling policy, where a lower bound on its performance is derived. Specifically, we first introduce a baseline policy and derive its asymptotic value function. One-step policy iteration is then applied to improve this value function, yielding the mentioned low-complexity policy. Finally, we propose a novel and efficient reinforcement learning (RL) algorithm to evaluate the value function when the prior knowledge on user arrival intensity is absent.
Persistent Identifierhttp://hdl.handle.net/10722/295883
ISSN
2021 Impact Factor: 8.346
2020 SCImago Journal Rankings: 2.010
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLAN, Q-
dc.contributor.authorLv, B-
dc.contributor.authorWang, R-
dc.contributor.authorHuang, K-
dc.contributor.authorGong, Y-
dc.date.accessioned2021-02-08T08:15:23Z-
dc.date.available2021-02-08T08:15:23Z-
dc.date.issued2020-
dc.identifier.citationIEEE Transactions on Wireless Communications, 2020, v. 9 n. 19, p. 5716-5731-
dc.identifier.issn1536-1276-
dc.identifier.urihttp://hdl.handle.net/10722/295883-
dc.description.abstractThe scheduling of downlink video streaming in a massive multiple-input multiple-output (MIMO) network is considered in this paper, where active users arrive randomly to request video contents of a finite playback duration via their service base stations (BSs). Each video content consisting of a sequence of segments can be transmitted to the requesting users with variable video bitrates. We formulate the joint control of transmitted segment number, frame allocation and segment bitrate in all the super frames (each comprising multiple frames) as an infinite-horizon Markov decision process (MDP). The maximization objective is a discounted measurement of the average Quality of Experience (QoE). Since there is no efficient method for scheduling design with random user arrivals and departures in the existing literature, a novel approximate MDP method is proposed to obtain a low-complexity scheduling policy, where a lower bound on its performance is derived. Specifically, we first introduce a baseline policy and derive its asymptotic value function. One-step policy iteration is then applied to improve this value function, yielding the mentioned low-complexity policy. Finally, we propose a novel and efficient reinforcement learning (RL) algorithm to evaluate the value function when the prior knowledge on user arrival intensity is absent.-
dc.languageeng-
dc.publisherInstitute of Electrical and Electronics Engineers. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=7693-
dc.relation.ispartofIEEE Transactions on Wireless Communications-
dc.rightsIEEE Transactions on Wireless Communications. Copyright © Institute of Electrical and Electronics Engineers.-
dc.rights©20xx IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.-
dc.subjectStreaming media-
dc.subjectQuality of experience-
dc.subjectWireless communication-
dc.subjectBit rate-
dc.subjectMassive MIMO-
dc.titleAdaptive Video Streaming for Massive MIMO Networks via Approximate MDP and Reinforcement Learning-
dc.typeArticle-
dc.identifier.emailHuang, K: huangkb@eee.hku.hk-
dc.identifier.authorityHuang, K=rp01875-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TWC.2020.2995944-
dc.identifier.scopuseid_2-s2.0-85091151663-
dc.identifier.hkuros321256-
dc.identifier.volume9-
dc.identifier.issue19-
dc.identifier.spage5716-
dc.identifier.epage5731-
dc.identifier.isiWOS:000568683900006-
dc.publisher.placeUnited States-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats