File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Stochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks

TitleStochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks
Authors
KeywordsCache placement
device-to-device communication
edge caching
stochastic game
Issue Date2021
Citation
IEEE Transactions on Vehicular Technology, 2021, v. 70, n. 12, p. 13255-13269 How to Cite?
AbstractEdge caching has become an effective solution to cope with the challenges brought by the massive content delivery in cellular networks. In device-to-device (D2D) enabled caching cellular networks with time-varying content popularity distribution and user terminal (UT) location, we model these dynamic networks as a stochastic game to design a cooperative cache placement policy. The cache placement reward of each UT is defined as the caching incentive minus the transmission power cost for content caching and sharing. We consider the long-term cache placement reward of all UTs in this stochastic game. In an effort to solve the stochastic game problem, we propose a multi-agent cooperative alternating Q-learning (CAQL) based cache placement algorithm. The caching control unit is defined to execute the proposed CAQL, in which, the cache placement policy of each UT is alternatively updated according to the stable policy of other UTs during the learning process, until the stable cache placement policy of all the UTs in the cell is obtained. We discuss the convergence and complexity of CAQL, which obtains the stable cache placement policy with low space complexity. Simulation results show that the proposed algorithm can effectively reduce the backhaul load and the average content access delay in dynamic networks.
Persistent Identifierhttp://hdl.handle.net/10722/349621
ISSN
2023 Impact Factor: 6.1
2023 SCImago Journal Rankings: 2.714

 

DC FieldValueLanguage
dc.contributor.authorZhang, Tiankui-
dc.contributor.authorFang, Xinyuan-
dc.contributor.authorWang, Ziduan-
dc.contributor.authorLiu, Yuanwei-
dc.contributor.authorNallanathan, Arumugam-
dc.date.accessioned2024-10-17T06:59:45Z-
dc.date.available2024-10-17T06:59:45Z-
dc.date.issued2021-
dc.identifier.citationIEEE Transactions on Vehicular Technology, 2021, v. 70, n. 12, p. 13255-13269-
dc.identifier.issn0018-9545-
dc.identifier.urihttp://hdl.handle.net/10722/349621-
dc.description.abstractEdge caching has become an effective solution to cope with the challenges brought by the massive content delivery in cellular networks. In device-to-device (D2D) enabled caching cellular networks with time-varying content popularity distribution and user terminal (UT) location, we model these dynamic networks as a stochastic game to design a cooperative cache placement policy. The cache placement reward of each UT is defined as the caching incentive minus the transmission power cost for content caching and sharing. We consider the long-term cache placement reward of all UTs in this stochastic game. In an effort to solve the stochastic game problem, we propose a multi-agent cooperative alternating Q-learning (CAQL) based cache placement algorithm. The caching control unit is defined to execute the proposed CAQL, in which, the cache placement policy of each UT is alternatively updated according to the stable policy of other UTs during the learning process, until the stable cache placement policy of all the UTs in the cell is obtained. We discuss the convergence and complexity of CAQL, which obtains the stable cache placement policy with low space complexity. Simulation results show that the proposed algorithm can effectively reduce the backhaul load and the average content access delay in dynamic networks.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Vehicular Technology-
dc.subjectCache placement-
dc.subjectdevice-to-device communication-
dc.subjectedge caching-
dc.subjectstochastic game-
dc.titleStochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TVT.2021.3120292-
dc.identifier.scopuseid_2-s2.0-85117852182-
dc.identifier.volume70-
dc.identifier.issue12-
dc.identifier.spage13255-
dc.identifier.epage13269-
dc.identifier.eissn1939-9359-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats