Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks
Satellite-terrestrial integrated network (STIN) is an indispensable component of the Next Generation Internet (NGI) due to its wide coverage, high flexibility, and seamless communication services. It uses the part of satellite network to provide communication services to the users who cannot communi...
Guardado en:
Autores principales: | , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Hindawi-Wiley
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/9b75be738fa840d692430dd1869c5f76 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:9b75be738fa840d692430dd1869c5f76 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:9b75be738fa840d692430dd1869c5f762021-11-08T02:36:09ZReinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks1530-867710.1155/2021/3759631https://doaj.org/article/9b75be738fa840d692430dd1869c5f762021-01-01T00:00:00Zhttp://dx.doi.org/10.1155/2021/3759631https://doaj.org/toc/1530-8677Satellite-terrestrial integrated network (STIN) is an indispensable component of the Next Generation Internet (NGI) due to its wide coverage, high flexibility, and seamless communication services. It uses the part of satellite network to provide communication services to the users who cannot communicate directly in terrestrial network. However, existing satellite routing algorithms ignore the users’ request resources and the states of the satellite network. Therefore, these algorithms cannot effectively manage network resources in routing, leading to the congestion of satellite network in advance. To solve this problem, we model the routing problem in satellite network as a finite-state Markov decision process and formulate it as a combinatorial optimization problem. Then, we put forth a Q-learning-based routing algorithm (QLRA). By maximizing users’ utility, our proposed QLRA algorithm is able to select the optimal paths according to the dynamic characteristics of satellite network. Considering that the convergence speed of QLRA is slow due to the routing loop or ping-pong effect in the process of routing, we propose a split-based speed-up convergence strategy and also design a speed-up Q-learning-based routing algorithm, termed SQLRA. In addition, we update the Q value of each node from back to front in the learning process, which further accelerate the convergence speed of SQLRA. Experimental results show that our improved routing algorithm SQLRA greatly enhances the performance of satellite network in terms of throughput, delay, and bit error rate compared with other routing algorithms.Yabo YinChuanghe HuangDong-Fang WuShidong HuangM. Wasim Abbas AshrafQianqian GuoHindawi-WileyarticleTechnologyTTelecommunicationTK5101-6720ENWireless Communications and Mobile Computing, Vol 2021 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Technology T Telecommunication TK5101-6720 |
spellingShingle |
Technology T Telecommunication TK5101-6720 Yabo Yin Chuanghe Huang Dong-Fang Wu Shidong Huang M. Wasim Abbas Ashraf Qianqian Guo Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks |
description |
Satellite-terrestrial integrated network (STIN) is an indispensable component of the Next Generation Internet (NGI) due to its wide coverage, high flexibility, and seamless communication services. It uses the part of satellite network to provide communication services to the users who cannot communicate directly in terrestrial network. However, existing satellite routing algorithms ignore the users’ request resources and the states of the satellite network. Therefore, these algorithms cannot effectively manage network resources in routing, leading to the congestion of satellite network in advance. To solve this problem, we model the routing problem in satellite network as a finite-state Markov decision process and formulate it as a combinatorial optimization problem. Then, we put forth a Q-learning-based routing algorithm (QLRA). By maximizing users’ utility, our proposed QLRA algorithm is able to select the optimal paths according to the dynamic characteristics of satellite network. Considering that the convergence speed of QLRA is slow due to the routing loop or ping-pong effect in the process of routing, we propose a split-based speed-up convergence strategy and also design a speed-up Q-learning-based routing algorithm, termed SQLRA. In addition, we update the Q value of each node from back to front in the learning process, which further accelerate the convergence speed of SQLRA. Experimental results show that our improved routing algorithm SQLRA greatly enhances the performance of satellite network in terms of throughput, delay, and bit error rate compared with other routing algorithms. |
format |
article |
author |
Yabo Yin Chuanghe Huang Dong-Fang Wu Shidong Huang M. Wasim Abbas Ashraf Qianqian Guo |
author_facet |
Yabo Yin Chuanghe Huang Dong-Fang Wu Shidong Huang M. Wasim Abbas Ashraf Qianqian Guo |
author_sort |
Yabo Yin |
title |
Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks |
title_short |
Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks |
title_full |
Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks |
title_fullStr |
Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks |
title_full_unstemmed |
Reinforcement Learning-Based Routing Algorithm in Satellite-Terrestrial Integrated Networks |
title_sort |
reinforcement learning-based routing algorithm in satellite-terrestrial integrated networks |
publisher |
Hindawi-Wiley |
publishDate |
2021 |
url |
https://doaj.org/article/9b75be738fa840d692430dd1869c5f76 |
work_keys_str_mv |
AT yaboyin reinforcementlearningbasedroutingalgorithminsatelliteterrestrialintegratednetworks AT chuanghehuang reinforcementlearningbasedroutingalgorithminsatelliteterrestrialintegratednetworks AT dongfangwu reinforcementlearningbasedroutingalgorithminsatelliteterrestrialintegratednetworks AT shidonghuang reinforcementlearningbasedroutingalgorithminsatelliteterrestrialintegratednetworks AT mwasimabbasashraf reinforcementlearningbasedroutingalgorithminsatelliteterrestrialintegratednetworks AT qianqianguo reinforcementlearningbasedroutingalgorithminsatelliteterrestrialintegratednetworks |
_version_ |
1718443213309607936 |