TY - GEN
T1 - Adaptive Beam Alignment in Mm-Wave Networks
T2 - 2021 IEEE Global Communications Conference, GLOBECOM 2021
AU - Hussain, Muddassar
AU - Michelusi, Nicolo
N1 - Funding Information:
This work was supported in part by the National Science Foundation under grants CNS-1642982 and CNS-2129015. An extended version of this paper has been submitted to IEEE JSAC [1] M. Hussain is with the School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, USA; email: hussai13@purdue.edu. N. Michelusi is with the School of Electrical, Computer and Energy Engineering, Arizona State University, AZ, USA; email: nicolo.michelusi@asu.edu.
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - This paper proposes a dual timescale learning and adaptation framework to learn a probabilistic model of beam dynamics and concurrently exploit this model to design adaptive beam-training with low overhead: on a long timescale, a deep recurrent variational autoencoder (DR-VAE) uses noisy beam-training observations to learn a probabilistic model of beam dynamics; on a short timescale, an adaptive beam-training procedure is formulated as a partially observable Markov decision process and optimized using point-based value iteration by leveraging beam-training feedback and probabilistic predictions of the strongest beam pair provided by the DR-VAE. In turn, beam-training observations are used to refine the DR-VAE via stochastic gradient ascent in a continuous process of learning and adaptation. It is shown that the proposed DR-VAE learning framework learns accurate beam dynamics and, as learning progresses, the training overhead decreases and the spectral efficiency increases. Moreover, the proposed dual timescale approach achieves near-optimal spectral efficiency, with a gain of 85% over a policy that scans exhaustively over the dominant beam pairs, and of 18% over a state-of-the-art POMDP policy.
AB - This paper proposes a dual timescale learning and adaptation framework to learn a probabilistic model of beam dynamics and concurrently exploit this model to design adaptive beam-training with low overhead: on a long timescale, a deep recurrent variational autoencoder (DR-VAE) uses noisy beam-training observations to learn a probabilistic model of beam dynamics; on a short timescale, an adaptive beam-training procedure is formulated as a partially observable Markov decision process and optimized using point-based value iteration by leveraging beam-training feedback and probabilistic predictions of the strongest beam pair provided by the DR-VAE. In turn, beam-training observations are used to refine the DR-VAE via stochastic gradient ascent in a continuous process of learning and adaptation. It is shown that the proposed DR-VAE learning framework learns accurate beam dynamics and, as learning progresses, the training overhead decreases and the spectral efficiency increases. Moreover, the proposed dual timescale approach achieves near-optimal spectral efficiency, with a gain of 85% over a policy that scans exhaustively over the dominant beam pairs, and of 18% over a state-of-the-art POMDP policy.
UR - http://www.scopus.com/inward/record.url?scp=85119247148&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119247148&partnerID=8YFLogxK
U2 - 10.1109/GLOBECOM46510.2021.9685969
DO - 10.1109/GLOBECOM46510.2021.9685969
M3 - Conference contribution
AN - SCOPUS:85119247148
T3 - 2021 IEEE Global Communications Conference, GLOBECOM 2021 - Proceedings
BT - 2021 IEEE Global Communications Conference, GLOBECOM 2021 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 7 December 2021 through 11 December 2021
ER -