Reinforcement Learning of Beam Codebooks in Millimeter Wave and Terahertz MIMO Systems

Yu Zhang; Muhammad Alrabeiah; Ahmed Alkhateeb

doi:10.1109/TCOMM.2021.3126856

Reinforcement Learning of Beam Codebooks in Millimeter Wave and Terahertz MIMO Systems

Yu Zhang, Muhammad Alrabeiah, Ahmed Alkhateeb

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Contribution to journal › Article › peer-review

29 Scopus citations

Abstract

Millimeter wave (mmWave) and terahertz MIMO systems rely on pre-defined beamforming codebooks for both initial access and data transmission. These pre-defined codebooks, however, are commonly not optimized for specific environments, user distributions, and/or possible hardware impairments. This leads to large codebook sizes with high beam training overhead which makes it hard for these systems to support highly mobile applications. To overcome these limitations, this paper develops a deep reinforcement learning framework that learns how to optimize the codebook beam patterns relying only on the receive power measurements. The developed model learns how to adapt the beam patterns based on the surrounding environment, user distribution, hardware impairments, and array geometry. Further, this approach does not require any knowledge about the channel, RF hardware, or user positions. To reduce the learning time, the proposed model designs a novel Wolpertinger-variant architecture that is capable of efficiently searching the large discrete action space. The proposed learning framework respects the RF hardware constraints such as the constant-modulus and quantized phase shifter constraints. Simulation results confirm the ability of the developed framework to learn near-optimal beam patterns for line-of-sight (LOS), non-LOS (NLOS), mixed LOS/NLOS scenarios and for arrays with hardware impairments without requiring any channel knowledge.

Original language	English (US)
Pages (from-to)	904-919
Number of pages	16
Journal	IEEE Transactions on Communications
Volume	70
Issue number	2
DOIs	https://doi.org/10.1109/TCOMM.2021.3126856
State	Published - Feb 1 2022

Keywords

Beamforming codebook
millimeter wave (mmWave)
reinforcement learning
site-specific
terahertz (THz)

ASJC Scopus subject areas

Electrical and Electronic Engineering

Access to Document

10.1109/TCOMM.2021.3126856

Cite this

@article{472ff5a12ef542d09d440f2d4cff392c,

title = "Reinforcement Learning of Beam Codebooks in Millimeter Wave and Terahertz MIMO Systems",

abstract = "Millimeter wave (mmWave) and terahertz MIMO systems rely on pre-defined beamforming codebooks for both initial access and data transmission. These pre-defined codebooks, however, are commonly not optimized for specific environments, user distributions, and/or possible hardware impairments. This leads to large codebook sizes with high beam training overhead which makes it hard for these systems to support highly mobile applications. To overcome these limitations, this paper develops a deep reinforcement learning framework that learns how to optimize the codebook beam patterns relying only on the receive power measurements. The developed model learns how to adapt the beam patterns based on the surrounding environment, user distribution, hardware impairments, and array geometry. Further, this approach does not require any knowledge about the channel, RF hardware, or user positions. To reduce the learning time, the proposed model designs a novel Wolpertinger-variant architecture that is capable of efficiently searching the large discrete action space. The proposed learning framework respects the RF hardware constraints such as the constant-modulus and quantized phase shifter constraints. Simulation results confirm the ability of the developed framework to learn near-optimal beam patterns for line-of-sight (LOS), non-LOS (NLOS), mixed LOS/NLOS scenarios and for arrays with hardware impairments without requiring any channel knowledge.",

keywords = "Beamforming codebook, millimeter wave (mmWave), reinforcement learning, site-specific, terahertz (THz)",

author = "Yu Zhang and Muhammad Alrabeiah and Ahmed Alkhateeb",

note = "Funding Information: This work is supported by the National Science Foundation under Grant No. 1923676. Publisher Copyright: {\textcopyright} 1972-2012 IEEE.",

year = "2022",

month = feb,

day = "1",

doi = "10.1109/TCOMM.2021.3126856",

language = "English (US)",

volume = "70",

pages = "904--919",

journal = "IEEE Transactions on Communications",

issn = "0090-6778",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "2",

}

TY - JOUR

T1 - Reinforcement Learning of Beam Codebooks in Millimeter Wave and Terahertz MIMO Systems

AU - Zhang, Yu

AU - Alrabeiah, Muhammad

AU - Alkhateeb, Ahmed

PY - 2022/2/1

Y1 - 2022/2/1

N2 - Millimeter wave (mmWave) and terahertz MIMO systems rely on pre-defined beamforming codebooks for both initial access and data transmission. These pre-defined codebooks, however, are commonly not optimized for specific environments, user distributions, and/or possible hardware impairments. This leads to large codebook sizes with high beam training overhead which makes it hard for these systems to support highly mobile applications. To overcome these limitations, this paper develops a deep reinforcement learning framework that learns how to optimize the codebook beam patterns relying only on the receive power measurements. The developed model learns how to adapt the beam patterns based on the surrounding environment, user distribution, hardware impairments, and array geometry. Further, this approach does not require any knowledge about the channel, RF hardware, or user positions. To reduce the learning time, the proposed model designs a novel Wolpertinger-variant architecture that is capable of efficiently searching the large discrete action space. The proposed learning framework respects the RF hardware constraints such as the constant-modulus and quantized phase shifter constraints. Simulation results confirm the ability of the developed framework to learn near-optimal beam patterns for line-of-sight (LOS), non-LOS (NLOS), mixed LOS/NLOS scenarios and for arrays with hardware impairments without requiring any channel knowledge.

AB - Millimeter wave (mmWave) and terahertz MIMO systems rely on pre-defined beamforming codebooks for both initial access and data transmission. These pre-defined codebooks, however, are commonly not optimized for specific environments, user distributions, and/or possible hardware impairments. This leads to large codebook sizes with high beam training overhead which makes it hard for these systems to support highly mobile applications. To overcome these limitations, this paper develops a deep reinforcement learning framework that learns how to optimize the codebook beam patterns relying only on the receive power measurements. The developed model learns how to adapt the beam patterns based on the surrounding environment, user distribution, hardware impairments, and array geometry. Further, this approach does not require any knowledge about the channel, RF hardware, or user positions. To reduce the learning time, the proposed model designs a novel Wolpertinger-variant architecture that is capable of efficiently searching the large discrete action space. The proposed learning framework respects the RF hardware constraints such as the constant-modulus and quantized phase shifter constraints. Simulation results confirm the ability of the developed framework to learn near-optimal beam patterns for line-of-sight (LOS), non-LOS (NLOS), mixed LOS/NLOS scenarios and for arrays with hardware impairments without requiring any channel knowledge.

KW - Beamforming codebook

KW - millimeter wave (mmWave)

KW - reinforcement learning

KW - site-specific

KW - terahertz (THz)

UR - http://www.scopus.com/inward/record.url?scp=85119012709&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85119012709&partnerID=8YFLogxK

U2 - 10.1109/TCOMM.2021.3126856

DO - 10.1109/TCOMM.2021.3126856

M3 - Article

AN - SCOPUS:85119012709

SN - 0090-6778

VL - 70

SP - 904

EP - 919

JO - IEEE Transactions on Communications

JF - IEEE Transactions on Communications

IS - 2

ER -

Reinforcement Learning of Beam Codebooks in Millimeter Wave and Terahertz MIMO Systems

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this