TY - GEN
T1 - Partitioning and Gaussian Processes for Accelerating Sampling in Monte Carlo Tree Search for Continuous Decisions
AU - Liu, Menghan
AU - Pedrielli, Giulia
AU - Cao, Yumeng
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - We propose Part-MCTS for sampling continuous decisions at each stage of a Monte Carlo Tree Search algorithm. At each MCTS stage, Part-MCTS sequentially partitions the decision space and keeps a collection of Gaussian processes to describe the landscape of the objective function. A classification criteria based on the estimation of the minimum allows us to focus the attention on regions with better predicted behavior, reducing the evaluation effort elsewhere. Within each subregion, we can use any sampling distribution, and we propose to sample using Bayesian optimization. We compare our approach to KR-UCT (Yee et al. 2016) as state of the art competitor. Part-MCTS achieves better accuracy over a set of nonlinear test functions, and it has the ability to identify multiple promising solutions in a single run. This can be important when multiple solutions from a stage can be preserved and expanded at subsequent stages.
AB - We propose Part-MCTS for sampling continuous decisions at each stage of a Monte Carlo Tree Search algorithm. At each MCTS stage, Part-MCTS sequentially partitions the decision space and keeps a collection of Gaussian processes to describe the landscape of the objective function. A classification criteria based on the estimation of the minimum allows us to focus the attention on regions with better predicted behavior, reducing the evaluation effort elsewhere. Within each subregion, we can use any sampling distribution, and we propose to sample using Bayesian optimization. We compare our approach to KR-UCT (Yee et al. 2016) as state of the art competitor. Part-MCTS achieves better accuracy over a set of nonlinear test functions, and it has the ability to identify multiple promising solutions in a single run. This can be important when multiple solutions from a stage can be preserved and expanded at subsequent stages.
UR - http://www.scopus.com/inward/record.url?scp=85126100708&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126100708&partnerID=8YFLogxK
U2 - 10.1109/WSC52266.2021.9715405
DO - 10.1109/WSC52266.2021.9715405
M3 - Conference contribution
AN - SCOPUS:85126100708
T3 - Proceedings - Winter Simulation Conference
BT - 2021 Winter Simulation Conference, WSC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 Winter Simulation Conference, WSC 2021
Y2 - 12 December 2021 through 15 December 2021
ER -