Partitioning and Gaussian Processes for Accelerating Sampling in Monte Carlo Tree Search for Continuous Decisions

Menghan Liu, Giulia Pedrielli, Yumeng Cao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We propose Part-MCTS for sampling continuous decisions at each stage of a Monte Carlo Tree Search algorithm. At each MCTS stage, Part-MCTS sequentially partitions the decision space and keeps a collection of Gaussian processes to describe the landscape of the objective function. A classification criteria based on the estimation of the minimum allows us to focus the attention on regions with better predicted behavior, reducing the evaluation effort elsewhere. Within each subregion, we can use any sampling distribution, and we propose to sample using Bayesian optimization. We compare our approach to KR-UCT (Yee et al. 2016) as state of the art competitor. Part-MCTS achieves better accuracy over a set of nonlinear test functions, and it has the ability to identify multiple promising solutions in a single run. This can be important when multiple solutions from a stage can be preserved and expanded at subsequent stages.

Original languageEnglish (US)
Title of host publication2021 Winter Simulation Conference, WSC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665433112
DOIs
StatePublished - 2021
Event2021 Winter Simulation Conference, WSC 2021 - Phoenix, United States
Duration: Dec 12 2021Dec 15 2021

Publication series

NameProceedings - Winter Simulation Conference
Volume2021-December
ISSN (Print)0891-7736

Conference

Conference2021 Winter Simulation Conference, WSC 2021
Country/TerritoryUnited States
CityPhoenix
Period12/12/2112/15/21

ASJC Scopus subject areas

  • Software
  • Modeling and Simulation
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Partitioning and Gaussian Processes for Accelerating Sampling in Monte Carlo Tree Search for Continuous Decisions'. Together they form a unique fingerprint.

Cite this