TY - GEN
T1 - Learning human search strategies from a crowdsourcing game
AU - Sexton, Thurston
AU - Ren, Max Yi
N1 - Funding Information:
This work has been supported by the National Science Foundation under Grant No. CMMI-1266184 and by start-up funding from Arizona State University. This support is gratefully acknowledged.
Publisher Copyright:
© Copyright 2016 by ASME.
PY - 2016
Y1 - 2016
N2 - There is evidence that humans can be more efficient than existing algorithms at searching for good solutions in high-dimensional and non-convex design or control spaces, potentially due to our prior knowledge and learning capability. This work attempts to quantify the search strategy of human beings in order to enhance a Bayesian optimization (BO) algorithm for an optimal design and control problem. We treat the sequence of human solutions as if it were generated by BO, and propose to recover the algorithmic parameters of BO by maximizing the likelihood of the observed solution path. The method differs from inverse reinforcement learning (where an optimal control solution is learned from human demonstrations) in that the latter requires near-optimal solutions from humans, whereas we only require the existence of a good search strategy. The method is first verified through simulation studies and then applied to human solutions crowdsourced through a gamification of the problem under study [1]. We learn BO parameters from a player with a demonstrated good search strategy and show that applying the BO algorithm with these parameters to the game noticeably improves the convergence of the search compared with a default BO setting.
AB - There is evidence that humans can be more efficient than existing algorithms at searching for good solutions in high-dimensional and non-convex design or control spaces, potentially due to our prior knowledge and learning capability. This work attempts to quantify the search strategy of human beings in order to enhance a Bayesian optimization (BO) algorithm for an optimal design and control problem. We treat the sequence of human solutions as if it were generated by BO, and propose to recover the algorithmic parameters of BO by maximizing the likelihood of the observed solution path. The method differs from inverse reinforcement learning (where an optimal control solution is learned from human demonstrations) in that the latter requires near-optimal solutions from humans, whereas we only require the existence of a good search strategy. The method is first verified through simulation studies and then applied to human solutions crowdsourced through a gamification of the problem under study [1]. We learn BO parameters from a player with a demonstrated good search strategy and show that applying the BO algorithm with these parameters to the game noticeably improves the convergence of the search compared with a default BO setting.
UR - http://www.scopus.com/inward/record.url?scp=85008234786&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85008234786&partnerID=8YFLogxK
U2 - 10.1115/DETC2016-59775
DO - 10.1115/DETC2016-59775
M3 - Conference contribution
AN - SCOPUS:85008234786
T3 - Proceedings of the ASME Design Engineering Technical Conference
BT - 42nd Design Automation Conference
PB - American Society of Mechanical Engineers (ASME)
T2 - ASME 2016 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, IDETC/CIE 2016
Y2 - 21 August 2016 through 24 August 2016
ER -