TY - JOUR
T1 - ICON
T2 - An Adaptation of Infinite HMMs for Time Traces with Drift
AU - Sgouralis, Ioannis
AU - Presse, Steve
N1 - Funding Information:
S.P. acknowledges support from the Division of Molecular and Cellular Biosciences, National Science Foundation (NSF), as well as his Indiana University-Purdue University Indianapolis Startup. Computations shown in this article were carried out at Indiana University's computing cluster (Karst), which is supported in part by the Lilly Endowment, through its support for the Indiana University Pervasive Technology Institute, and in part by the Indiana METACyt Initiative at Indiana University, which is also supported in part by the Lilly Endowment.
Publisher Copyright:
© 2017 Biophysical Society
PY - 2017/5/23
Y1 - 2017/5/23
N2 - Bayesian nonparametric methods have recently transformed emerging areas within data science. One such promising method, the infinite hidden Markov model (iHMM), generalizes the HMM that itself has become a workhorse in single molecule data analysis. The iHMM goes beyond the HMM by self-consistently learning all parameters learned by the HMM in addition to learning the number of states without recourse to any model selection steps. Despite its generality, simple features (such as drift), common to single molecule time traces, result in an overinterpretation of drift and the introduction of artifact states. Here we present an adaptation of the iHMM that can treat data with drift originating from one or many traces (e.g., Förster resonance energy transfer). Our fully Bayesian method couples the iHMM to a continuous control process (drift) self-consistently learned while learning all other quantities determined by the iHMM (including state numbers). A key advantage of this method is that all traces—regardless of drift or states visited across traces—may now be treated on an equal footing, thereby eliminating user-dependent trace selection (based on drift levels), preprocessing to remove drift, and postprocessing model selection based on state number.
AB - Bayesian nonparametric methods have recently transformed emerging areas within data science. One such promising method, the infinite hidden Markov model (iHMM), generalizes the HMM that itself has become a workhorse in single molecule data analysis. The iHMM goes beyond the HMM by self-consistently learning all parameters learned by the HMM in addition to learning the number of states without recourse to any model selection steps. Despite its generality, simple features (such as drift), common to single molecule time traces, result in an overinterpretation of drift and the introduction of artifact states. Here we present an adaptation of the iHMM that can treat data with drift originating from one or many traces (e.g., Förster resonance energy transfer). Our fully Bayesian method couples the iHMM to a continuous control process (drift) self-consistently learned while learning all other quantities determined by the iHMM (including state numbers). A key advantage of this method is that all traces—regardless of drift or states visited across traces—may now be treated on an equal footing, thereby eliminating user-dependent trace selection (based on drift levels), preprocessing to remove drift, and postprocessing model selection based on state number.
UR - http://www.scopus.com/inward/record.url?scp=85019904674&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85019904674&partnerID=8YFLogxK
U2 - 10.1016/j.bpj.2017.04.009
DO - 10.1016/j.bpj.2017.04.009
M3 - Article
C2 - 28538149
AN - SCOPUS:85019904674
SN - 0006-3495
VL - 112
SP - 2117
EP - 2126
JO - Biophysical Journal
JF - Biophysical Journal
IS - 10
ER -