Multimodality datasets are becoming increasingly common in various domains to provide complementary information for predictive analytics. One significant challenge in fusing multimodality data is that the multiple modalities are not universally available for all samples due to cost and accessibility constraints. This situation results in a unique data structure called an Incomplete Multimodality Dataset. We propose a novel Incomplete-Multimodality Transfer Learning (IMTL) model that builds a predictive model for each sub-cohort of samples with the same missing modality pattern, and meanwhile couples the model estimation processes for different sub-cohorts to allow for transfer learning. We develop an Expectation-Maximization (EM) algorithm to estimate the parameters of IMTL and further extend it to a collaborative learning paradigm that is specifically valuable for patient privacy preservation in health care applications. We prove two advantageous properties of IMTL: the ability for out-of-sample prediction and a theoretical guarantee for a larger Fisher information compared with models without transfer learning. IMTL is applied to diagnosis and prognosis of Alzheimer’s disease at an early stage called Mild Cognitive Impairment using incomplete multimodality imaging data. IMTL achieves higher accuracy than competing methods without transfer learning.
- Incomplete multimodality data
- health care
- predictive analytics
- transfer learning
ASJC Scopus subject areas
- Industrial and Manufacturing Engineering