Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening

Tao Yang, Jie Wang, Qian Sun, Derrek P. Hibar, Neda Jahanshad, Li Liu, Yalin Wang, Liang Zhan, Paul M. Thompson, Jieping Ye

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Citations (Scopus)

Abstract

Genetic factors play a key role in Alzheimer's disease (AD). The Alzheimer's Disease Neuroimaging Initiative (ADNI) whole genome sequence (WGS) data offers new power to investigate mechanisms of AD by combining entire genome sequences with neuroimaging and clinical data. Here we explore the ADNI WGS SNP (single nucleotide polymorphism) data in depth and extract approximately six million valid SNP features. We investigate imaging genetics associations using Lasso regression - a widely used sparse learning technique. To solve the large-scale Lasso problem more efficiently, we employ a highly efficient screening rule for Lasso - called dual polytope projections (DPP) - to remove irrelevant features from the optimization problem. Experiments demonstrate that the DPP can effectively identify irrelevant features and leads to a 400× speedup. This allows us for the first time to run the compute-intensive model selection procedure called stability selection to rank SNPs that may affect the brain and AD risk.

Original languageEnglish (US)
Title of host publicationProceedings - International Symposium on Biomedical Imaging
PublisherIEEE Computer Society
Pages985-989
Number of pages5
Volume2015-July
ISBN (Print)9781479923748
DOIs
StatePublished - Jul 21 2015
Event12th IEEE International Symposium on Biomedical Imaging, ISBI 2015 - Brooklyn, United States
Duration: Apr 16 2015Apr 19 2015

Other

Other12th IEEE International Symposium on Biomedical Imaging, ISBI 2015
CountryUnited States
CityBrooklyn
Period4/16/154/19/15

Fingerprint

Neuroimaging
Alzheimer Disease
Screening
Genes
Genome
Single Nucleotide Polymorphism
Nucleotides
Polymorphism
Brain Diseases
Brain
alachlor
Learning
Imaging techniques
Experiments

Keywords

  • Alzheimer's Disease
  • Lasso
  • Lasso Screening
  • Whole Genome Sequence

ASJC Scopus subject areas

  • Biomedical Engineering
  • Radiology Nuclear Medicine and imaging

Cite this

Yang, T., Wang, J., Sun, Q., Hibar, D. P., Jahanshad, N., Liu, L., ... Ye, J. (2015). Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening. In Proceedings - International Symposium on Biomedical Imaging (Vol. 2015-July, pp. 985-989). [7164036] IEEE Computer Society. https://doi.org/10.1109/ISBI.2015.7164036

Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening. / Yang, Tao; Wang, Jie; Sun, Qian; Hibar, Derrek P.; Jahanshad, Neda; Liu, Li; Wang, Yalin; Zhan, Liang; Thompson, Paul M.; Ye, Jieping.

Proceedings - International Symposium on Biomedical Imaging. Vol. 2015-July IEEE Computer Society, 2015. p. 985-989 7164036.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Yang, T, Wang, J, Sun, Q, Hibar, DP, Jahanshad, N, Liu, L, Wang, Y, Zhan, L, Thompson, PM & Ye, J 2015, Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening. in Proceedings - International Symposium on Biomedical Imaging. vol. 2015-July, 7164036, IEEE Computer Society, pp. 985-989, 12th IEEE International Symposium on Biomedical Imaging, ISBI 2015, Brooklyn, United States, 4/16/15. https://doi.org/10.1109/ISBI.2015.7164036
Yang T, Wang J, Sun Q, Hibar DP, Jahanshad N, Liu L et al. Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening. In Proceedings - International Symposium on Biomedical Imaging. Vol. 2015-July. IEEE Computer Society. 2015. p. 985-989. 7164036 https://doi.org/10.1109/ISBI.2015.7164036
Yang, Tao ; Wang, Jie ; Sun, Qian ; Hibar, Derrek P. ; Jahanshad, Neda ; Liu, Li ; Wang, Yalin ; Zhan, Liang ; Thompson, Paul M. ; Ye, Jieping. / Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening. Proceedings - International Symposium on Biomedical Imaging. Vol. 2015-July IEEE Computer Society, 2015. pp. 985-989
@inproceedings{d45fb4d14e5643afb10a430af0b9b010,
title = "Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening",
abstract = "Genetic factors play a key role in Alzheimer's disease (AD). The Alzheimer's Disease Neuroimaging Initiative (ADNI) whole genome sequence (WGS) data offers new power to investigate mechanisms of AD by combining entire genome sequences with neuroimaging and clinical data. Here we explore the ADNI WGS SNP (single nucleotide polymorphism) data in depth and extract approximately six million valid SNP features. We investigate imaging genetics associations using Lasso regression - a widely used sparse learning technique. To solve the large-scale Lasso problem more efficiently, we employ a highly efficient screening rule for Lasso - called dual polytope projections (DPP) - to remove irrelevant features from the optimization problem. Experiments demonstrate that the DPP can effectively identify irrelevant features and leads to a 400× speedup. This allows us for the first time to run the compute-intensive model selection procedure called stability selection to rank SNPs that may affect the brain and AD risk.",
keywords = "Alzheimer's Disease, Lasso, Lasso Screening, Whole Genome Sequence",
author = "Tao Yang and Jie Wang and Qian Sun and Hibar, {Derrek P.} and Neda Jahanshad and Li Liu and Yalin Wang and Liang Zhan and Thompson, {Paul M.} and Jieping Ye",
year = "2015",
month = "7",
day = "21",
doi = "10.1109/ISBI.2015.7164036",
language = "English (US)",
isbn = "9781479923748",
volume = "2015-July",
pages = "985--989",
booktitle = "Proceedings - International Symposium on Biomedical Imaging",
publisher = "IEEE Computer Society",

}

TY - GEN

T1 - Detecting genetic risk factors for Alzheimer's disease in whole genome sequence data via Lasso screening

AU - Yang, Tao

AU - Wang, Jie

AU - Sun, Qian

AU - Hibar, Derrek P.

AU - Jahanshad, Neda

AU - Liu, Li

AU - Wang, Yalin

AU - Zhan, Liang

AU - Thompson, Paul M.

AU - Ye, Jieping

PY - 2015/7/21

Y1 - 2015/7/21

N2 - Genetic factors play a key role in Alzheimer's disease (AD). The Alzheimer's Disease Neuroimaging Initiative (ADNI) whole genome sequence (WGS) data offers new power to investigate mechanisms of AD by combining entire genome sequences with neuroimaging and clinical data. Here we explore the ADNI WGS SNP (single nucleotide polymorphism) data in depth and extract approximately six million valid SNP features. We investigate imaging genetics associations using Lasso regression - a widely used sparse learning technique. To solve the large-scale Lasso problem more efficiently, we employ a highly efficient screening rule for Lasso - called dual polytope projections (DPP) - to remove irrelevant features from the optimization problem. Experiments demonstrate that the DPP can effectively identify irrelevant features and leads to a 400× speedup. This allows us for the first time to run the compute-intensive model selection procedure called stability selection to rank SNPs that may affect the brain and AD risk.

AB - Genetic factors play a key role in Alzheimer's disease (AD). The Alzheimer's Disease Neuroimaging Initiative (ADNI) whole genome sequence (WGS) data offers new power to investigate mechanisms of AD by combining entire genome sequences with neuroimaging and clinical data. Here we explore the ADNI WGS SNP (single nucleotide polymorphism) data in depth and extract approximately six million valid SNP features. We investigate imaging genetics associations using Lasso regression - a widely used sparse learning technique. To solve the large-scale Lasso problem more efficiently, we employ a highly efficient screening rule for Lasso - called dual polytope projections (DPP) - to remove irrelevant features from the optimization problem. Experiments demonstrate that the DPP can effectively identify irrelevant features and leads to a 400× speedup. This allows us for the first time to run the compute-intensive model selection procedure called stability selection to rank SNPs that may affect the brain and AD risk.

KW - Alzheimer's Disease

KW - Lasso

KW - Lasso Screening

KW - Whole Genome Sequence

UR - http://www.scopus.com/inward/record.url?scp=84944321509&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84944321509&partnerID=8YFLogxK

U2 - 10.1109/ISBI.2015.7164036

DO - 10.1109/ISBI.2015.7164036

M3 - Conference contribution

SN - 9781479923748

VL - 2015-July

SP - 985

EP - 989

BT - Proceedings - International Symposium on Biomedical Imaging

PB - IEEE Computer Society

ER -