### Abstract

Interactions between features of high-dimensional biomedical data often exhibit complex and organized, yet latent, network topological structures. Estimating the non-sparse large covariance matrix of these high-dimensional biomedical data while preserving and recognizing the latent network topology are challenging. A two step procedure is proposed that first detects latent network topological structures from the sample correlation matrix by implementing new penalized optimization and then regularizes the covariance matrix by leveraging the detected network topological information. The network topology guided regularization can reduce false positive and false negative rates simultaneously because it allows edges to borrow strengths from each other precisely. Empirical data examples demonstrate that organized latent network topological structures widely exist in high-dimensional biomedical data across platforms and identifying these network structures can effectively improve estimating covariance matrix and understanding interactive relationships between biomedical features.

Original language | English (US) |
---|---|

Pages (from-to) | 82-95 |

Number of pages | 14 |

Journal | Computational Statistics and Data Analysis |

Volume | 127 |

DOIs | |

State | Published - Nov 2018 |

### Keywords

- Correlation matrix
- Graph
- Parsimony
- Shrinkage
- Thresholding

### ASJC Scopus subject areas

- Statistics and Probability
- Computational Mathematics
- Computational Theory and Mathematics
- Applied Mathematics

## Fingerprint Dive into the research topics of 'Estimating large covariance matrix with network topology for high-dimensional biomedical data'. Together they form a unique fingerprint.

## Cite this

*Computational Statistics and Data Analysis*,

*127*, 82-95. https://doi.org/10.1016/j.csda.2018.05.008