Directionally dependent multi-view clustering using copula model

Kahkashan Afrin; Ashif S. Iquebal; Mostafa Karimi; Allyson Souris; Se Yoon Lee; Bani K. Mallick

doi:10.1371/journal.pone.0238996

Directionally dependent multi-view clustering using copula model

Kahkashan Afrin, Ashif S. Iquebal, Mostafa Karimi, Allyson Souris, Se Yoon Lee, Bani K. Mallick

Research output: Contribution to journal › Article › peer-review

Abstract

Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results.

Original language	English (US)
Article number	e0238996
Journal	PloS one
Volume	15
Issue number	10 October
DOIs	https://doi.org/10.1371/journal.pone.0238996
State	Published - Oct 2020
Externally published	Yes

ASJC Scopus subject areas

General

Access to Document

10.1371/journal.pone.0238996

Cite this

@article{405ca0ef20784dd0ab73b10489a998b1,

title = "Directionally dependent multi-view clustering using copula model",

abstract = "Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results.",

author = "Kahkashan Afrin and Iquebal, {Ashif S.} and Mostafa Karimi and Allyson Souris and Lee, {Se Yoon} and Mallick, {Bani K.}",

note = "Publisher Copyright: {\textcopyright} 2020 Afrin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.",

year = "2020",

month = oct,

doi = "10.1371/journal.pone.0238996",

language = "English (US)",

volume = "15",

journal = "PloS one",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "10 October",

}

TY - JOUR

T1 - Directionally dependent multi-view clustering using copula model

AU - Afrin, Kahkashan

AU - Iquebal, Ashif S.

AU - Karimi, Mostafa

AU - Souris, Allyson

AU - Lee, Se Yoon

AU - Mallick, Bani K.

N1 - Publisher Copyright: © 2020 Afrin et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PY - 2020/10

Y1 - 2020/10

N2 - Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results.

AB - Recent developments in high-throughput methods have resulted in the collection of high-dimensional data types from multiple sources and technologies that measure distinct yet complementary information. Integrated clustering of such multiple data types or multi-view clustering is critical for revealing pathological insights. However, multi-view clustering is challenging due to the complex dependence structure between multiple data types, including directional dependency. Specifically, genomics data types have pre-specified directional dependencies known as the central dogma that describes the process of information flow from DNA to messenger RNA (mRNA) and then from mRNA to protein. Most of the existing multi-view clustering approaches assume an independent structure or pair-wise (non-directional) dependence between data types, thereby ignoring their directional relationship. Motivated by this, we propose a biology-inspired Bayesian integrated multi-view clustering model that uses an asymmetric copula to accommodate the directional dependencies between the data types. Via extensive simulation experiments, we demonstrate the negative impact of ignoring directional dependency on clustering performance. We also present an application of our model to a real-world dataset of breast cancer tumor samples collected from The Cancer Genome Altas program and provide comparative results.

UR - http://www.scopus.com/inward/record.url?scp=85094559465&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85094559465&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0238996

DO - 10.1371/journal.pone.0238996

M3 - Article

C2 - 33095785

AN - SCOPUS:85094559465

SN - 1932-6203

VL - 15

JO - PloS one

JF - PloS one

IS - 10 October

M1 - e0238996

ER -

Directionally dependent multi-view clustering using copula model

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this