cMonkey2: Automated, systematic, integrated detection of co-regulated gene modules for any organism

David J. Reiss; Christopher L. Plaisier; Wei Ju Wu; Nitin S. Baliga

doi:10.1093/nar/gkv300

cMonkey₂: Automated, systematic, integrated detection of co-regulated gene modules for any organism

David J. Reiss, Christopher L. Plaisier, Wei Ju Wu, Nitin S. Baliga

Research output: Contribution to journal › Review article › peer-review

30 Scopus citations

Abstract

The cMonkey integrated biclustering algorithm identifies conditionally co-regulated modules of genes (biclusters). cMonkey integrates various orthogonal pieces of information which support evidence of gene co-regulation, and optimizes biclusters to be supported simultaneously by one or more of these prior constraints. The algorithm served as the cornerstone for constructing the first global, predictive Environmental Gene Regulatory Influence Network (EGRIN) model for a free-living cell, and has now been applied to many more organisms. However, due to its computational inefficiencies, long run-time and complexity of various input data types, cMonkey was not readily usable by the wider community. To address these primary concerns, we have significantly updated the cMonkey algorithm and refactored its implementation, improving its usability and extendibility. These improvements provide a fully functioning and user-friendly platform for building co-regulated gene modules and the tools necessary for their exploration and interpretation. We show, via three separate analyses of data for E.coli, M. tuberculosis and H.sapiens, that the updated algorithm and inclusion of novel scoring functions for new data types (e.g. ChIP-seq and transcription factor over-expression [TFOE]) improve discovery of biologically informative co-regulated modules. The complete cMonkey₂ software package, including source code, is available at https://github.com/baliga-lab/cmonkey2.

Original language	English (US)
Pages (from-to)	e87
Journal	Nucleic acids research
Volume	43
Issue number	13
DOIs	https://doi.org/10.1093/nar/gkv300
State	Published - Mar 26 2015
Externally published	Yes

ASJC Scopus subject areas

Genetics

Access to Document

10.1093/nar/gkv300

Cite this

@article{52909052c5264e5998ba4fdc2040ab27,

title = "cMonkey2: Automated, systematic, integrated detection of co-regulated gene modules for any organism",

abstract = "The cMonkey integrated biclustering algorithm identifies conditionally co-regulated modules of genes (biclusters). cMonkey integrates various orthogonal pieces of information which support evidence of gene co-regulation, and optimizes biclusters to be supported simultaneously by one or more of these prior constraints. The algorithm served as the cornerstone for constructing the first global, predictive Environmental Gene Regulatory Influence Network (EGRIN) model for a free-living cell, and has now been applied to many more organisms. However, due to its computational inefficiencies, long run-time and complexity of various input data types, cMonkey was not readily usable by the wider community. To address these primary concerns, we have significantly updated the cMonkey algorithm and refactored its implementation, improving its usability and extendibility. These improvements provide a fully functioning and user-friendly platform for building co-regulated gene modules and the tools necessary for their exploration and interpretation. We show, via three separate analyses of data for E.coli, M. tuberculosis and H.sapiens, that the updated algorithm and inclusion of novel scoring functions for new data types (e.g. ChIP-seq and transcription factor over-expression [TFOE]) improve discovery of biologically informative co-regulated modules. The complete cMonkey2 software package, including source code, is available at https://github.com/baliga-lab/cmonkey2.",

author = "Reiss, {David J.} and Plaisier, {Christopher L.} and Wu, {Wei Ju} and Baliga, {Nitin S.}",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.",

year = "2015",

month = mar,

day = "26",

doi = "10.1093/nar/gkv300",

language = "English (US)",

volume = "43",

pages = "e87",

journal = "Nucleic acids research",

issn = "0305-1048",

publisher = "Oxford University Press",

number = "13",

}

TY - JOUR

T1 - cMonkey2

T2 - Automated, systematic, integrated detection of co-regulated gene modules for any organism

AU - Reiss, David J.

AU - Plaisier, Christopher L.

AU - Wu, Wei Ju

AU - Baliga, Nitin S.

N1 - Publisher Copyright: © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

PY - 2015/3/26

Y1 - 2015/3/26

N2 - The cMonkey integrated biclustering algorithm identifies conditionally co-regulated modules of genes (biclusters). cMonkey integrates various orthogonal pieces of information which support evidence of gene co-regulation, and optimizes biclusters to be supported simultaneously by one or more of these prior constraints. The algorithm served as the cornerstone for constructing the first global, predictive Environmental Gene Regulatory Influence Network (EGRIN) model for a free-living cell, and has now been applied to many more organisms. However, due to its computational inefficiencies, long run-time and complexity of various input data types, cMonkey was not readily usable by the wider community. To address these primary concerns, we have significantly updated the cMonkey algorithm and refactored its implementation, improving its usability and extendibility. These improvements provide a fully functioning and user-friendly platform for building co-regulated gene modules and the tools necessary for their exploration and interpretation. We show, via three separate analyses of data for E.coli, M. tuberculosis and H.sapiens, that the updated algorithm and inclusion of novel scoring functions for new data types (e.g. ChIP-seq and transcription factor over-expression [TFOE]) improve discovery of biologically informative co-regulated modules. The complete cMonkey2 software package, including source code, is available at https://github.com/baliga-lab/cmonkey2.

AB - The cMonkey integrated biclustering algorithm identifies conditionally co-regulated modules of genes (biclusters). cMonkey integrates various orthogonal pieces of information which support evidence of gene co-regulation, and optimizes biclusters to be supported simultaneously by one or more of these prior constraints. The algorithm served as the cornerstone for constructing the first global, predictive Environmental Gene Regulatory Influence Network (EGRIN) model for a free-living cell, and has now been applied to many more organisms. However, due to its computational inefficiencies, long run-time and complexity of various input data types, cMonkey was not readily usable by the wider community. To address these primary concerns, we have significantly updated the cMonkey algorithm and refactored its implementation, improving its usability and extendibility. These improvements provide a fully functioning and user-friendly platform for building co-regulated gene modules and the tools necessary for their exploration and interpretation. We show, via three separate analyses of data for E.coli, M. tuberculosis and H.sapiens, that the updated algorithm and inclusion of novel scoring functions for new data types (e.g. ChIP-seq and transcription factor over-expression [TFOE]) improve discovery of biologically informative co-regulated modules. The complete cMonkey2 software package, including source code, is available at https://github.com/baliga-lab/cmonkey2.

UR - http://www.scopus.com/inward/record.url?scp=84939604164&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84939604164&partnerID=8YFLogxK

U2 - 10.1093/nar/gkv300

DO - 10.1093/nar/gkv300

M3 - Review article

C2 - 25873626

AN - SCOPUS:84939604164

SN - 0305-1048

VL - 43

SP - e87

JO - Nucleic acids research

JF - Nucleic acids research

IS - 13

ER -

cMonkey₂: Automated, systematic, integrated detection of co-regulated gene modules for any organism

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this