Bioinformatics software for biologists in the genomics era

Sudhir Kumar; Joel Dudley

doi:10.1093/bioinformatics/btm239

Bioinformatics software for biologists in the genomics era

Sudhir Kumar, Joel Dudley

Life Sciences, School of (SOLS)

Research output: Contribution to journal › Review article › peer-review

59 Scopus citations

Abstract

Motivation: The genome sequencing revolution is approaching a landmark figure of 1000 completely sequenced genomes. Coupled with fast-declining, per-base sequencing costs, this influx of DNA sequence data has encouraged laboratory scientists to engage large datasets in comparative sequence analyses for making evolutionary, functional and translational inferences. However, the majority of the scientists at the forefront of experimental research are not bioinformaticians, so a gap exists between the user-friendly software needed and the scripting/ programming infrastructure often employed for the analysis of large numbers of genes, long genomic segments and groups of sequences. We see an urgent need for the expansion of the fundamental paradigms under which biologist-friendly software tools are designed and developed to fulfill the needs of biologists to analyze large datasets by using sophisticated computational methods. We argue that the design principles need to be sensitive to the reality that comparatively small teams of biologists have historically developed some of the most popular biological software packages in molecular evolutionary analysis. Furthermore, biological intuitiveness and investigator empowerment need to take precedence over the current supposition that biologists should re-tool and become programmers when analyzing genome scale datasets.

Original language	English (US)
Pages (from-to)	1713-1717
Number of pages	5
Journal	Bioinformatics
Volume	23
Issue number	14
DOIs	https://doi.org/10.1093/bioinformatics/btm239
State	Published - Jul 15 2007

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

Access to Document

10.1093/bioinformatics/btm239

Cite this

@article{29d20180e0e94016bb77cbd27ba1b8d5,

title = "Bioinformatics software for biologists in the genomics era",

abstract = "Motivation: The genome sequencing revolution is approaching a landmark figure of 1000 completely sequenced genomes. Coupled with fast-declining, per-base sequencing costs, this influx of DNA sequence data has encouraged laboratory scientists to engage large datasets in comparative sequence analyses for making evolutionary, functional and translational inferences. However, the majority of the scientists at the forefront of experimental research are not bioinformaticians, so a gap exists between the user-friendly software needed and the scripting/ programming infrastructure often employed for the analysis of large numbers of genes, long genomic segments and groups of sequences. We see an urgent need for the expansion of the fundamental paradigms under which biologist-friendly software tools are designed and developed to fulfill the needs of biologists to analyze large datasets by using sophisticated computational methods. We argue that the design principles need to be sensitive to the reality that comparatively small teams of biologists have historically developed some of the most popular biological software packages in molecular evolutionary analysis. Furthermore, biological intuitiveness and investigator empowerment need to take precedence over the current supposition that biologists should re-tool and become programmers when analyzing genome scale datasets.",

author = "Sudhir Kumar and Joel Dudley",

note = "Funding Information: packages in Figure 1B. We also thank Ms Kristi Garboushian for editorial support and Ashly Ruttman for help with generating figures. This work was supported in part by a research grant from National Institutes of Health (S.K.).",

year = "2007",

month = jul,

day = "15",

doi = "10.1093/bioinformatics/btm239",

language = "English (US)",

volume = "23",

pages = "1713--1717",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "14",

}

TY - JOUR

T1 - Bioinformatics software for biologists in the genomics era

AU - Kumar, Sudhir

AU - Dudley, Joel

N1 - Funding Information: packages in Figure 1B. We also thank Ms Kristi Garboushian for editorial support and Ashly Ruttman for help with generating figures. This work was supported in part by a research grant from National Institutes of Health (S.K.).

PY - 2007/7/15

Y1 - 2007/7/15

N2 - Motivation: The genome sequencing revolution is approaching a landmark figure of 1000 completely sequenced genomes. Coupled with fast-declining, per-base sequencing costs, this influx of DNA sequence data has encouraged laboratory scientists to engage large datasets in comparative sequence analyses for making evolutionary, functional and translational inferences. However, the majority of the scientists at the forefront of experimental research are not bioinformaticians, so a gap exists between the user-friendly software needed and the scripting/ programming infrastructure often employed for the analysis of large numbers of genes, long genomic segments and groups of sequences. We see an urgent need for the expansion of the fundamental paradigms under which biologist-friendly software tools are designed and developed to fulfill the needs of biologists to analyze large datasets by using sophisticated computational methods. We argue that the design principles need to be sensitive to the reality that comparatively small teams of biologists have historically developed some of the most popular biological software packages in molecular evolutionary analysis. Furthermore, biological intuitiveness and investigator empowerment need to take precedence over the current supposition that biologists should re-tool and become programmers when analyzing genome scale datasets.

AB - Motivation: The genome sequencing revolution is approaching a landmark figure of 1000 completely sequenced genomes. Coupled with fast-declining, per-base sequencing costs, this influx of DNA sequence data has encouraged laboratory scientists to engage large datasets in comparative sequence analyses for making evolutionary, functional and translational inferences. However, the majority of the scientists at the forefront of experimental research are not bioinformaticians, so a gap exists between the user-friendly software needed and the scripting/ programming infrastructure often employed for the analysis of large numbers of genes, long genomic segments and groups of sequences. We see an urgent need for the expansion of the fundamental paradigms under which biologist-friendly software tools are designed and developed to fulfill the needs of biologists to analyze large datasets by using sophisticated computational methods. We argue that the design principles need to be sensitive to the reality that comparatively small teams of biologists have historically developed some of the most popular biological software packages in molecular evolutionary analysis. Furthermore, biological intuitiveness and investigator empowerment need to take precedence over the current supposition that biologists should re-tool and become programmers when analyzing genome scale datasets.

UR - http://www.scopus.com/inward/record.url?scp=34547763724&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34547763724&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btm239

DO - 10.1093/bioinformatics/btm239

M3 - Review article

C2 - 17485425

AN - SCOPUS:34547763724

SN - 1367-4803

VL - 23

SP - 1713

EP - 1717

JO - Bioinformatics

JF - Bioinformatics

IS - 14

ER -

Bioinformatics software for biologists in the genomics era

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this