GeoBoost: Accelerating research involving the geospatial metadata of virus GenBank records

Tasnia Tahsin, Davy Weissenbacher, Karen O'Connor, Arjun Magge, Matthew Scotch, Graciela Gonzalez-Hernandez

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

GeoBoost is a command-line software package developed to address sparse or incomplete metadata in GenBank sequence records concerning the location of the infected host (LOIH) of viruses. Given a set of GenBank accession numbers corresponding to virus GenBank records, GeoBoost extracts, integrates and normalizes geographic information reflecting the LOIH of the viruses, drawing on both GenBank metadata and related full-text publications. In addition, to facilitate probabilistic geospatial modeling, GeoBoost assigns a probability score to each possible LOIH.

Availability and implementation: Binaries and resources required for running GeoBoost are packaged in a single zipped file, freely available for download at https://tinyurl.com/geoboost. A video tutorial is included to help users install and run the software. The software is implemented in Java 1.8 and is supported on MS Windows and Linux platforms.

Contact: gragon@upenn.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

Original language: English (US)
Pages (from-to): 1606-1608
Number of pages: 3
Journal: Bioinformatics
Volume: 34
Issue number: 9
State: Published - May 1 2018

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics
