TY - JOUR
T1 - SHOGUN
T2 - A modular, accurate and scalable framework for microbiome quantification
AU - Hillmann, Benjamin
AU - Al-Ghalith, Gabriel A.
AU - Shields-Cutler, Robin R.
AU - Zhu, Qiyun
AU - Knight, Rob
AU - Knights, Dan
N1 - Funding Information:
This work was supported by the University of Minnesota Masonic Cancer Center (to D.K.), National Institutes of Health grant R01AI121383 (to D.K.), grant P01DK078669 and in-kind support from National Science Foundation 1565057 and the Alfred P. Sloan Foundation 2017–9838 (to R.K., QIIME and QIITA) and by the Masonic Cancer Center at the University of Minnesota.
Publisher Copyright:
© 2020 The Author(s). Published by Oxford University Press. All rights reserved.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - The software pipeline SHOGUN profiles known taxonomic and gene abundances of short-read shotgun metagenomics sequencing data. The pipeline is scalable, modular and flexible. Data analysis and transformation steps can be run individually or together in an automated workflow. Users can easily create new reference databases and can select one of three DNA alignment tools, ranging from ultra-fast low-RAM k-mer-based database search to fully exhaustive gapped DNA alignment, to best fit their analysis needs and computational resources. The pipeline includes an implementation of a published method for taxonomy assignment disambiguation with empirical Bayesian redistribution. The software is installable via the conda resource management framework, has plugins for the QIIME2 and QIITA packages and produces both taxonomy and gene abundance profile tables with a single command, thus promoting convenient and reproducible metagenomics research.
AB - The software pipeline SHOGUN profiles known taxonomic and gene abundances of short-read shotgun metagenomics sequencing data. The pipeline is scalable, modular and flexible. Data analysis and transformation steps can be run individually or together in an automated workflow. Users can easily create new reference databases and can select one of three DNA alignment tools, ranging from ultra-fast low-RAM k-mer-based database search to fully exhaustive gapped DNA alignment, to best fit their analysis needs and computational resources. The pipeline includes an implementation of a published method for taxonomy assignment disambiguation with empirical Bayesian redistribution. The software is installable via the conda resource management framework, has plugins for the QIIME2 and QIITA packages and produces both taxonomy and gene abundance profile tables with a single command, thus promoting convenient and reproducible metagenomics research.
UR - http://www.scopus.com/inward/record.url?scp=85087531287&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85087531287&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btaa277
DO - 10.1093/bioinformatics/btaa277
M3 - Article
C2 - 32365167
AN - SCOPUS:85087531287
SN - 1367-4803
VL - 36
SP - 4088
EP - 4090
JO - Bioinformatics
JF - Bioinformatics
IS - 13
ER -