BitTorious: Global controlled genomics data publication, research and archiving via BitTorrent extensions

Preston V. Lee, Valentin Dinu

Research output: Contribution to journal › Article

2 Citations (Scopus)

Abstract

Background: Centralized silos of genomic data are architecturally easier to initially design, develop and deploy than distributed models. However, as interoperability pains in EHR/EMR, HIE and other collaboration-centric life sciences domains have taught us, the core challenge of networking genomics systems is not in the construction of individual silos, but in the interoperability of those deployments in a manner embracing the heterogeneous needs, terms and infrastructure of collaborating parties. This article demonstrates the adaptation of BitTorrent to private collaboration networks in an authenticated, authorized and encrypted manner while retaining the same characteristics of standard BitTorrent. Results: The BitTorious portal was successfully used to manage many concurrent domestic BitTorrent clients across the United States, exchanging genomics data payloads in excess of 500 GiB using the uTorrent client software on Linux, OS X and Windows platforms. Individual nodes were sporadically interrupted to verify the resilience of the system to outages of a single client node as well as recovery of nodes resuming operation on intermittent Internet connections. Conclusions: The authorization-based extension of BitTorrent and accompanying BitTorious reference tracker and user management web portal provide a free, standards-based, general purpose and extensible data distribution system for large -omics collaborations.

Original language: English (US)
Article number: 424
Journal: BMC Bioinformatics
Volume: 15
Issue number: 1
DOIs: 10.1186/s12859-014-0424-9
State: Published - Dec 11 2014

Fingerprint

BitTorrent
Genomics
Interoperability
Publications
Biological Science Disciplines
Research
Information Systems
Outages
Internet
Software
Vertex of a graph
Web Portal
Pain
Recovery
Life sciences
Authorization
Distribution System
Resilience
Data Distribution
Linux

Keywords

  • Big data
  • Bioinformatics
  • BitTorrent
  • Data transfer
  • Networking
  • Open source
  • Software

ASJC Scopus subject areas

  • Applied Mathematics
  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications

Cite this

BitTorious: Global controlled genomics data publication, research and archiving via BitTorrent extensions. / Lee, Preston V.; Dinu, Valentin.

In: BMC Bioinformatics, Vol. 15, No. 1, 424, 11.12.2014.


@article{94c227c0ac6f445d85fdc883c6439c58,
title = "BitTorious: Global controlled genomics data publication, research and archiving via BitTorrent extensions",
abstract = "Background: Centralized silos of genomic data are architecturally easier to initially design, develop and deploy than distributed models. However, as interoperability pains in EHR/EMR, HIE and other collaboration-centric life sciences domains have taught us, the core challenge of networking genomics systems is not in the construction of individual silos, but in the interoperability of those deployments in a manner embracing the heterogeneous needs, terms and infrastructure of collaborating parties. This article demonstrates the adaptation of BitTorrent to private collaboration networks in an authenticated, authorized and encrypted manner while retaining the same characteristics of standard BitTorrent. Results: The BitTorious portal was successfully used to manage many concurrent domestic BitTorrent clients across the United States, exchanging genomics data payloads in excess of 500 GiB using the uTorrent client software on Linux, OS X and Windows platforms. Individual nodes were sporadically interrupted to verify the resilience of the system to outages of a single client node as well as recovery of nodes resuming operation on intermittent Internet connections. Conclusions: The authorization-based extension of BitTorrent and accompanying BitTorious reference tracker and user management web portal provide a free, standards-based, general purpose and extensible data distribution system for large -omics collaborations.",
keywords = "Big data, Bioinformatics, BitTorrent, Data transfer, Networking, Open source, Software",
author = "Lee, {Preston V.} and Dinu, Valentin",
year = "2014",
month = "12",
day = "11",
doi = "10.1186/s12859-014-0424-9",
language = "English (US)",
volume = "15",
journal = "BMC Bioinformatics",
issn = "1471-2105",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR
T1 - BitTorious
T2 - Global controlled genomics data publication, research and archiving via BitTorrent extensions
AU - Lee, Preston V.
AU - Dinu, Valentin
PY - 2014/12/11
Y1 - 2014/12/11
N2 - Background: Centralized silos of genomic data are architecturally easier to initially design, develop and deploy than distributed models. However, as interoperability pains in EHR/EMR, HIE and other collaboration-centric life sciences domains have taught us, the core challenge of networking genomics systems is not in the construction of individual silos, but in the interoperability of those deployments in a manner embracing the heterogeneous needs, terms and infrastructure of collaborating parties. This article demonstrates the adaptation of BitTorrent to private collaboration networks in an authenticated, authorized and encrypted manner while retaining the same characteristics of standard BitTorrent. Results: The BitTorious portal was successfully used to manage many concurrent domestic BitTorrent clients across the United States, exchanging genomics data payloads in excess of 500 GiB using the uTorrent client software on Linux, OS X and Windows platforms. Individual nodes were sporadically interrupted to verify the resilience of the system to outages of a single client node as well as recovery of nodes resuming operation on intermittent Internet connections. Conclusions: The authorization-based extension of BitTorrent and accompanying BitTorious reference tracker and user management web portal provide a free, standards-based, general purpose and extensible data distribution system for large -omics collaborations.
AB - Background: Centralized silos of genomic data are architecturally easier to initially design, develop and deploy than distributed models. However, as interoperability pains in EHR/EMR, HIE and other collaboration-centric life sciences domains have taught us, the core challenge of networking genomics systems is not in the construction of individual silos, but in the interoperability of those deployments in a manner embracing the heterogeneous needs, terms and infrastructure of collaborating parties. This article demonstrates the adaptation of BitTorrent to private collaboration networks in an authenticated, authorized and encrypted manner while retaining the same characteristics of standard BitTorrent. Results: The BitTorious portal was successfully used to manage many concurrent domestic BitTorrent clients across the United States, exchanging genomics data payloads in excess of 500 GiB using the uTorrent client software on Linux, OS X and Windows platforms. Individual nodes were sporadically interrupted to verify the resilience of the system to outages of a single client node as well as recovery of nodes resuming operation on intermittent Internet connections. Conclusions: The authorization-based extension of BitTorrent and accompanying BitTorious reference tracker and user management web portal provide a free, standards-based, general purpose and extensible data distribution system for large -omics collaborations.
KW - Big data
KW - Bioinformatics
KW - BitTorrent
KW - Data transfer
KW - Networking
KW - Open source
KW - Software
UR - http://www.scopus.com/inward/record.url?scp=84924044565&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84924044565&partnerID=8YFLogxK
U2 - 10.1186/s12859-014-0424-9
DO - 10.1186/s12859-014-0424-9
M3 - Article
C2 - 25528455
AN - SCOPUS:84924044565
VL - 15
JO - BMC Bioinformatics
JF - BMC Bioinformatics
SN - 1471-2105
IS - 1
M1 - 424
ER -