To increase trust, change the social design behind aggregated biodiversity data

Nico Franz; Beckett Sterner

doi:10.1093/database/bax100

Research output: Contribution to journal › Article › peer-review

32 Scopus citations

Abstract

Growing concerns about the quality of aggregated biodiversity data are lowering trust in large-scale data networks. Aggregators frequently respond to quality concerns by recommending that biologists work with original data providers to correct errors 'at the source.' We show that this strategy falls systematically short of a full diagnosis of the underlying causes of distrust. In particular, trust in an aggregator is not just a feature of the data signal quality provided by the sources to the aggregator, but also a consequence of the social design of the aggregation process and the resulting power balance between individual data contributors and aggregators. The latter have created an accountability gap by downplaying the authorship and significance of the taxonomic hierarchies - frequently called 'backbones' - they generate, and which are in effect novel classification theories that operate at the core of the data-structuring process. The Darwin Core standard for sharing occurrence records plays an under-appreciated role in maintaining the accountability gap, because this standard lacks the syntactic structure needed to preserve the taxonomic coherence of data packages submitted for aggregation, potentially leading to inferences that no individual source would support. Since high-quality data packages can mirror competing and conflicting classifications, i.e. unsettled systematic research, this plurality must be accommodated in the design of biodiversity data integration. Looking forward, a key directive is to develop new technical pathways and social incentives for experts to contribute directly to the validation of taxonomically coherent data packages as part of a greater, trustworthy aggregation process.

Original language	English (US)
Journal	Database
Volume	2018
Issue number	2018
DOIs	https://doi.org/10.1093/database/bax100
State	Published - Jan 1 2018

ASJC Scopus subject areas

General Medicine

Cite this

@article{bb1c8c58e4c74878aa8b223fdeceb5c1,

title = "To increase trust, change the social design behind aggregated biodiversity data",

abstract = "Growing concerns about the quality of aggregated biodiversity data are lowering trust in large-scale data networks. Aggregators frequently respond to quality concerns by recommending that biologists work with original data providers to correct errors 'at the source.' We show that this strategy falls systematically short of a full diagnosis of the underlying causes of distrust. In particular, trust in an aggregator is not just a feature of the data signal quality provided by the sources to the aggregator, but also a consequence of the social design of the aggregation process and the resulting power balance between individual data contributors and aggregators. The latter have created an accountability gap by downplaying the authorship and significance of the taxonomic hierarchies - frequently called 'backbones' - they generate, and which are in effect novel classification theories that operate at the core of the data-structuring process. The Darwin Core standard for sharing occurrence records plays an under-appreciated role in maintaining the accountability gap, because this standard lacks the syntactic structure needed to preserve the taxonomic coherence of data packages submitted for aggregation, potentially leading to inferences that no individual source would support. Since high-quality data packages can mirror competing and conflicting classifications, i.e. unsettled systematic research, this plurality must be accommodated in the design of biodiversity data integration. Looking forward, a key directive is to develop new technical pathways and social incentives for experts to contribute directly to the validation of taxonomically coherent data packages as part of a greater, trustworthy aggregation process.",

author = "Nico Franz and Beckett Sterner",

note = "Funding Information: The authors thank the four referees for their constructive and detailed feedback. The authors are also grateful to Erin Barringer-Sterner, Andrew Johnston, Jonathan Rees, David Remsen, David Shorthouse and Guanyang Zhang for helpful discussions on this subject. This work was supported by the National Science Foundation [DEB-1155984, DBI-1342595 (NMF); and SES-1153114 (BWS)]. Publisher Copyright: {\textcopyright} The Author(s) 2017. Published by Oxford University Press.",

year = "2018",

month = jan,

day = "1",

doi = "10.1093/database/bax100",

language = "English (US)",

volume = "2018",

journal = "Database",

issn = "1758-0463",

publisher = "Oxford University Press",

number = "2018",

}

TY - JOUR

T1 - To increase trust, change the social design behind aggregated biodiversity data

AU - Franz, Nico

AU - Sterner, Beckett

N1 - Funding Information: The authors thank the four referees for their constructive and detailed feedback. The authors are also grateful to Erin Barringer-Sterner, Andrew Johnston, Jonathan Rees, David Remsen, David Shorthouse and Guanyang Zhang for helpful discussions on this subject. This work was supported by the National Science Foundation [DEB-1155984, DBI-1342595 (NMF); and SES-1153114 (BWS)]. Publisher Copyright: © The Author(s) 2017. Published by Oxford University Press.

PY - 2018/1/1

Y1 - 2018/1/1

AB - Growing concerns about the quality of aggregated biodiversity data are lowering trust in large-scale data networks. Aggregators frequently respond to quality concerns by recommending that biologists work with original data providers to correct errors 'at the source.' We show that this strategy falls systematically short of a full diagnosis of the underlying causes of distrust. In particular, trust in an aggregator is not just a feature of the data signal quality provided by the sources to the aggregator, but also a consequence of the social design of the aggregation process and the resulting power balance between individual data contributors and aggregators. The latter have created an accountability gap by downplaying the authorship and significance of the taxonomic hierarchies - frequently called 'backbones' - they generate, and which are in effect novel classification theories that operate at the core of the data-structuring process. The Darwin Core standard for sharing occurrence records plays an under-appreciated role in maintaining the accountability gap, because this standard lacks the syntactic structure needed to preserve the taxonomic coherence of data packages submitted for aggregation, potentially leading to inferences that no individual source would support. Since high-quality data packages can mirror competing and conflicting classifications, i.e. unsettled systematic research, this plurality must be accommodated in the design of biodiversity data integration. Looking forward, a key directive is to develop new technical pathways and social incentives for experts to contribute directly to the validation of taxonomically coherent data packages as part of a greater, trustworthy aggregation process.

UR - http://www.scopus.com/inward/record.url?scp=85052946597&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85052946597&partnerID=8YFLogxK

U2 - 10.1093/database/bax100

DO - 10.1093/database/bax100

M3 - Article

C2 - 29315357

AN - SCOPUS:85052946597

SN - 1758-0463

VL - 2018

JO - Database

JF - Database

IS - 2018

ER -
