Socially Privacy-Preserving Data Collection for Crowdsensing

Guang Yang; Zhiguo Shi; Shibo He; Junshan Zhang

doi:10.1109/TVT.2019.2950907

Socially Privacy-Preserving Data Collection for Crowdsensing

Guang Yang, Zhiguo Shi, Shibo He, Junshan Zhang

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

Crowdsensing has been recognized as a promising data collection paradigm, in which a platform outsources sensing tasks to a large number of users. However, requesting users to report raw data may give rise to many practical concerns, such as a significant overhead of communication and central processing, besides users' privacy concerns. In many scenarios (e.g, advertising and recommendation), the data collector directly benefits from statistical aggregation of raw data. Thus motivated, we consider the data collection problem based on user's local histograms, which is intimately related to the fundamental trade-off between the platform's accuracy and users' privacy. Because of users' social relationship, their data are often correlated, indicating that users' privacy may be leaked from others' data. To tackle this challenge, we first utilize Gaussian Markov random fields to model the correlation structure embedded in users' data. The data collection is modeled as a Stackelberg game where the platform decides its reward policy and users decide their noise levels while taking into account the social coupling among users. For the reward policy design, we first establish the relationship between users' Nash equilibrium and the payment mechanism, and then optimize the platform's accuracy under a budget constraint. Further, since the noise levels are users' private information, they may use falsified noise levels to achieve higher payoffs, which in turn impairs the crowdsensing performance. It turns out that with the insight into the correlation structure among users' data, the information asymmetry can be overcome based on peer prediction. We revisit the payment mechanism to guarantee dominant truthfulness of each user's strategy. Theoretical analysis and numerical results demonstrate the effectiveness of the proposed mechanism.

Original language	English (US)
Article number	8889739
Pages (from-to)	851-861
Number of pages	11
Journal	IEEE Transactions on Vehicular Technology
Volume	69
Issue number	1
DOIs	https://doi.org/10.1109/TVT.2019.2950907
State	Published - Jan 2020

Keywords

Crowdsensing
data correlation
local histogram
privacy
social relationship

ASJC Scopus subject areas

Automotive Engineering
Aerospace Engineering
Electrical and Electronic Engineering
Applied Mathematics

Access to Document

10.1109/TVT.2019.2950907

Cite this

@article{f8e546c30f8b441fb88efc3b80bbc0e0,

title = "Socially Privacy-Preserving Data Collection for Crowdsensing",

abstract = "Crowdsensing has been recognized as a promising data collection paradigm, in which a platform outsources sensing tasks to a large number of users. However, requesting users to report raw data may give rise to many practical concerns, such as a significant overhead of communication and central processing, besides users' privacy concerns. In many scenarios (e.g, advertising and recommendation), the data collector directly benefits from statistical aggregation of raw data. Thus motivated, we consider the data collection problem based on user's local histograms, which is intimately related to the fundamental trade-off between the platform's accuracy and users' privacy. Because of users' social relationship, their data are often correlated, indicating that users' privacy may be leaked from others' data. To tackle this challenge, we first utilize Gaussian Markov random fields to model the correlation structure embedded in users' data. The data collection is modeled as a Stackelberg game where the platform decides its reward policy and users decide their noise levels while taking into account the social coupling among users. For the reward policy design, we first establish the relationship between users' Nash equilibrium and the payment mechanism, and then optimize the platform's accuracy under a budget constraint. Further, since the noise levels are users' private information, they may use falsified noise levels to achieve higher payoffs, which in turn impairs the crowdsensing performance. It turns out that with the insight into the correlation structure among users' data, the information asymmetry can be overcome based on peer prediction. We revisit the payment mechanism to guarantee dominant truthfulness of each user's strategy. Theoretical analysis and numerical results demonstrate the effectiveness of the proposed mechanism.",

keywords = "Crowdsensing, data correlation, local histogram, privacy, social relationship",

author = "Guang Yang and Zhiguo Shi and Shibo He and Junshan Zhang",

note = "Funding Information: Manuscript received June 27, 2019; revised September 6, 2019; accepted October 15, 2019. Date of publication November 1, 2019; date of current version January 15, 2020. This work was supported in part by the National Natural Science Foundation of China under Grant 61672458 and Grant 61772467 and in part by the Natural Science Foundation of Zhejiang Province under Grant LR16F010002. The review of this article was coordinated by Prof. J. Ren. (Corresponding author: Zhiguo Shi.) G. Yang is with the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China (e-mail: yangg2015@zju.edu.cn). Publisher Copyright: {\textcopyright} 1967-2012 IEEE.",

year = "2020",

month = jan,

doi = "10.1109/TVT.2019.2950907",

language = "English (US)",

volume = "69",

pages = "851--861",

journal = "IEEE Transactions on Vehicular Technology",

issn = "0018-9545",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "1",

}

TY - JOUR

T1 - Socially Privacy-Preserving Data Collection for Crowdsensing

AU - Yang, Guang

AU - Shi, Zhiguo

AU - He, Shibo

AU - Zhang, Junshan

N1 - Funding Information: Manuscript received June 27, 2019; revised September 6, 2019; accepted October 15, 2019. Date of publication November 1, 2019; date of current version January 15, 2020. This work was supported in part by the National Natural Science Foundation of China under Grant 61672458 and Grant 61772467 and in part by the Natural Science Foundation of Zhejiang Province under Grant LR16F010002. The review of this article was coordinated by Prof. J. Ren. (Corresponding author: Zhiguo Shi.) G. Yang is with the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China (e-mail: yangg2015@zju.edu.cn). Publisher Copyright: © 1967-2012 IEEE.

PY - 2020/1

Y1 - 2020/1

N2 - Crowdsensing has been recognized as a promising data collection paradigm, in which a platform outsources sensing tasks to a large number of users. However, requesting users to report raw data may give rise to many practical concerns, such as a significant overhead of communication and central processing, besides users' privacy concerns. In many scenarios (e.g, advertising and recommendation), the data collector directly benefits from statistical aggregation of raw data. Thus motivated, we consider the data collection problem based on user's local histograms, which is intimately related to the fundamental trade-off between the platform's accuracy and users' privacy. Because of users' social relationship, their data are often correlated, indicating that users' privacy may be leaked from others' data. To tackle this challenge, we first utilize Gaussian Markov random fields to model the correlation structure embedded in users' data. The data collection is modeled as a Stackelberg game where the platform decides its reward policy and users decide their noise levels while taking into account the social coupling among users. For the reward policy design, we first establish the relationship between users' Nash equilibrium and the payment mechanism, and then optimize the platform's accuracy under a budget constraint. Further, since the noise levels are users' private information, they may use falsified noise levels to achieve higher payoffs, which in turn impairs the crowdsensing performance. It turns out that with the insight into the correlation structure among users' data, the information asymmetry can be overcome based on peer prediction. We revisit the payment mechanism to guarantee dominant truthfulness of each user's strategy. Theoretical analysis and numerical results demonstrate the effectiveness of the proposed mechanism.

AB - Crowdsensing has been recognized as a promising data collection paradigm, in which a platform outsources sensing tasks to a large number of users. However, requesting users to report raw data may give rise to many practical concerns, such as a significant overhead of communication and central processing, besides users' privacy concerns. In many scenarios (e.g, advertising and recommendation), the data collector directly benefits from statistical aggregation of raw data. Thus motivated, we consider the data collection problem based on user's local histograms, which is intimately related to the fundamental trade-off between the platform's accuracy and users' privacy. Because of users' social relationship, their data are often correlated, indicating that users' privacy may be leaked from others' data. To tackle this challenge, we first utilize Gaussian Markov random fields to model the correlation structure embedded in users' data. The data collection is modeled as a Stackelberg game where the platform decides its reward policy and users decide their noise levels while taking into account the social coupling among users. For the reward policy design, we first establish the relationship between users' Nash equilibrium and the payment mechanism, and then optimize the platform's accuracy under a budget constraint. Further, since the noise levels are users' private information, they may use falsified noise levels to achieve higher payoffs, which in turn impairs the crowdsensing performance. It turns out that with the insight into the correlation structure among users' data, the information asymmetry can be overcome based on peer prediction. We revisit the payment mechanism to guarantee dominant truthfulness of each user's strategy. Theoretical analysis and numerical results demonstrate the effectiveness of the proposed mechanism.

KW - Crowdsensing

KW - data correlation

KW - local histogram

KW - privacy

KW - social relationship

UR - http://www.scopus.com/inward/record.url?scp=85078466094&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85078466094&partnerID=8YFLogxK

U2 - 10.1109/TVT.2019.2950907

DO - 10.1109/TVT.2019.2950907

M3 - Article

AN - SCOPUS:85078466094

SN - 0018-9545

VL - 69

SP - 851

EP - 861

JO - IEEE Transactions on Vehicular Technology

JF - IEEE Transactions on Vehicular Technology

IS - 1

M1 - 8889739

ER -

Socially Privacy-Preserving Data Collection for Crowdsensing

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this