TY - JOUR
T1 - Socially Privacy-Preserving Data Collection for Crowdsensing
AU - Yang, Guang
AU - Shi, Zhiguo
AU - He, Shibo
AU - Zhang, Junshan
N1 - Funding Information:
Manuscript received June 27, 2019; revised September 6, 2019; accepted October 15, 2019. Date of publication November 1, 2019; date of current version January 15, 2020. This work was supported in part by the National Natural Science Foundation of China under Grant 61672458 and Grant 61772467 and in part by the Natural Science Foundation of Zhejiang Province under Grant LR16F010002. The review of this article was coordinated by Prof. J. Ren. (Corresponding author: Zhiguo Shi.) G. Yang is with the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China (e-mail: yangg2015@zju.edu.cn).
Publisher Copyright:
© 1967-2012 IEEE.
PY - 2020/1
Y1 - 2020/1
N2 - Crowdsensing has been recognized as a promising data collection paradigm, in which a platform outsources sensing tasks to a large number of users. However, requesting users to report raw data may give rise to many practical concerns, such as a significant overhead of communication and central processing, besides users' privacy concerns. In many scenarios (e.g, advertising and recommendation), the data collector directly benefits from statistical aggregation of raw data. Thus motivated, we consider the data collection problem based on user's local histograms, which is intimately related to the fundamental trade-off between the platform's accuracy and users' privacy. Because of users' social relationship, their data are often correlated, indicating that users' privacy may be leaked from others' data. To tackle this challenge, we first utilize Gaussian Markov random fields to model the correlation structure embedded in users' data. The data collection is modeled as a Stackelberg game where the platform decides its reward policy and users decide their noise levels while taking into account the social coupling among users. For the reward policy design, we first establish the relationship between users' Nash equilibrium and the payment mechanism, and then optimize the platform's accuracy under a budget constraint. Further, since the noise levels are users' private information, they may use falsified noise levels to achieve higher payoffs, which in turn impairs the crowdsensing performance. It turns out that with the insight into the correlation structure among users' data, the information asymmetry can be overcome based on peer prediction. We revisit the payment mechanism to guarantee dominant truthfulness of each user's strategy. Theoretical analysis and numerical results demonstrate the effectiveness of the proposed mechanism.
AB - Crowdsensing has been recognized as a promising data collection paradigm, in which a platform outsources sensing tasks to a large number of users. However, requesting users to report raw data may give rise to many practical concerns, such as a significant overhead of communication and central processing, besides users' privacy concerns. In many scenarios (e.g, advertising and recommendation), the data collector directly benefits from statistical aggregation of raw data. Thus motivated, we consider the data collection problem based on user's local histograms, which is intimately related to the fundamental trade-off between the platform's accuracy and users' privacy. Because of users' social relationship, their data are often correlated, indicating that users' privacy may be leaked from others' data. To tackle this challenge, we first utilize Gaussian Markov random fields to model the correlation structure embedded in users' data. The data collection is modeled as a Stackelberg game where the platform decides its reward policy and users decide their noise levels while taking into account the social coupling among users. For the reward policy design, we first establish the relationship between users' Nash equilibrium and the payment mechanism, and then optimize the platform's accuracy under a budget constraint. Further, since the noise levels are users' private information, they may use falsified noise levels to achieve higher payoffs, which in turn impairs the crowdsensing performance. It turns out that with the insight into the correlation structure among users' data, the information asymmetry can be overcome based on peer prediction. We revisit the payment mechanism to guarantee dominant truthfulness of each user's strategy. Theoretical analysis and numerical results demonstrate the effectiveness of the proposed mechanism.
KW - Crowdsensing
KW - data correlation
KW - local histogram
KW - privacy
KW - social relationship
UR - http://www.scopus.com/inward/record.url?scp=85078466094&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078466094&partnerID=8YFLogxK
U2 - 10.1109/TVT.2019.2950907
DO - 10.1109/TVT.2019.2950907
M3 - Article
AN - SCOPUS:85078466094
SN - 0018-9545
VL - 69
SP - 851
EP - 861
JO - IEEE Transactions on Vehicular Communications
JF - IEEE Transactions on Vehicular Communications
IS - 1
M1 - 8889739
ER -