Privacy-Preserving Social Media Data Outsourcing

Jinxue Zhang, Jingchao Sun, Rui Zhang, Yanchao Zhang, Xia Hu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

User-generated social media data are exploding and of high demand in public and private sectors. The disclosure of intact social media data exacerbates the threats to user privacy. In this paper, we first identify a text-based user-linkage attack on current data outsourcing practices, in which the real users in an anonymized dataset can be pinpointed based on the users' unprotected text data. Then we propose a framework for differentially privacy-preserving social media data outsourcing for the first time in literature. Within our framework, social media data service providers can outsource perturbed datasets to provide users differential privacy while offering high data utility to social media data consumers. Our differential privacy mechanism is based on a novel notion of E - text indistinguishability, which we propose to thwart the text-based user-linkage attack. Extensive experiments on real-world and synthetic datasets confirm that our framework can enable high-level differential privacy protection and also high data utility.

Original languageEnglish (US)
Title of host publicationINFOCOM 2018 - IEEE Conference on Computer Communications
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1106-1114
Number of pages9
Volume2018-April
ISBN (Electronic)9781538641286
DOIs
StatePublished - Oct 8 2018
Event2018 IEEE Conference on Computer Communications, INFOCOM 2018 - Honolulu, United States
Duration: Apr 15 2018Apr 19 2018

Other

Other2018 IEEE Conference on Computer Communications, INFOCOM 2018
CountryUnited States
CityHonolulu
Period4/15/184/19/18

Fingerprint

Outsourcing
Experiments

ASJC Scopus subject areas

  • Computer Science(all)
  • Electrical and Electronic Engineering

Cite this

Zhang, J., Sun, J., Zhang, R., Zhang, Y., & Hu, X. (2018). Privacy-Preserving Social Media Data Outsourcing. In INFOCOM 2018 - IEEE Conference on Computer Communications (Vol. 2018-April, pp. 1106-1114). [8486242] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/INFOCOM.2018.8486242

Privacy-Preserving Social Media Data Outsourcing. / Zhang, Jinxue; Sun, Jingchao; Zhang, Rui; Zhang, Yanchao; Hu, Xia.

INFOCOM 2018 - IEEE Conference on Computer Communications. Vol. 2018-April Institute of Electrical and Electronics Engineers Inc., 2018. p. 1106-1114 8486242.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zhang, J, Sun, J, Zhang, R, Zhang, Y & Hu, X 2018, Privacy-Preserving Social Media Data Outsourcing. in INFOCOM 2018 - IEEE Conference on Computer Communications. vol. 2018-April, 8486242, Institute of Electrical and Electronics Engineers Inc., pp. 1106-1114, 2018 IEEE Conference on Computer Communications, INFOCOM 2018, Honolulu, United States, 4/15/18. https://doi.org/10.1109/INFOCOM.2018.8486242
Zhang J, Sun J, Zhang R, Zhang Y, Hu X. Privacy-Preserving Social Media Data Outsourcing. In INFOCOM 2018 - IEEE Conference on Computer Communications. Vol. 2018-April. Institute of Electrical and Electronics Engineers Inc. 2018. p. 1106-1114. 8486242 https://doi.org/10.1109/INFOCOM.2018.8486242
Zhang, Jinxue ; Sun, Jingchao ; Zhang, Rui ; Zhang, Yanchao ; Hu, Xia. / Privacy-Preserving Social Media Data Outsourcing. INFOCOM 2018 - IEEE Conference on Computer Communications. Vol. 2018-April Institute of Electrical and Electronics Engineers Inc., 2018. pp. 1106-1114
@inproceedings{fd948dd1328648209c252fc9a1a33ab3,
title = "Privacy-Preserving Social Media Data Outsourcing",
abstract = "User-generated social media data are exploding and of high demand in public and private sectors. The disclosure of intact social media data exacerbates the threats to user privacy. In this paper, we first identify a text-based user-linkage attack on current data outsourcing practices, in which the real users in an anonymized dataset can be pinpointed based on the users' unprotected text data. Then we propose a framework for differentially privacy-preserving social media data outsourcing for the first time in literature. Within our framework, social media data service providers can outsource perturbed datasets to provide users differential privacy while offering high data utility to social media data consumers. Our differential privacy mechanism is based on a novel notion of E - text indistinguishability, which we propose to thwart the text-based user-linkage attack. Extensive experiments on real-world and synthetic datasets confirm that our framework can enable high-level differential privacy protection and also high data utility.",
author = "Jinxue Zhang and Jingchao Sun and Rui Zhang and Yanchao Zhang and Xia Hu",
year = "2018",
month = "10",
day = "8",
doi = "10.1109/INFOCOM.2018.8486242",
language = "English (US)",
volume = "2018-April",
pages = "1106--1114",
booktitle = "INFOCOM 2018 - IEEE Conference on Computer Communications",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Privacy-Preserving Social Media Data Outsourcing

AU - Zhang, Jinxue

AU - Sun, Jingchao

AU - Zhang, Rui

AU - Zhang, Yanchao

AU - Hu, Xia

PY - 2018/10/8

Y1 - 2018/10/8

N2 - User-generated social media data are exploding and of high demand in public and private sectors. The disclosure of intact social media data exacerbates the threats to user privacy. In this paper, we first identify a text-based user-linkage attack on current data outsourcing practices, in which the real users in an anonymized dataset can be pinpointed based on the users' unprotected text data. Then we propose a framework for differentially privacy-preserving social media data outsourcing for the first time in literature. Within our framework, social media data service providers can outsource perturbed datasets to provide users differential privacy while offering high data utility to social media data consumers. Our differential privacy mechanism is based on a novel notion of E - text indistinguishability, which we propose to thwart the text-based user-linkage attack. Extensive experiments on real-world and synthetic datasets confirm that our framework can enable high-level differential privacy protection and also high data utility.

AB - User-generated social media data are exploding and of high demand in public and private sectors. The disclosure of intact social media data exacerbates the threats to user privacy. In this paper, we first identify a text-based user-linkage attack on current data outsourcing practices, in which the real users in an anonymized dataset can be pinpointed based on the users' unprotected text data. Then we propose a framework for differentially privacy-preserving social media data outsourcing for the first time in literature. Within our framework, social media data service providers can outsource perturbed datasets to provide users differential privacy while offering high data utility to social media data consumers. Our differential privacy mechanism is based on a novel notion of E - text indistinguishability, which we propose to thwart the text-based user-linkage attack. Extensive experiments on real-world and synthetic datasets confirm that our framework can enable high-level differential privacy protection and also high data utility.

UR - http://www.scopus.com/inward/record.url?scp=85056156737&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85056156737&partnerID=8YFLogxK

U2 - 10.1109/INFOCOM.2018.8486242

DO - 10.1109/INFOCOM.2018.8486242

M3 - Conference contribution

AN - SCOPUS:85056156737

VL - 2018-April

SP - 1106

EP - 1114

BT - INFOCOM 2018 - IEEE Conference on Computer Communications

PB - Institute of Electrical and Electronics Engineers Inc.

ER -