Privacy preserving text representation learning

Ghazaleh Beigi, Kai Shu, Ruocheng Guo, Suhang Wang, Huan Liu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Online users generate tremendous amounts of textual information by participating in different online activities. This data provides opportunities for researchers and business partners to understand individuals. However, this user-generated textual data not only can reveal the identity of the user but also may contain individual's private attribute information. Publishing the textual data thus compromises the privacy of users. It is challenging to design effective anonymization techniques for textual information which minimize the chances of re-identification and does not contain private information while retaining the textual semantic meaning. In this paper, we study this problem and propose a novel double privacy preserving text representation learning framework, DPText. We show the effectiveness of DPText in preserving privacy and utility.

Original languageEnglish (US)
Title of host publicationHT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media
PublisherAssociation for Computing Machinery, Inc
Pages275-276
Number of pages2
ISBN (Electronic)9781450368858
DOIs
StatePublished - Sep 12 2019
Externally publishedYes
Event30th ACM Conference on Hypertext and Social Media, HT 2019 - Hof, Germany
Duration: Sep 17 2019Sep 20 2019

Publication series

NameHT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media

Conference

Conference30th ACM Conference on Hypertext and Social Media, HT 2019
CountryGermany
CityHof
Period9/17/199/20/19

Fingerprint

Semantics
Industry

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Human-Computer Interaction
  • Computer Graphics and Computer-Aided Design

Cite this

Beigi, G., Shu, K., Guo, R., Wang, S., & Liu, H. (2019). Privacy preserving text representation learning. In HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media (pp. 275-276). (HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media). Association for Computing Machinery, Inc. https://doi.org/10.1145/3342220.3344925

Privacy preserving text representation learning. / Beigi, Ghazaleh; Shu, Kai; Guo, Ruocheng; Wang, Suhang; Liu, Huan.

HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media. Association for Computing Machinery, Inc, 2019. p. 275-276 (HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Beigi, G, Shu, K, Guo, R, Wang, S & Liu, H 2019, Privacy preserving text representation learning. in HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media. HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media, Association for Computing Machinery, Inc, pp. 275-276, 30th ACM Conference on Hypertext and Social Media, HT 2019, Hof, Germany, 9/17/19. https://doi.org/10.1145/3342220.3344925
Beigi G, Shu K, Guo R, Wang S, Liu H. Privacy preserving text representation learning. In HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media. Association for Computing Machinery, Inc. 2019. p. 275-276. (HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media). https://doi.org/10.1145/3342220.3344925
Beigi, Ghazaleh ; Shu, Kai ; Guo, Ruocheng ; Wang, Suhang ; Liu, Huan. / Privacy preserving text representation learning. HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media. Association for Computing Machinery, Inc, 2019. pp. 275-276 (HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media).
@inproceedings{c524be58f380447ebfafd5eb7ce2cf11,
title = "Privacy preserving text representation learning",
abstract = "Online users generate tremendous amounts of textual information by participating in different online activities. This data provides opportunities for researchers and business partners to understand individuals. However, this user-generated textual data not only can reveal the identity of the user but also may contain individual's private attribute information. Publishing the textual data thus compromises the privacy of users. It is challenging to design effective anonymization techniques for textual information which minimize the chances of re-identification and does not contain private information while retaining the textual semantic meaning. In this paper, we study this problem and propose a novel double privacy preserving text representation learning framework, DPText. We show the effectiveness of DPText in preserving privacy and utility.",
author = "Ghazaleh Beigi and Kai Shu and Ruocheng Guo and Suhang Wang and Huan Liu",
year = "2019",
month = "9",
day = "12",
doi = "10.1145/3342220.3344925",
language = "English (US)",
series = "HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media",
publisher = "Association for Computing Machinery, Inc",
pages = "275--276",
booktitle = "HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media",

}

TY - GEN

T1 - Privacy preserving text representation learning

AU - Beigi, Ghazaleh

AU - Shu, Kai

AU - Guo, Ruocheng

AU - Wang, Suhang

AU - Liu, Huan

PY - 2019/9/12

Y1 - 2019/9/12

N2 - Online users generate tremendous amounts of textual information by participating in different online activities. This data provides opportunities for researchers and business partners to understand individuals. However, this user-generated textual data not only can reveal the identity of the user but also may contain individual's private attribute information. Publishing the textual data thus compromises the privacy of users. It is challenging to design effective anonymization techniques for textual information which minimize the chances of re-identification and does not contain private information while retaining the textual semantic meaning. In this paper, we study this problem and propose a novel double privacy preserving text representation learning framework, DPText. We show the effectiveness of DPText in preserving privacy and utility.

AB - Online users generate tremendous amounts of textual information by participating in different online activities. This data provides opportunities for researchers and business partners to understand individuals. However, this user-generated textual data not only can reveal the identity of the user but also may contain individual's private attribute information. Publishing the textual data thus compromises the privacy of users. It is challenging to design effective anonymization techniques for textual information which minimize the chances of re-identification and does not contain private information while retaining the textual semantic meaning. In this paper, we study this problem and propose a novel double privacy preserving text representation learning framework, DPText. We show the effectiveness of DPText in preserving privacy and utility.

UR - http://www.scopus.com/inward/record.url?scp=85073374620&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073374620&partnerID=8YFLogxK

U2 - 10.1145/3342220.3344925

DO - 10.1145/3342220.3344925

M3 - Conference contribution

AN - SCOPUS:85073374620

T3 - HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media

SP - 275

EP - 276

BT - HT 2019 - Proceedings of the 30th ACM Conference on Hypertext and Social Media

PB - Association for Computing Machinery, Inc

ER -