TY - GEN
T1 - Text Transformations in Contrastive Self-Supervised Learning
T2 - 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
AU - Bhattacharjee, Amrita
AU - Karami, Mansooreh
AU - Liu, Huan
N1 - Funding Information:
This research is supported by DARPA (HR001120C0123) and ONR (N00014-21-1-4002). The views, opinions, and/or findings expressed are those of the authors and should not be interpreted as representing the official views or policies of the Department of Defense or the U.S. Government.
Publisher Copyright:
© 2022 International Joint Conferences on Artificial Intelligence. All rights reserved.
PY - 2022
Y1 - 2022
N2 - Contrastive self-supervised learning has become a prominent technique in representation learning. The main step in these methods is to contrast semantically similar and dissimilar pairs of samples. However, in the domain of Natural Language Processing (NLP), designing augmentation methods that create similar pairs while satisfying contrastive learning (CL) assumptions is challenging. This is because even modifying a single word in the input can change the semantic meaning of the sentence and hence violate the distributional hypothesis. In this review paper, we formalize the contrastive learning framework, emphasize the considerations that need to be addressed in the data transformation step, and review state-of-the-art methods and evaluations for contrastive representation learning in NLP. Finally, we describe some challenges and potential directions for learning better text representations using contrastive methods.
AB - Contrastive self-supervised learning has become a prominent technique in representation learning. The main step in these methods is to contrast semantically similar and dissimilar pairs of samples. However, in the domain of Natural Language Processing (NLP), designing augmentation methods that create similar pairs while satisfying contrastive learning (CL) assumptions is challenging. This is because even modifying a single word in the input can change the semantic meaning of the sentence and hence violate the distributional hypothesis. In this review paper, we formalize the contrastive learning framework, emphasize the considerations that need to be addressed in the data transformation step, and review state-of-the-art methods and evaluations for contrastive representation learning in NLP. Finally, we describe some challenges and potential directions for learning better text representations using contrastive methods.
UR - http://www.scopus.com/inward/record.url?scp=85137893839&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85137893839&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85137893839
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 5394
EP - 5401
BT - Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022
A2 - De Raedt, Luc
PB - International Joint Conferences on Artificial Intelligence
Y2 - 23 July 2022 through 29 July 2022
ER -