Modeling semantics between programming codes and annotations

Yihan Lu, Ihan Hsiao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

It is a common practice for programmers to leave annotations during program development. Most of the annotated documentations are predominantly being used as the archive of the coding events for limited developers. We hypothesize that these annotations captured mass amount of valuable information which can be utilized to identify similar codes or to examine code quality. However, due to the annotating behaviors vary and the language composition can be complex, this work sets out to investigate a systematic method to examine the annotation semantics and their relations with codes. We designed a semantic parser to extract concepts from codes and the corresponding annotations. Additionally, text mining techniques are applied to summarize linguistic features from the annotations. We then build models to predict concepts in programming code annotations. Results show that the proposed semantic modeling method achieved a higher performance compared to a random guessed baseline.

Original languageEnglish (US)
Title of host publicationHT 2018 - Proceedings of the 29th ACM Conference on Hypertext and Social Media
PublisherAssociation for Computing Machinery, Inc
Pages101-105
Number of pages5
ISBN (Electronic)9781450354271
DOIs
StatePublished - Jul 3 2018
Event29th ACM International Conference on Hypertext and Social Media, HT 2018 - Baltimore, United States
Duration: Jul 9 2018Jul 12 2018

Publication series

NameHT 2018 - Proceedings of the 29th ACM Conference on Hypertext and Social Media

Conference

Conference29th ACM International Conference on Hypertext and Social Media, HT 2018
Country/TerritoryUnited States
CityBaltimore
Period7/9/187/12/18

Keywords

  • Coding concept detection
  • Programming semantics
  • Semantic modeling
  • Text based classification

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
  • Human-Computer Interaction
  • Computer Graphics and Computer-Aided Design

Fingerprint

Dive into the research topics of 'Modeling semantics between programming codes and annotations'. Together they form a unique fingerprint.

Cite this