Data Extraction and Integration for Scholar Recommendation System

Jaydeep Chakraborty, Gurusrikar Thopugunta, Srividya Bansal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Recommendation systems have been an integral part of massive open online courses (MOOCs). With a large amount of availability of data and resources, recommending scholars and professors through general reviews and academic advisor applications has become a tiresome job. Finding professors and scholars relevant to a student's area of interest involves a combination of multiple factors like field of study, depth of research area, research background of professors, ongoing research opportunities, etc. As recommending scholars and professors deals with so many different factors, it is very complex and unreliable when done manually. In this paper, we present a content-based mining approach to go through all relevant resources, extract required information, and use it to recommend a list of scholars based on student's area of interest. For our experimental model, we gathered information about a number of professors at our institution from various web resources such as IEEE, Springer, ACM, Sciencedirect, arxiv and department website. We use topic modeling and clustering algorithms in our content-based mining approach. We present a comparative analysis of the following topic model algorithms: latent dirichlet allocation (LDA), hierarchical dirichlet process(HDP), latent semantic analysis (LSA) and clustering techniques: k-means and hierarchical clustering in determining the most accurate recommendation list of professors or scholars.

Original languageEnglish (US)
Title of host publicationProceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages397-402
Number of pages6
Volume2018-January
ISBN (Electronic)9781538644072
DOIs
StatePublished - Apr 9 2018
Event12th IEEE International Conference on Semantic Computing, ICSC 2018 - Laguna Hills, United States
Duration: Jan 31 2018Feb 2 2018

Other

Other12th IEEE International Conference on Semantic Computing, ICSC 2018
CountryUnited States
CityLaguna Hills
Period1/31/182/2/18

Fingerprint

Recommender systems
Students
Clustering algorithms
Websites
Semantics
Availability
Data extraction
Resources
Data integration
Recommendation system
Factors

Keywords

  • Clustering
  • Hierarchical clustering
  • Hierarchical Dirichlet Process
  • k-means clustering
  • Latent Dirichlet Allocation
  • Latent Semantic Analysis
  • Recommendation System
  • Topic modeling

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Human-Computer Interaction
  • Information Systems and Management

Cite this

Chakraborty, J., Thopugunta, G., & Bansal, S. (2018). Data Extraction and Integration for Scholar Recommendation System. In Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018 (Vol. 2018-January, pp. 397-402). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSC.2018.00079

Data Extraction and Integration for Scholar Recommendation System. / Chakraborty, Jaydeep; Thopugunta, Gurusrikar; Bansal, Srividya.

Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018. Vol. 2018-January Institute of Electrical and Electronics Engineers Inc., 2018. p. 397-402.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chakraborty, J, Thopugunta, G & Bansal, S 2018, Data Extraction and Integration for Scholar Recommendation System. in Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018. vol. 2018-January, Institute of Electrical and Electronics Engineers Inc., pp. 397-402, 12th IEEE International Conference on Semantic Computing, ICSC 2018, Laguna Hills, United States, 1/31/18. https://doi.org/10.1109/ICSC.2018.00079
Chakraborty J, Thopugunta G, Bansal S. Data Extraction and Integration for Scholar Recommendation System. In Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018. Vol. 2018-January. Institute of Electrical and Electronics Engineers Inc. 2018. p. 397-402 https://doi.org/10.1109/ICSC.2018.00079
Chakraborty, Jaydeep ; Thopugunta, Gurusrikar ; Bansal, Srividya. / Data Extraction and Integration for Scholar Recommendation System. Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018. Vol. 2018-January Institute of Electrical and Electronics Engineers Inc., 2018. pp. 397-402
@inproceedings{43bde838ca024c6e9b2c56caeed6f9ad,
title = "Data Extraction and Integration for Scholar Recommendation System",
abstract = "Recommendation systems have been an integral part of massive open online courses (MOOCs). With a large amount of availability of data and resources, recommending scholars and professors through general reviews and academic advisor applications has become a tiresome job. Finding professors and scholars relevant to a student's area of interest involves a combination of multiple factors like field of study, depth of research area, research background of professors, ongoing research opportunities, etc. As recommending scholars and professors deals with so many different factors, it is very complex and unreliable when done manually. In this paper, we present a content-based mining approach to go through all relevant resources, extract required information, and use it to recommend a list of scholars based on student's area of interest. For our experimental model, we gathered information about a number of professors at our institution from various web resources such as IEEE, Springer, ACM, Sciencedirect, arxiv and department website. We use topic modeling and clustering algorithms in our content-based mining approach. We present a comparative analysis of the following topic model algorithms: latent dirichlet allocation (LDA), hierarchical dirichlet process(HDP), latent semantic analysis (LSA) and clustering techniques: k-means and hierarchical clustering in determining the most accurate recommendation list of professors or scholars.",
keywords = "Clustering, Hierarchical clustering, Hierarchical Dirichlet Process, k-means clustering, Latent Dirichlet Allocation, Latent Semantic Analysis, Recommendation System, Topic modeling",
author = "Jaydeep Chakraborty and Gurusrikar Thopugunta and Srividya Bansal",
year = "2018",
month = "4",
day = "9",
doi = "10.1109/ICSC.2018.00079",
language = "English (US)",
volume = "2018-January",
pages = "397--402",
booktitle = "Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Data Extraction and Integration for Scholar Recommendation System

AU - Chakraborty, Jaydeep

AU - Thopugunta, Gurusrikar

AU - Bansal, Srividya

PY - 2018/4/9

Y1 - 2018/4/9

N2 - Recommendation systems have been an integral part of massive open online courses (MOOCs). With a large amount of availability of data and resources, recommending scholars and professors through general reviews and academic advisor applications has become a tiresome job. Finding professors and scholars relevant to a student's area of interest involves a combination of multiple factors like field of study, depth of research area, research background of professors, ongoing research opportunities, etc. As recommending scholars and professors deals with so many different factors, it is very complex and unreliable when done manually. In this paper, we present a content-based mining approach to go through all relevant resources, extract required information, and use it to recommend a list of scholars based on student's area of interest. For our experimental model, we gathered information about a number of professors at our institution from various web resources such as IEEE, Springer, ACM, Sciencedirect, arxiv and department website. We use topic modeling and clustering algorithms in our content-based mining approach. We present a comparative analysis of the following topic model algorithms: latent dirichlet allocation (LDA), hierarchical dirichlet process(HDP), latent semantic analysis (LSA) and clustering techniques: k-means and hierarchical clustering in determining the most accurate recommendation list of professors or scholars.

AB - Recommendation systems have been an integral part of massive open online courses (MOOCs). With a large amount of availability of data and resources, recommending scholars and professors through general reviews and academic advisor applications has become a tiresome job. Finding professors and scholars relevant to a student's area of interest involves a combination of multiple factors like field of study, depth of research area, research background of professors, ongoing research opportunities, etc. As recommending scholars and professors deals with so many different factors, it is very complex and unreliable when done manually. In this paper, we present a content-based mining approach to go through all relevant resources, extract required information, and use it to recommend a list of scholars based on student's area of interest. For our experimental model, we gathered information about a number of professors at our institution from various web resources such as IEEE, Springer, ACM, Sciencedirect, arxiv and department website. We use topic modeling and clustering algorithms in our content-based mining approach. We present a comparative analysis of the following topic model algorithms: latent dirichlet allocation (LDA), hierarchical dirichlet process(HDP), latent semantic analysis (LSA) and clustering techniques: k-means and hierarchical clustering in determining the most accurate recommendation list of professors or scholars.

KW - Clustering

KW - Hierarchical clustering

KW - Hierarchical Dirichlet Process

KW - k-means clustering

KW - Latent Dirichlet Allocation

KW - Latent Semantic Analysis

KW - Recommendation System

KW - Topic modeling

UR - http://www.scopus.com/inward/record.url?scp=85048399510&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85048399510&partnerID=8YFLogxK

U2 - 10.1109/ICSC.2018.00079

DO - 10.1109/ICSC.2018.00079

M3 - Conference contribution

VL - 2018-January

SP - 397

EP - 402

BT - Proceedings - 12th IEEE International Conference on Semantic Computing, ICSC 2018

PB - Institute of Electrical and Electronics Engineers Inc.

ER -