Semantic ETL-State-of-the-Art and Open Research Challenges

Jaydeep Chakraborty, Aparna Padki, Srividya Bansal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

There has been an exponential growth and availability of data, both structured and unstructured. Massive amounts of data are available to be harvested for competitive business advantage, sound government policies, and new insights in a broad array of applications (including healthcare, biomedicine, energy, smart cities, genomics, transportation, etc.). Yet, most of this data is inaccessible for users, as we need technology and tools to find, transform, analyze, and visualize data in order to make it consumable for decision-making. Meaningful data integration in a schema-less, and complex Big Data world of databases is a big open challenge. This survey paper presents a holistic view of literature in data integration and Extract-Transform-Load (ETL) techniques. Limitations and gaps in existing approaches are identified and open research challenges are discussed.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages413-418
Number of pages6
ISBN (Electronic)9781509048960
DOIs
StatePublished - Mar 29 2017
Event11th IEEE International Conference on Semantic Computing, ICSC 2017 - San Diego, United States
Duration: Jan 30 2017Feb 1 2017

Other

Other11th IEEE International Conference on Semantic Computing, ICSC 2017
CountryUnited States
CitySan Diego
Period1/30/172/1/17

Fingerprint

Data integration
Semantics
Mathematical transformations
Decision making
Availability
Acoustic waves
Industry
Genomics
Big data
Smart city

Keywords

  • Data Integration
  • Linked Data
  • Semantic Computing
  • Semantic ETL

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Computer Networks and Communications

Cite this

Chakraborty, J., Padki, A., & Bansal, S. (2017). Semantic ETL-State-of-the-Art and Open Research Challenges. In Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017 (pp. 413-418). [7889572] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSC.2017.94

Semantic ETL-State-of-the-Art and Open Research Challenges. / Chakraborty, Jaydeep; Padki, Aparna; Bansal, Srividya.

Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017. Institute of Electrical and Electronics Engineers Inc., 2017. p. 413-418 7889572.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chakraborty, J, Padki, A & Bansal, S 2017, Semantic ETL-State-of-the-Art and Open Research Challenges. in Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017., 7889572, Institute of Electrical and Electronics Engineers Inc., pp. 413-418, 11th IEEE International Conference on Semantic Computing, ICSC 2017, San Diego, United States, 1/30/17. https://doi.org/10.1109/ICSC.2017.94
Chakraborty J, Padki A, Bansal S. Semantic ETL-State-of-the-Art and Open Research Challenges. In Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017. Institute of Electrical and Electronics Engineers Inc. 2017. p. 413-418. 7889572 https://doi.org/10.1109/ICSC.2017.94
Chakraborty, Jaydeep ; Padki, Aparna ; Bansal, Srividya. / Semantic ETL-State-of-the-Art and Open Research Challenges. Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 413-418
@inproceedings{8b4d9ba6884e4b1198ac992dc7dc09f9,
title = "Semantic ETL-State-of-the-Art and Open Research Challenges",
abstract = "There has been an exponential growth and availability of data, both structured and unstructured. Massive amounts of data are available to be harvested for competitive business advantage, sound government policies, and new insights in a broad array of applications (including healthcare, biomedicine, energy, smart cities, genomics, transportation, etc.). Yet, most of this data is inaccessible for users, as we need technology and tools to find, transform, analyze, and visualize data in order to make it consumable for decision-making. Meaningful data integration in a schema-less, and complex Big Data world of databases is a big open challenge. This survey paper presents a holistic view of literature in data integration and Extract-Transform-Load (ETL) techniques. Limitations and gaps in existing approaches are identified and open research challenges are discussed.",
keywords = "Data Integration, Linked Data, Semantic Computing, Semantic ETL",
author = "Jaydeep Chakraborty and Aparna Padki and Srividya Bansal",
year = "2017",
month = "3",
day = "29",
doi = "10.1109/ICSC.2017.94",
language = "English (US)",
pages = "413--418",
booktitle = "Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

TY - GEN

T1 - Semantic ETL-State-of-the-Art and Open Research Challenges

AU - Chakraborty, Jaydeep

AU - Padki, Aparna

AU - Bansal, Srividya

PY - 2017/3/29

Y1 - 2017/3/29

N2 - There has been an exponential growth and availability of data, both structured and unstructured. Massive amounts of data are available to be harvested for competitive business advantage, sound government policies, and new insights in a broad array of applications (including healthcare, biomedicine, energy, smart cities, genomics, transportation, etc.). Yet, most of this data is inaccessible for users, as we need technology and tools to find, transform, analyze, and visualize data in order to make it consumable for decision-making. Meaningful data integration in a schema-less, and complex Big Data world of databases is a big open challenge. This survey paper presents a holistic view of literature in data integration and Extract-Transform-Load (ETL) techniques. Limitations and gaps in existing approaches are identified and open research challenges are discussed.

AB - There has been an exponential growth and availability of data, both structured and unstructured. Massive amounts of data are available to be harvested for competitive business advantage, sound government policies, and new insights in a broad array of applications (including healthcare, biomedicine, energy, smart cities, genomics, transportation, etc.). Yet, most of this data is inaccessible for users, as we need technology and tools to find, transform, analyze, and visualize data in order to make it consumable for decision-making. Meaningful data integration in a schema-less, and complex Big Data world of databases is a big open challenge. This survey paper presents a holistic view of literature in data integration and Extract-Transform-Load (ETL) techniques. Limitations and gaps in existing approaches are identified and open research challenges are discussed.

KW - Data Integration

KW - Linked Data

KW - Semantic Computing

KW - Semantic ETL

UR - http://www.scopus.com/inward/record.url?scp=85018331929&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85018331929&partnerID=8YFLogxK

U2 - 10.1109/ICSC.2017.94

DO - 10.1109/ICSC.2017.94

M3 - Conference contribution

SP - 413

EP - 418

BT - Proceedings - IEEE 11th International Conference on Semantic Computing, ICSC 2017

PB - Institute of Electrical and Electronics Engineers Inc.

ER -