Information Extraction and Visualization of Unstructured Textual Data

Syed Usama Hashmi, Ajay Bansal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

There is a large amount of textual data on the web that has to be analyzed manually by humans in order to use it in meaningful ways. Techniques that can analyze the data, convert it to meaningful information, connect it with other sources of information, and allow querying would be extremely useful. There are different kinds of textual information available with each kid catering to a different kind of audience. This paper presents an information extraction approach that is a modified traversal algorithm on dependency parse output of text to extract all subject predicate object pairs from text while ensuring that no information is missed out. The output format is designed specifically to fit on a node-edge-node model and form the building blocks of a network that makes understanding of the text and querying of information from corpus quick and intuitive.

Original languageEnglish (US)
Title of host publicationProceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages142-145
Number of pages4
ISBN (Electronic)9781538667835
DOIs
StatePublished - Mar 11 2019
Event13th IEEE International Conference on Semantic Computing, ICSC 2019 - Newport Beach, United States
Duration: Jan 30 2019Feb 1 2019

Publication series

NameProceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019

Conference

Conference13th IEEE International Conference on Semantic Computing, ICSC 2019
CountryUnited States
CityNewport Beach
Period1/30/192/1/19

Fingerprint

Visualization

Keywords

  • Information Extraction
  • Natural Language processing
  • Visualization of Unstructured data

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software

Cite this

Hashmi, S. U., & Bansal, A. (2019). Information Extraction and Visualization of Unstructured Textual Data. In Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019 (pp. 142-145). [8665534] (Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICOSC.2019.8665534

Information Extraction and Visualization of Unstructured Textual Data. / Hashmi, Syed Usama; Bansal, Ajay.

Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 142-145 8665534 (Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hashmi, SU & Bansal, A 2019, Information Extraction and Visualization of Unstructured Textual Data. in Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019., 8665534, Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019, Institute of Electrical and Electronics Engineers Inc., pp. 142-145, 13th IEEE International Conference on Semantic Computing, ICSC 2019, Newport Beach, United States, 1/30/19. https://doi.org/10.1109/ICOSC.2019.8665534
Hashmi SU, Bansal A. Information Extraction and Visualization of Unstructured Textual Data. In Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 142-145. 8665534. (Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019). https://doi.org/10.1109/ICOSC.2019.8665534
Hashmi, Syed Usama ; Bansal, Ajay. / Information Extraction and Visualization of Unstructured Textual Data. Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 142-145 (Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019).
@inproceedings{94915f35b4ac47f9832cff53de425118,
title = "Information Extraction and Visualization of Unstructured Textual Data",
abstract = "There is a large amount of textual data on the web that has to be analyzed manually by humans in order to use it in meaningful ways. Techniques that can analyze the data, convert it to meaningful information, connect it with other sources of information, and allow querying would be extremely useful. There are different kinds of textual information available with each kid catering to a different kind of audience. This paper presents an information extraction approach that is a modified traversal algorithm on dependency parse output of text to extract all subject predicate object pairs from text while ensuring that no information is missed out. The output format is designed specifically to fit on a node-edge-node model and form the building blocks of a network that makes understanding of the text and querying of information from corpus quick and intuitive.",
keywords = "Information Extraction, Natural Language processing, Visualization of Unstructured data",
author = "Hashmi, {Syed Usama} and Ajay Bansal",
year = "2019",
month = "3",
day = "11",
doi = "10.1109/ICOSC.2019.8665534",
language = "English (US)",
series = "Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "142--145",
booktitle = "Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019",

}

TY - GEN

T1 - Information Extraction and Visualization of Unstructured Textual Data

AU - Hashmi, Syed Usama

AU - Bansal, Ajay

PY - 2019/3/11

Y1 - 2019/3/11

N2 - There is a large amount of textual data on the web that has to be analyzed manually by humans in order to use it in meaningful ways. Techniques that can analyze the data, convert it to meaningful information, connect it with other sources of information, and allow querying would be extremely useful. There are different kinds of textual information available with each kid catering to a different kind of audience. This paper presents an information extraction approach that is a modified traversal algorithm on dependency parse output of text to extract all subject predicate object pairs from text while ensuring that no information is missed out. The output format is designed specifically to fit on a node-edge-node model and form the building blocks of a network that makes understanding of the text and querying of information from corpus quick and intuitive.

AB - There is a large amount of textual data on the web that has to be analyzed manually by humans in order to use it in meaningful ways. Techniques that can analyze the data, convert it to meaningful information, connect it with other sources of information, and allow querying would be extremely useful. There are different kinds of textual information available with each kid catering to a different kind of audience. This paper presents an information extraction approach that is a modified traversal algorithm on dependency parse output of text to extract all subject predicate object pairs from text while ensuring that no information is missed out. The output format is designed specifically to fit on a node-edge-node model and form the building blocks of a network that makes understanding of the text and querying of information from corpus quick and intuitive.

KW - Information Extraction

KW - Natural Language processing

KW - Visualization of Unstructured data

UR - http://www.scopus.com/inward/record.url?scp=85064123131&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85064123131&partnerID=8YFLogxK

U2 - 10.1109/ICOSC.2019.8665534

DO - 10.1109/ICOSC.2019.8665534

M3 - Conference contribution

T3 - Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019

SP - 142

EP - 145

BT - Proceedings - 13th IEEE International Conference on Semantic Computing, ICSC 2019

PB - Institute of Electrical and Electronics Engineers Inc.

ER -