XML data integration: Schema extraction and mapping

Huiping Cao, Yan Qi, Kasim Candan, Maria Luisa Sapino

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Many applications require exchange and integration of data from multiple, heterogeneous sources. eXtensible Markup Language (XML) is a standard developed to satisfy the convenient data exchange needs of these applications. However, XML by itself does not address the data integration requirements. This chapter discusses the challenges and techniques in XML Data Integration. It first presents a four step outline, illustrating the steps involved in the integration of XML data. This chapter, then, focuses on the first two of these steps: schema extraction and data/schema mapping. More specifically, schema extraction presents techniques to extract tree summaries, DTDs, or XML Schemas from XML documents. The discussion on data/schema mapping focuses on techniques for aligning XML data and schemas.

Original languageEnglish (US)
Title of host publicationAdvanced Applications and Structures in XML Processing
Subtitle of host publicationLabel Streams, Semantics Utilization and Data Query Technologies
PublisherIGI Global
Pages308-332
Number of pages25
ISBN (Print)9781615207275
DOIs
StatePublished - 2010

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'XML data integration: Schema extraction and mapping'. Together they form a unique fingerprint.

Cite this