Extraction techniques for mining services from web sources

Hasan Davulcu, Saikat Mukherjee, I. V. Ramakrishnan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Scopus citations

Abstract

The Web has established itself as the dominant medium for doing electronic commerce. Consequently the number of service providers, both large and small, advertising their services on the web continues to proliferate. In this paper we describe new extraction algorithms for mining service directories from web pages. We develop a novel propagation technique for identifying and accumulating all of the attributes related to a service entity in a web page. We provide experimental results of the effectiveness of our extraction techniques by mining a database of veterinarian service providers from web sources.

Original languageEnglish (US)
Title of host publicationProceedings - 2002 IEEE International Conference on Data Mining, ICDM 2002
Pages601-604
Number of pages4
StatePublished - Dec 1 2002
Event2nd IEEE International Conference on Data Mining, ICDM '02 - Maebashi, Japan
Duration: Dec 9 2002Dec 12 2002

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Other

Other2nd IEEE International Conference on Data Mining, ICDM '02
CountryJapan
CityMaebashi
Period12/9/0212/12/02

    Fingerprint

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Davulcu, H., Mukherjee, S., & Ramakrishnan, I. V. (2002). Extraction techniques for mining services from web sources. In Proceedings - 2002 IEEE International Conference on Data Mining, ICDM 2002 (pp. 601-604). (Proceedings - IEEE International Conference on Data Mining, ICDM).