TY - GEN
T1 - Extraction techniques for mining services from web sources
AU - Davulcu, Hasan
AU - Mukherjee, Saikat
AU - Ramakrishnan, I. V.
PY - 2002
Y1 - 2002
N2 - The Web has established itself as the dominant medium for electronic commerce. Consequently, the number of service providers, both large and small, advertising their services on the Web continues to proliferate. In this paper, we describe new extraction algorithms for mining service directories from web pages. We develop a novel propagation technique for identifying and accumulating all of the attributes related to a service entity in a web page. We provide experimental results on the effectiveness of our extraction techniques by mining a database of veterinarian service providers from web sources.
AB - The Web has established itself as the dominant medium for electronic commerce. Consequently, the number of service providers, both large and small, advertising their services on the Web continues to proliferate. In this paper, we describe new extraction algorithms for mining service directories from web pages. We develop a novel propagation technique for identifying and accumulating all of the attributes related to a service entity in a web page. We provide experimental results on the effectiveness of our extraction techniques by mining a database of veterinarian service providers from web sources.
UR - http://www.scopus.com/inward/record.url?scp=78149340372&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78149340372&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:78149340372
SN - 0769517544
SN - 9780769517544
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 601
EP - 604
BT - Proceedings - 2002 IEEE International Conference on Data Mining, ICDM 2002
T2 - 2nd IEEE International Conference on Data Mining, ICDM '02
Y2 - 9 December 2002 through 12 December 2002
ER -