CDIP: Collection-driven, yet individuality-preserving automated blog tagging

Wook Kim Jong, Kasim Candan, Junichi Tatemura

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

4 Citations (Scopus)

Abstract

With the success of blogs as a popular information-sharing medium, searches on blogs have become common. In the blogosphere, tagging is used to annotate blog entries with contextually meaningful keywords, which enables users to locate blog content more easily. Yet, although tags provided by bloggers are effective for organizing blog entries, in many cases they are not sufficient to properly capture the semantics of the blog content. In our previous work [7], we observed that there exists a large degree of content overlap (not only in the form of quotation/commentary pairs, but also as content borrowing across media outlets) among blog entries, which makes effective, discriminating keyword searches difficult. In this paper, we further note that these implicit or explicit quotations could be leveraged to identify the contexts in which entries occur, resulting in more effective tagging. We therefore propose CDIP (a collection-driven, yet individuality-preserving tagging system), which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag blogs in such a way that not only do related blogs share tags, but the individuality of the entries is also preserved for discriminating tag-based access.

Original language: English (US)
Title of host publication: ICSC 2007 International Conference on Semantic Computing
Pages: 87-94
Number of pages: 8
DOIs: https://doi.org/10.1109/ICSC.2007.98
State: Published - 2007
Event: ICSC 2007 International Conference on Semantic Computing - Irvine CA, United States
Duration: Sep 17 2007 - Sep 19 2007

Other

Other: ICSC 2007 International Conference on Semantic Computing
Country: United States
City: Irvine CA
Period: 9/17/07 - 9/19/07

Fingerprint

Blogs
Semantics

ASJC Scopus subject areas

  • Computer Science(all)
  • Computer Science Applications

Cite this

Jong, W. K., Candan, K., & Tatemura, J. (2007). CDIP: Collection-driven, yet individuality-preserving automated blog tagging. In ICSC 2007 International Conference on Semantic Computing (pp. 87-94). [4338336] https://doi.org/10.1109/ICSC.2007.98

CDIP: Collection-driven, yet individuality-preserving automated blog tagging. / Jong, Wook Kim; Candan, Kasim; Tatemura, Junichi.

ICSC 2007 International Conference on Semantic Computing. 2007. p. 87-94 4338336.

Jong, WK, Candan, K & Tatemura, J 2007, CDIP: Collection-driven, yet individuality-preserving automated blog tagging. in ICSC 2007 International Conference on Semantic Computing., 4338336, pp. 87-94, ICSC 2007 International Conference on Semantic Computing, Irvine CA, United States, 9/17/07. https://doi.org/10.1109/ICSC.2007.98
Jong WK, Candan K, Tatemura J. CDIP: Collection-driven, yet individuality-preserving automated blog tagging. In ICSC 2007 International Conference on Semantic Computing. 2007. p. 87-94. 4338336 https://doi.org/10.1109/ICSC.2007.98
Jong, Wook Kim ; Candan, Kasim ; Tatemura, Junichi. / CDIP: Collection-driven, yet individuality-preserving automated blog tagging. ICSC 2007 International Conference on Semantic Computing. 2007. pp. 87-94
@inproceedings{9692c8cee2ab47dcbd70fc9b35aaf421,
title = "CDIP: Collection-driven, yet individuality-preserving automated blog tagging",
abstract = "With the success of blogs as a popular information-sharing medium, searches on blogs have become common. In the blogosphere, tagging is used to annotate blog entries with contextually meaningful keywords, which enables users to locate blog content more easily. Yet, although tags provided by bloggers are effective for organizing blog entries, in many cases they are not sufficient to properly capture the semantics of the blog content. In our previous work [7], we observed that there exists a large degree of content overlap (not only in the form of quotation/commentary pairs, but also as content borrowing across media outlets) among blog entries, which makes effective, discriminating keyword searches difficult. In this paper, we further note that these implicit or explicit quotations could be leveraged to identify the contexts in which entries occur, resulting in more effective tagging. We therefore propose CDIP (a collection-driven, yet individuality-preserving tagging system), which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag blogs in such a way that not only do related blogs share tags, but the individuality of the entries is also preserved for discriminating tag-based access.",
author = "Jong, {Wook Kim} and Kasim Candan and Junichi Tatemura",
year = "2007",
doi = "10.1109/ICSC.2007.98",
language = "English (US)",
isbn = "0769529976",
pages = "87--94",
booktitle = "ICSC 2007 International Conference on Semantic Computing",

}

TY - GEN

T1 - CDIP

T2 - Collection-driven, yet individuality-preserving automated blog tagging

AU - Jong, Wook Kim

AU - Candan, Kasim

AU - Tatemura, Junichi

PY - 2007

Y1 - 2007

AB - With the success of blogs as a popular information-sharing medium, searches on blogs have become common. In the blogosphere, tagging is used to annotate blog entries with contextually meaningful keywords, which enables users to locate blog content more easily. Yet, although tags provided by bloggers are effective for organizing blog entries, in many cases they are not sufficient to properly capture the semantics of the blog content. In our previous work [7], we observed that there exists a large degree of content overlap (not only in the form of quotation/commentary pairs, but also as content borrowing across media outlets) among blog entries, which makes effective, discriminating keyword searches difficult. In this paper, we further note that these implicit or explicit quotations could be leveraged to identify the contexts in which entries occur, resulting in more effective tagging. We therefore propose CDIP (a collection-driven, yet individuality-preserving tagging system), which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag blogs in such a way that not only do related blogs share tags, but the individuality of the entries is also preserved for discriminating tag-based access.

UR - http://www.scopus.com/inward/record.url?scp=47749095961&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=47749095961&partnerID=8YFLogxK

U2 - 10.1109/ICSC.2007.98

DO - 10.1109/ICSC.2007.98

M3 - Conference contribution

AN - SCOPUS:47749095961

SN - 0769529976

SN - 9780769529974

SP - 87

EP - 94

BT - ICSC 2007 International Conference on Semantic Computing

ER -