TY - GEN
T1 - CDIP
T2 - ICSC 2007 International Conference on Semantic Computing
AU - Jong, Wook Kim
AU - Candan, Kasim
AU - Tatemura, Junichi
PY - 2007
Y1 - 2007
N2 - With the success of blogs as popular information sharing media, searches on blogs have become popular. In the blogosphere, tagging is used as a means of annotating blog entries with contextually meaningful keywords, which enable users more easily locate blog content. Yet, although tags provided by bloggers are effective for organizing blog entries, in many cases, they are not always sufficient in properly capturing the semantics of the blog content. In our previous work [7], we observed that there exists large degree of content overlap (not only in the form of quotation/commentary pairs, but also as content borrowing across media outlets) among blog entries, which makes it hard for effective, discriminating keyword searches. In this paper, we further note that these implicit or explicit quotations could be leveraged to identify the contexts in which entries occur; thus, resulting in more effective tagging. Thus, we propose CDIP (a collection-driven, yet individuality-preserving tagging system) which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag the blogs in such a way that, not-only the related blogs share tags, but also individuality of the entries is preserved for discriminating tag-based accesses.
AB - With the success of blogs as popular information sharing media, searches on blogs have become popular. In the blogosphere, tagging is used as a means of annotating blog entries with contextually meaningful keywords, which enable users more easily locate blog content. Yet, although tags provided by bloggers are effective for organizing blog entries, in many cases, they are not always sufficient in properly capturing the semantics of the blog content. In our previous work [7], we observed that there exists large degree of content overlap (not only in the form of quotation/commentary pairs, but also as content borrowing across media outlets) among blog entries, which makes it hard for effective, discriminating keyword searches. In this paper, we further note that these implicit or explicit quotations could be leveraged to identify the contexts in which entries occur; thus, resulting in more effective tagging. Thus, we propose CDIP (a collection-driven, yet individuality-preserving tagging system) which relies on relationships provided by quotation/reuse detection and semantic-focus analysis to automatically tag the blogs in such a way that, not-only the related blogs share tags, but also individuality of the entries is preserved for discriminating tag-based accesses.
UR - http://www.scopus.com/inward/record.url?scp=47749095961&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=47749095961&partnerID=8YFLogxK
U2 - 10.1109/ICSC.2007.98
DO - 10.1109/ICSC.2007.98
M3 - Conference contribution
AN - SCOPUS:47749095961
SN - 0769529976
SN - 9780769529974
T3 - ICSC 2007 International Conference on Semantic Computing
SP - 87
EP - 94
BT - ICSC 2007 International Conference on Semantic Computing
Y2 - 17 September 2007 through 19 September 2007
ER -