We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Composition pattern oriented tag extraction from short documents using a structural learning method.
- Authors
Shin, Yongwook; Lee, Sung-Jun; Park, Jonghun
- Abstract
With the rapid growth of web, automatic tagging that detects informative terms from a document becomes an important problem for information aggregation and sharing services. In particular, automatic tagging for short documents becomes more interesting as many users are increasingly publishing information through social media services which encourage users to create the documents of short length. In this paper, we propose a novel automatic tagging model for short text documents from social media services, following the framework of supervised learning. We redefine traditional frequency-based term features so that they can address the properties of the documents created under length limitation and consider sequential dependencies between successive terms in a document based on a structural support vector machine. In addition, our proposed approach incorporates composition patterns by which users put informative terms into their documents. Extensive experiments have been conducted to validate the presented approach, and it was found that the proposed term features were effective for extracting tags, and the tag extractor trained by considering the sequential dependencies and composition patterns achieved superior performance results over the existing alternative methods.
- Subjects
DATA mining; STRUCTURAL learning theory; INFORMATION sharing; SUPPORT vector machines; INFORMATION retrieval; SOCIAL media
- Publication
Knowledge & Information Systems, 2014, Vol 38, Issue 2, p447
- ISSN
0219-1377
- Publication type
Article
- DOI
10.1007/s10115-012-0594-6