We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
A PROBABILISTIC APPROACH TO MULTI-DOCUMENT SUMMARIZATION FOR GENERATING A TILED SUMMARY.
- Authors
SARAVANAN, M.; RAMAN, S.; RAVINDRAN, B.
- Abstract
Data availability is not a major issue at present times in view of the widespread use of Internet; however, information and knowledge availability are the issues. Due to data overload and time-critical nature of information need, automatic summarization of documents plays a significant role in information retrieval and text data mining. This paper discusses the design of a multi-document summarizer that uses Katz's K-mixture model for term distribution. The model helps in ranking the sentences by a modified term weight assignment. Highly ranked sentences are selected for the final summary. The sentences that are repetitive in nature are eliminated, and a tiled summary is produced. Our method avoids redundancy and produces a readable (even browsable) summary, which we refer to as an event-specific tiled summary. The system has been evaluated against the frequently occurring sentences in the summaries generated by a set of human subjects. Our system outperforms other auto-summarizers at different extraction levels of summarization with respect to the ideal summary, and is close to the ideal summary at 40% extraction level.
- Subjects
INFORMATION services; KNOWLEDGE management; INFORMATION retrieval; INTERNET; COMPUTATIONAL intelligence
- Publication
International Journal of Computational Intelligence & Applications, 2006, Vol 6, Issue 2, p231
- ISSN
1469-0268
- Publication type
Article
- DOI
10.1142/S1469026806001976