We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
A hybrid approach for text summarization using semantic latent Dirichlet allocation and sentence concept mapping with transformer.
- Authors
Gurusamy, Bharathi Mohan; Rengarajan, Prasanna Kumar; Srinivasan, Parthasarathy
- Abstract
Automatic text summarization generates a summary that contains sentences reflecting the essential and relevant information of the original documents. Extractive summarization requires semantic understanding, while abstractive summarization requires a better intermediate text representation. This paper proposes a hybrid approach for generating text summaries that combine extractive and abstractive methods. To improve the semantic understanding of the model, we propose two novel extractive methods: semantic latent Dirichlet allocation (semantic LDA) and sentence concept mapping. We then generate an intermediate summary by applying our proposed sentence ranking algorithm over the sentence concept mapping. This intermediate summary is input to a transformer-based abstractive model fine-tuned with a multi-head attention mechanism. Our experimental results demonstrate that the proposed hybrid model generates coherent summaries using the intermediate extractive summary covering semantics. As we increase the concepts and number of words in the summary the rouge scores are improved for precision and F1 scores in our proposed model.
- Subjects
TEXT summarization; CONCEPT mapping; LATENT semantic analysis; NUMBER concept
- Publication
International Journal of Electrical & Computer Engineering (2088-8708), 2023, Vol 13, Issue 6, p6663
- ISSN
2088-8708
- Publication type
Article
- DOI
10.11591/ijece.v13i6.pp6663-6672