We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
News Modeling and Retrieving Information: Data-Driven Approach.
- Authors
Hossain, Elias; Alshahrani, Abdullah; Rahman, Wahidur
- Abstract
This paper aims to develop Machine Learning algorithms to classify electronic articles related to this phenomenon by retrieving information and topic modelling. The Methodology of this study is categorized into three phases: the Text Classification Approach (TCA), the Proposed Algorithms Interpretation (PAI), and finally, Information Retrieval Approach (IRA). The TCA reflects the text preprocessing pipeline called a clean corpus. The Global Vectors for Word Representation (Glove) pre-trained model, FastText, Term Frequency-Inverse Document Frequency (TF-IDF), and Bag-of-Words (BOW) for extracting the features have been interpreted in this research. The PAI manifests the Bidirectional Long Short-Term Memory (Bi-LSTM) and Convolutional Neural Network (CNN) to classify the COVID-19 news. Again, the IRA explains the mathematical interpretation of Latent Dirichlet Allocation (LDA), obtained for modelling the topic of Information Retrieval (IR). In this study, 99% accuracy was obtained by performing K-fold cross-validation on Bi-LSTM with Glove. A comparative analysis between Deep Learning and Machine Learning based on feature extraction and computational complexity exploration has been performed in this research. Furthermore, some text analyses and the most influential aspects of each document have been explored in this study. We have utilized Bidirectional Encoder Representations from Transformers (BERT) as a Deep Learning mechanism in our model training, but the result has not been uncovered satisfactory. However, the proposed system can be adjustable in the real-time news classification of COVID-19.
- Subjects
LANGUAGE models; MACHINE learning; CONVOLUTIONAL neural networks; DEEP learning; SURGICAL gloves; FEATURE extraction
- Publication
Intelligent Automation & Soft Computing, 2023, Vol 38, Issue 2, p109
- ISSN
1079-8587
- Publication type
Article
- DOI
10.32604/iasc.2022.029511