We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Automatic Annotation Performance of TextBlob and VADER on Covid Vaccination Dataset.
- Authors
Alenzi, Badriya Murdhi; Khan, Muhammad Badruddin; Hasanat, Mozaherul Hoque Abul; Saudagar, Abdul Khader Jilani; AlKhathami, Mohammed; AlTameem, Abdullah
- Abstract
With the recent boom in the corpus size of sentiment analysis tasks, automatic annotation is poised to be a necessary alternative to manual annotation for generating ground truth dataset labels. This article aims to investigate and validate the performance of two widely used lexicon-based automatic annotation approaches, TextBlob and Valence Aware Dictionary and Sentiment Reasoner (VADER), by comparing them with manual annotation. The dataset of 5402 Arabic tweets was annotated manually, containing 3124 positive tweets, 1463 negative tweets, and 815 neutral tweets. The tweets were translated into English so that TextBlob and VADER could be used for their annotation. TextBlob and VADER automatically classified the tweets to positive, negative, and neutral sentiments and compared them with manual annotation. This study shows that automatic annotation cannot be trusted as the gold standard for annotation. In addition, the study discussed many drawbacks and limitations of automatic annotation using lexicon-based algorithms. The highest level of accuracies of 75% and 70% were achieved by TextBlob and VADER, respectively.
- Subjects
COVID-19 vaccines; ANNOTATIONS; SENTIMENT analysis; TASK analysis
- Publication
Intelligent Automation & Soft Computing, 2022, Vol 34, Issue 2, p1311
- ISSN
1079-8587
- Publication type
Article
- DOI
10.32604/iasc.2022.025861