We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Social Media Rumor Refuter Feature Analysis and Crowd Identification Based on XGBoost and NLP.
- Authors
Li, Zongmin; Zhang, Qi; Wang, Yuhong; Wang, Shihang
- Abstract
Featured Application: Results of this work can be applied to anti-rumor microblog recommendation decisions for social media platforms, in order to reduce the impact of rumors by promoting the spread of the truth. One prominent dark side of online information behavior is the spreading of rumors. The feature analysis and crowd identification of social media rumor refuters based on machine learning methods can shed light on the rumor refutation process. This paper analyzed the association between user features and rumor refuting behavior in five main rumor categories: economics, society, disaster, politics, and military. Natural language processing (NLP) techniques are applied to quantify the user's sentiment tendency and recent interests. Then, those results were combined with other personalized features to train an XGBoost classification model, and potential refuters can be identified. Information from 58,807 Sina Weibo users (including their 646,877 microblogs) for the five anti-rumor microblog categories was collected for model training and feature analysis. The results revealed that there were significant differences between rumor stiflers and refuters, as well as between refuters for different categories. Refuters tended to be more active on social media and a large proportion of them gathered in more developed regions. Tweeting history was a vital reference as well, and refuters showed higher interest in topics related with the rumor refuting message. Meanwhile, features such as gender, age, user labels and sentiment tendency also varied between refuters considering categories.
- Subjects
WEIBO (Web resource); SOCIAL media; RUMOR; INFORMATION-seeking behavior; ACTIVE medium; NATURAL language processing
- Publication
Applied Sciences (2076-3417), 2020, Vol 10, Issue 14, p4711
- ISSN
2076-3417
- Publication type
Article
- DOI
10.3390/app10144711