We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Applied Techniques for Twitter Data Retrieval in an Urban Area: Insight for Trip Production Modeling.
- Authors
Rayat, Rempu Sora; Dwicaksono, Adenantera; Putro, Heru P. H.; Dirgahayani, Puspita
- Abstract
This paper presents methods of retrieving Twitter data, both streaming and archive data, using Application Programming Interfaces. Twitter data are a kind of Location Based Social Network Data that, nowadays, is emerging in transportation demand modeling. Data regarding the locations of trip makers represent the most crucial step in the modeling. No research article has specifically addressed this topic with an up-to-date method; hence, this paper aims to refresh methods for retrieving Twitter data that can capture relevant data. The method is unique as the data are gathered for trip production modeling in zonal urban areas. Python script programs were built for both data retrieving methods. The programs were run for streaming data from May 2020 to April 2021 and archive data from 2018. The data were collected within Serang City, which is the nearest provincial city to Jakarta, the capital of Indonesia. In order to gather streaming data with no loss, the program has been run with referencing on sub-district office coordinate locations. Retrieving the intended data produces 1,090,623 documents, of which 54,103 are geotagged data from 2495 users. The study concluded that streaming data produce more geolocation data, while historical data capture more Twitter user data with relatively very little geotagged data and greater textual data than the period covered in this research. Thus, both techniques of retrieving Twitter data for urban personal trip modeling are necessary. Obtaining sufficient data collection using data streaming retrieval resulted in the most effective data preprocessing. This research contributes to Location Based Social Network data mining knowledge, both geolocation and text mining, and is useful for insight into developing trip production modeling in passenger transportation demand modeling using Machine Learning. This study also aims to provide useful methods for transportation system researchers and data scientists in utilizing Location Based Social Network data.
- Subjects
INDONESIA; JAKARTA (Indonesia); INFORMATION retrieval; MACHINE learning; LOCATION data; PYTHON programming language; INTELLIGENT transportation systems; TEXT mining; GEOTAGGING
- Publication
Applied Sciences (2076-3417), 2023, Vol 13, Issue 14, p8539
- ISSN
2076-3417
- Publication type
Article
- DOI
10.3390/app13148539