We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Optimizing Geospatial Data for ML/CV Applications: A Python-Based Approach to Streamlining Map Processing by Removing Irrelevant Areas.
- Authors
Kasperek, David; Podpora, Michal
- Abstract
Massive image datasets are often required for the proper functioning of Machine Learning (ML) and Computer Vision (CV) applications. This paper offers a solution to computational challenges in the Image Processing of satellite imagery, by proposing an optimization procedure. The presented approach is verified by an exemplary Python implementation, constituting a standalone tool for automating the dataset creation and labeling, including the extraction of road network data from the national satellite cartography provider. The collected data include detailed road maps along with the parcel information obtained via WebMapService endpoints. The method presented in this paper involves three basic steps: road segmentation (using the Shapely module) to facilitate handling high-resolution orthoimagery, and then a modified Region-of-Interest approach, i.e., removing irrelevant areas, with only roads remaining. This results in obtaining file sizes that are significantly smaller. The presented algorithm also involves asynchronous tile downloading, which, combined with the masking of irrelevant areas, improves not only the efficiency but surprisingly also the accuracy of subsequent ML/CV procedures. The research results of the paper reveal substantial file size reduction, and improved processing efficiency, thus making the optimized geospatial graphical data more practical for ML/CV applications, while still maintaining the original data quality and relevance of the analyzed parcels or infrastructure.
- Subjects
GEOSPATIAL data; IMAGE processing; REMOTE-sensing images; MACHINE learning; CARTOGRAPHY
- Publication
Applied Sciences (2076-3417), 2024, Vol 14, Issue 24, p11978
- ISSN
2076-3417
- Publication type
Academic Journal
- DOI
10.3390/app142411978