EBSCO Logo
Connecting you to content on EBSCOhost
Title

Optimizing Geospatial Data for ML/CV Applications: A Python-Based Approach to Streamlining Map Processing by Removing Irrelevant Areas.

Authors

Kasperek, David; Podpora, Michal

Abstract

Massive image datasets are often required for the proper functioning of Machine Learning (ML) and Computer Vision (CV) applications. This paper offers a solution to computational challenges in the Image Processing of satellite imagery, by proposing an optimization procedure. The presented approach is verified by an exemplary Python implementation, constituting a standalone tool for automating the dataset creation and labeling, including the extraction of road network data from the national satellite cartography provider. The collected data include detailed road maps along with the parcel information obtained via WebMapService endpoints. The method presented in this paper involves three basic steps: road segmentation (using the Shapely module) to facilitate handling high-resolution orthoimagery, and then a modified Region-of-Interest approach, i.e., removing irrelevant areas, with only roads remaining. This results in obtaining file sizes that are significantly smaller. The presented algorithm also involves asynchronous tile downloading, which, combined with the masking of irrelevant areas, improves not only the efficiency but surprisingly also the accuracy of subsequent ML/CV procedures. The research results of the paper reveal substantial file size reduction, and improved processing efficiency, thus making the optimized geospatial graphical data more practical for ML/CV applications, while still maintaining the original data quality and relevance of the analyzed parcels or infrastructure.

Subjects

GEOSPATIAL data; IMAGE processing; REMOTE-sensing images; MACHINE learning; CARTOGRAPHY

Publication

Applied Sciences (2076-3417), 2024, Vol 14, Issue 24, p11978

ISSN

2076-3417

Publication type

Academic Journal

DOI

10.3390/app142411978

EBSCO Connect | Privacy policy | Terms of use | Copyright | Manage my cookies
Journals | Subjects | Sitemap
© 2025 EBSCO Industries, Inc. All rights reserved