We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
GeoKnowledgeFusion: A Platform for Multimodal Data Compilation from Geoscience Literature.
- Authors
Guo, Zhixin; Wang, Chaoyang; Zhou, Jianping; Zheng, Guanjie; Wang, Xinbing; Zhou, Chenghu
- Abstract
With the advent of big data science, the field of geoscience has undergone a paradigm shift toward data-driven scientific discovery. However, the abundance of geoscience data distributed across multiple sources poses significant challenges to researchers in terms of data compilation, which includes data collection, collation, and database construction. To streamline the data compilation process, we present GeoKnowledgeFusion, a publicly accessible platform for the fusion of text, visual, and tabular knowledge extracted from the geoscience literature. GeoKnowledgeFusion leverages a powerful network of models that provide a joint multimodal understanding of text, image, and tabular data, enabling researchers to efficiently curate and continuously update their databases. To demonstrate the practical applications of GeoKnowledgeFusion, we present two scenarios: the compilation of Sm-Nd isotope data for constructing a domain-specific database and geographic analysis, and the data extraction process for debris flow disasters. The data compilation process for these use cases encompasses various tasks, including PDF pre-processing, target element recognition, human-in-the-loop annotation, and joint multimodal knowledge understanding. The findings consistently reveal patterns that align with manually compiled data, thus affirming the credibility and dependability of our automated data processing tool. To date, GeoKnowledgeFusion has supported forty geoscience research teams within the program by processing over 40,000 documents uploaded by geoscientists.
- Subjects
DATABASES; EARTH sciences; GEODATABASES; DATA extraction; BIG data
- Publication
Remote Sensing, 2024, Vol 16, Issue 9, p1484
- ISSN
2072-4292
- Publication type
Article
- DOI
10.3390/rs16091484