- Title
Multimodal Semantic Collaborative Classification for Hyperspectral Images and LiDAR Data.
- Authors
Wang, Aili; Dai, Shiyu; Wu, Haibin; Iwahori, Yuji
- Abstract
Although the collaborative use of hyperspectral images (HSIs) and LiDAR data in land cover classification tasks has demonstrated significant importance and potential, several challenges remain. Notably, the heterogeneity in cross-modal information integration presents a major obstacle. Furthermore, most existing research relies heavily on category names alone, neglecting the rich contextual information available in language descriptions. Visual-language pretraining (VLP) has achieved notable success in image recognition within natural domains by using multimodal information to enhance training efficiency and effectiveness, and it has also shown great potential for land cover classification in remote sensing. This paper introduces a dual-sensor multimodal semantic collaborative classification network (DSMSC2N). It uses large language models (LLMs) in an instruction-driven manner to generate land cover category descriptions enriched with domain-specific remote sensing knowledge, guiding the model to focus on and extract key features accurately. Simultaneously, we integrate and optimize the complementary relationship between HSI and LiDAR data, enhancing the separability of land cover categories and improving classification accuracy. We conduct comprehensive experiments on the Houston 2013, Trento, and MUUFL Gulfport benchmark datasets, validating DSMSC2N's effectiveness against various baseline methods.
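The abstract's core idea, matching fused sensor features against embeddings of LLM-generated category descriptions, follows the CLIP-style zero-shot classification pattern. The sketch below is a minimal, hypothetical illustration of that pattern using random embeddings; it is not the paper's implementation, and all names (`classify`, `fused_feature`) are invented for illustration.

```python
import numpy as np

# Hypothetical sketch of CLIP-style matching: a fused HSI+LiDAR pixel
# feature is assigned the class whose text-description embedding is most
# similar under cosine similarity. Not the DSMSC2N code; embeddings here
# are random stand-ins for encoder outputs.

def l2_normalize(x, axis=-1):
    """Scale vectors to unit length along the given axis."""
    return x / np.linalg.norm(x, axis=axis, keepdims=True)

def classify(fused_feature, class_text_embeddings):
    """Return (predicted class index, cosine-similarity scores)."""
    f = l2_normalize(fused_feature)
    t = l2_normalize(class_text_embeddings)
    scores = t @ f  # shape: (num_classes,)
    return int(np.argmax(scores)), scores

rng = np.random.default_rng(0)
num_classes, dim = 3, 8
# Stand-ins for text embeddings of LLM-generated class descriptions.
text_emb = rng.normal(size=(num_classes, dim))
# A fused feature lying close to class 1's description embedding.
feature = text_emb[1] + 0.05 * rng.normal(size=dim)

pred, scores = classify(feature, text_emb)
print(pred)  # class index with the highest cosine similarity
```

In a real VLP pipeline the text embeddings would come from a frozen or fine-tuned language encoder and the fused feature from the HSI/LiDAR branches; only the similarity-and-argmax step above carries over directly.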
- Subjects
LANGUAGE models; REMOTE sensing; LAND cover; IMAGE recognition (Computer vision); LIDAR
- Publication
Remote Sensing, 2024, Vol 16, Issue 16, p3082
- ISSN
2072-4292
- Publication type
Article
- DOI
10.3390/rs16163082