Title

Bi-Gram Term Collocations-based Query Expansion Approach for Improving Arabic Information Retrieval.

Authors

Moawad, Ibrahim; Alromima, Waseem; Elgohary, Rania

Abstract

In the era of information overload, information retrieval systems are vital applications, and many researchers try to improve search results by introducing new methods. Unlike English, some languages such as Arabic have complex morphology and lack both linguistic and semantic resources. This paper proposes a language-independent, semantic-based information retrieval approach that expands the user query using bi-gram term collocations. The proposed approach makes two main contributions. First, the bi-gram term collocations used to expand the user query are mined automatically from the text corpus, so no external semantic resource is needed. Second, because of the complexity of the language's morphology, the system index is built from the corpus words themselves, which saves the cost and effort of stemming. A system prototype for the Arabic language was implemented and evaluated against the stem-based method. The experimental evaluation was conducted on the text of the Arabic Holy Quran. The results show that the proposed system outperforms the stem-based method in terms of precision and recall.
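
The idea described in the abstract, mining bi-gram collocations from the corpus itself and using them to expand a query, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how collocations are scored or selected, so the pointwise mutual information (PMI) score, the `min_count` threshold, the `top_k` cutoff, and the function names `mine_bigram_collocations` and `expand_query` are all assumptions made for illustration. Tokens are plain whitespace-split surface words, with no stemming, in line with the abstract's word-based index.

```python
from collections import Counter
import math


def mine_bigram_collocations(docs, min_count=2):
    """Return {(w1, w2): pmi} for adjacent word pairs seen at least min_count times.

    PMI scoring and the min_count threshold are illustrative assumptions,
    not details taken from the paper.
    """
    unigrams = Counter()
    bigrams = Counter()
    for doc in docs:
        tokens = doc.split()  # surface words only, no stemming
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))

    total_uni = sum(unigrams.values())
    total_bi = sum(bigrams.values())
    scores = {}
    for (w1, w2), n in bigrams.items():
        if n < min_count:
            continue
        p_pair = n / total_bi
        p_w1 = unigrams[w1] / total_uni
        p_w2 = unigrams[w2] / total_uni
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))
    return scores


def expand_query(query, collocations, top_k=2):
    """Append up to top_k collocates of each query term, highest score first."""
    terms = query.split()
    expanded = list(terms)
    for term in terms:
        partners = []
        for (w1, w2), score in collocations.items():
            if w1 == term:
                partners.append((score, w2))
            elif w2 == term:
                partners.append((score, w1))
        for _, partner in sorted(partners, reverse=True)[:top_k]:
            if partner not in expanded:
                expanded.append(partner)
    return expanded


# Toy corpus (hypothetical data, English for readability).
docs = ["holy book text", "holy book verse", "the book"]
collocations = mine_bigram_collocations(docs, min_count=2)
expanded = expand_query("holy", collocations)
```

Because the collocations come only from corpus counts, the sketch needs no external thesaurus or ontology, which is the property the abstract emphasizes. Any association measure could replace PMI without changing the overall flow.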

Subjects

DATA mining; AUTOMATIC extracting (Information science); INFORMATION storage & retrieval systems

Publication

Arabian Journal for Science & Engineering (Springer Science & Business Media B.V.), 2018, Vol 43, Issue 12, p7705

ISSN

2193-567X

Publication type

Academic Journal

DOI

10.1007/s13369-018-3145-y
