We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
A NEW OPEN INFORMATION EXTRACTION SYSTEM USING SENTENCE DIFFICULTY ESTIMATION.
- Authors
RESHADAT, Vahideh; FAILI, Heshaam
- Abstract
The World Wide Web has a considerable amount of information expressed using natural language. While unstructured text is often difficult for machines to understand, Open Information Extraction (OIE) is a relation-independent extraction paradigm designed to extract assertions directly from massive and heterogeneous corpora. Allocation of low-cost computational resources is a main demand for Open Relation Extraction (ORE) systems. A large number of ORE methods have been proposed recently, covering a wide range of NLP tools, from "shallow" (e.g., part-of-speech tagging) to "deep" (e.g., semantic role labeling). There is a trade-off between NLP tools depth versus efficiency (computational cost) of ORE systems. This paper describes a novel approach called Sentence Difficulty Estimator for Open Information Extraction (SDE-OIE) for automatic estimation of relation extraction difficulty by developing some difficulty classifiers. These classifiers dedicate the input sentence to an appropriate OIE extractor in order to decrease the overall computational cost. Our evaluations show that an intelligent selection of a proper depth of ORE systems has a significant improvement on the effectiveness and scalability of SDE-OIE. It avoids wasting resources and achieves almost the same performance as its constituent deep extractor in a more reasonable time.
- Subjects
DATA mining; INFORMATION storage &; retrieval systems; SCALABILITY; EXTRACTION (Chemistry); WORLD Wide Web; NATURAL languages
- Publication
Computing & Informatics, 2019, Vol 38, Issue 4, p986
- ISSN
1335-9150
- Publication type
Article
- DOI
10.31577/cai_2019_4_986