Automatic Ontology Learning from Multiple Knowledge Sources of Text
The prime textual sources used for ontology learning are a domain corpus and dynamic large text from web pages. The first source is limited and possibly outdated, while the second is uncertain. To overcome these shortcomings, a novel ontology learning methodology is proposed to utilize the different sources of text such as a corpus, web pages and the massive probabilistic knowledge base, Probase, for an effective automated construction of ontology. Specifically, to discover taxonomical relations among the concept of the ontology, a new web page based two-level semantic query formation methodology using the lexical syntactic patterns (LSP) and a novel scoring measure: Fitness built on Probase are proposed. Also, a syntactic and statistical measure called COS (Co-occurrence Strength) scoring, and Domain and Range-NTRD (Non-Taxonomical Relation Discovery) algorithms are proposed to accurately identify non-taxonomical relations(NTR) among concepts, using evidence from the corpus and web pages.
Year of publication: |
2018
|
---|---|
Authors: | Sathiya, B ; Geetha, T.V. |
Published in: |
International Journal of Intelligent Information Technologies (IJIIT). - IGI Global, ISSN 1548-3665, ZDB-ID 2400990-8. - Vol. 14.2018, 2 (01.04.), p. 1-21
|
Publisher: |
IGI Global |
Subject: | Knowledge Engineering | Pattern Analysis | Semantic Web | Ontology Learning | Probase | Statistical measure |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Semantic service oriented architectures in research and practice
Schillinger, Rolf, (2011)
-
Pull : the power of the semantic web to transform your business
Siegel, David, (2009)
-
Towards the ontology-based consolidation of production-centric standards
Chungoora, N., (2013)
- More ...
Similar items by person