Recognition of Chemical Entities using Pattern Matching and Functional Group Classification
Chemical entity recognition faces two main challenges: (i) new chemical compounds are constantly being synthesized, and (ii) chemical representation is highly ambiguous, since the same chemical entity can be described under different nomenclatures. Identifying and maintaining chemical terminologies is therefore a difficult task. Because most existing text mining methods follow term-based approaches, they suffer from the problems of polysemy and synonymy. To address this, a Named Entity Recognition (NER) system based on pattern matching is developed for the chemical domain to extract chemical entities from chemical documents. The Tf-idf and PMI association measures are used to filter out non-chemical terms. An F-score of 92.19% is achieved for chemical NER. The proposed method is compared with a baseline method and other existing approaches. As the final step, the filtered chemical entities are classified into sixteen functional groups. The classification uses an SVM one-against-all multiclass approach and achieves an accuracy of 87%. One-way ANOVA is used to compare the quality of the pattern matching method against other existing chemical NER methods.
Year of publication: | 2016 |
---|---|
Authors: | Geetha, T. V. ; Hema, R. |
Published in: | International Journal of Intelligent Information Technologies (IJIIT). - IGI Global, ISSN 1548-3665, ZDB-ID 2400990-8. - Vol. 12.2016, 4 (01.10.), p. 21-44 |
Publisher: | IGI Global |
Subject: | Chemical NER | Classification | One-Way ANOVA | Pattern Matching | Patterns | Pointwise Mutual Information | Tf-idf |
Similar items by subject
- Alamoudi, Eman Saeed, (2021)
- An information-theoretic approach to the analysis of location and colocation patterns
  Dam, Alje van, (2023)
- The Impact of Different Types of External Lecturers in Higher Education on Student Learning Outcomes
  Erjavec, Jure, (2014)
Similar items by person