Machine Learning Approach for Multi-Layered Detection of Chemical Named Entities in Text
Identification of chemical named entities in text and subsequent linkage of information to biological events is of immense value to fulfill the knowledge needs of pharmaceutical and chemical R&D. A significant amount of investigation has been carried out since a decade for identifying chemical named entities at morphological level. However, a barrier still remains in terms of value proposition to scientists at chemistry level. Therefore, the work described here aims to circumvent the information barrier by adaptation of a Conditional Random Fields-based approach for identifying chemical named entities at various levels namely generic chemical level, morphological level, and chemistry level. Substantial effort has been invested on generation of suitable multi-level annotated corpora. Recommended machine learning practices such as active learning-based training corpus generation and feature optimization have been systematically performed. Evaluation of system performance and benchmarking against the other state-of-the-approaches showed improved results.
Year of publication: |
2016
|
---|---|
Authors: | Biradar, Usha B. ; Gurulingappa, Harsha ; Khamari, Lokanath ; Giriyan, Shashikala |
Published in: |
International Journal of Software Science and Computational Intelligence (IJSSCI). - IGI Global, ISSN 1942-9037, ZDB-ID 2703774-5. - Vol. 8.2016, 1 (01.01.), p. 1-15
|
Publisher: |
IGI Global |
Subject: | Active Learning | Chemical Named Entity Recognition | Conditional Random Fields | Ensemble Module | F1 Measure | Machine Learning |
Saved in:
Saved in favorites
Similar items by subject
-
Economical Active Feature-value Acquisition through Expected Utility Estimation
Melville, Prem, (2005)
-
Active Feature-Value Acquisition
Saar-Tsechansky, Maytal, (2009)
-
Stock market extreme risk prediction based on machine learning : evidence from the American market
Ren, Tingting, (2024)
- More ...