A New Approach Using Hidden Markov Model and Bayesian Method for Estimate of Word Types in Text Mining
Determining the structure of words in the text for the operations such as automated information extraction and text summarization of the text is essential. In computers, textual analysis to define the type of the word is considered as a vital advantage. Defining the types of words provides an estimate of the sequence of words in the sentence. In this article, estimating types of Turkish words is provided by developing a Hidden Markov Model and a Bayesian-based new model. In this model, an algorithm is developed which separates the suffixes of the words and grouping the words by counts of characters that suffixes of the words receive. A text composed of 584 Turkish words is used for the testing the dependability of the model. The model has achieved a high success rate in predicting the types of Turkish words.
Year of publication: |
2017
|
---|---|
Authors: | Doganer, Adem ; Calik, Sinan |
Published in: |
International Journal of Knowledge and Systems Science (IJKSS). - IGI Global, ISSN 1947-8216, ZDB-ID 2703502-5. - Vol. 8.2017, 4 (01.10.), p. 17-29
|
Publisher: |
IGI Global |
Subject: | Bayesian Methods | HMM | Modeling | Text Mining | Word Type |
Saved in:
Saved in favorites
Similar items by subject
-
Kim, Chae-yŏng, (2014)
-
A Bayesian local likelihood method for modelling parameter time variation in DSGE models
Galvão, Ana Beatriz C., (2015)
-
Solving and estimating indeterminate DSGE models
Farmer, Roger E. A., (2015)
- More ...