Methodology for Text Classification using Manually Created Corpora-Based Sentiment Dictionary
This paper presents a methodology for textual content classification based on a combination of algorithms: preliminary formation of a contextual framework for the texts in a particular problem area; manual creation of a Hierarchical Sentiment Dictionary (HSD) on the basis of a topically oriented corpus; and recognition of text tonality by using the HSD to analyse documents as collections of topically complete fragments (paragraphs). The proposed methodology was verified in a case study of corpora of Polish-language film reviews. The main scientific contributions of this research are the following findings: the writing style of the analysed text determines whether text classification algorithms can be adapted; the hierarchically oriented structure of the HSD allows the classification process to be customised so that text tonality is recognised qualitatively in the context of the topics of individual paragraphs; and texts in the persuasive style are most often given a certain tonality by their authors from the outset. The tone expressed in the author's opinion affects the qualitative indicators of sentiment recognition. Negative emotions of the author usually reduce the level of vocabulary variability as well as the variety of topics raised in the document, but at the same time increase the unpredictability of the contextual use of words with both positive and negative emotional colouring.
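The paragraph-level use of a hierarchical dictionary described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the two-level `HSD` structure (topic, then word, then polarity), the English placeholder entries, the "most dictionary hits" rule for picking a paragraph's topic, and the function names `split_paragraphs`, `score_paragraph`, and `classify_document` are all assumptions made for illustration.

```python
import re

# Hypothetical two-level dictionary: topic -> word -> polarity (+1 or -1).
# Entries are illustrative English placeholders, not the paper's Polish HSD.
HSD = {
    "acting": {"brilliant": 1, "convincing": 1, "wooden": -1},
    "plot": {"gripping": 1, "predictable": -1, "boring": -1},
}


def split_paragraphs(text):
    """Split a document into paragraphs separated by blank lines."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def score_paragraph(paragraph, hsd=HSD):
    """Return (topic, score) for the topic with the most dictionary hits.

    Assumption: a paragraph belongs to the topic whose lexicon matches
    the most tokens; its score is the sum of the matched polarities.
    Returns (None, 0) when no dictionary word occurs in the paragraph.
    """
    tokens = re.findall(r"\w+", paragraph.lower())
    best_topic, best_hits, best_score = None, 0, 0
    for topic, lexicon in hsd.items():
        hits = [lexicon[t] for t in tokens if t in lexicon]
        if len(hits) > best_hits:
            best_topic, best_hits, best_score = topic, len(hits), sum(hits)
    return best_topic, best_score


def classify_document(text, hsd=HSD):
    """Aggregate paragraph scores into a document-level label."""
    per_paragraph = [score_paragraph(p, hsd) for p in split_paragraphs(text)]
    total = sum(score for _, score in per_paragraph)
    label = "positive" if total > 0 else "negative" if total < 0 else "neutral"
    return label, per_paragraph


review = "The acting was brilliant and convincing.\n\nThe plot was predictable."
label, parts = classify_document(review)
print(label, parts)
```

Scoring each paragraph against a single topic lexicon keeps words from unrelated topics from leaking into its tonality, which is one plausible reading of analysing a document as a collection of topical fragments.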
Year of publication: | 2019 |
---|---|
Authors: | Rizun, Nina |
Other Persons: | Waloszek, Wojciech (contributor) |
Publisher: | [2019]: [S.l.] : SSRN |
freely available
Similar items by person
- Rizun, Nina, (2019)
- Rizun, Nina, (2019)
- Rizun, Nina, (2021)