Research on Digital Forensics Based on Uyghur Web Text Classification
This paper mainly discusses the use of mutual information (MI) and Support Vector Machines (SVMs) for Uyghur Web text classification and digital forensics process of web text categorization: automatic classification and identification, conversion and pretreatment of plain text based on encoding features of various existing Uyghur Web documents etc., introduces the pre-paratory work for Uyghur Web text encoding. Focusing on the non-Uyghur characters and stop words in the web texts filtering, we put forward a Multi-feature Space Normalized Mutual Information (M-FNMI) algorithm and replace MI between single feature and category with mutual information (MI) between input feature combination and category so as to extract more accurate feature words; finally, we classify features with support vector machine (SVM) algorithm. The experimental result shows that this scheme has a high precision of classification and can provide criterion for digital forensics with specific purpose.
Year of publication: |
2017
|
---|---|
Authors: | Aizezi, Yasen ; Jamal, Anwar ; Abudurexiti, Ruxianguli ; Muming, Mutalipu |
Published in: |
International Journal of Digital Crime and Forensics (IJDCF). - IGI Global, ISSN 1941-6229, ZDB-ID 2703224-3. - Vol. 9.2017, 4 (01.10.), p. 30-39
|
Publisher: |
IGI Global |
Subject: | Digital Forensic | Mutual Information | Support Vector Machine | Text Classification | Uyghur |
Saved in:
Saved in favorites
Similar items by subject
-
Classifying businesses by economic activity using web-based text mining
Roelands, Maarten, (2017)
-
Text classification for organizational researchers : a tutorial
Kobayashi, Vladimer B., (2018)
-
Bhraguram, T. M., (2024)
- More ...