txttool: Utilities for text analysis in Stata
This article describes txttool, a command that provides a set of tools for managing free-form text. The command integrates several built-in Stata functions with new text capabilities. These latter functions include a utility to create a bag-of-words representation of text and an implementation of Porter’s (1980, Program: Electronic library and information systems 14: 130–137) wordstemming algorithm. Collectively, these utilities provide a text-processing suite for text mining and other text-based applications in Stata. Copyright 2014 by StataCorp LP.
Year of publication: |
2014
|
---|---|
Authors: | Williams, Unislawa ; Williams, Sean P. |
Published in: |
Stata Journal. - StataCorp LP. - Vol. 14.2014, 4, p. 817-829
|
Publisher: |
StataCorp LP |
Subject: | txttool | text mining | Porter stemmer | bag of words | cleaning | stop words | subwords |
Saved in:
Online Resource
Saved in favorites
Similar items by subject
-
Davalos, Sergio, (2022)
-
Textual analysis in accounting and finance : a survey
Loughran, Tim, (2016)
-
A Comparison of Similarity Measures for Text Documents
Hariharan, Shanmugasundaram, (2008)
- More ...