Software

The following bits of software may be of interest. They are all released under the Gnu Public License.

Yoshikoder: Multilingual content analysis software in Java

Yoshikoder Converter: Converts pdf, doc and html files to text for subsequent content analysis.

Python scripts for simple content analysis

RWordscores: R functions for Wordscoring.

JFreq: A Java command line application for computing word frequencies. Includes stemmers for 12 languages and optional stopword, currency and number removal.