The following software may be of interest. All of it is released under the GNU General Public License.
Yoshikoder: Multilingual content analysis software in Java
Yoshikoder Converter: Converts PDF, DOC and HTML files to text for subsequent content analysis.
Python scripts for simple content analysis
RWordscores: R functions for Wordscoring.
JFreq: A Java command line application for computing word frequencies. Includes stemmers for 12 languages and optional stopword, currency and number removal.
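To give a rough idea of the kind of processing a word frequency tool performs, here is a minimal Python sketch of counting words with optional stopword, currency and number removal. It is not JFreq's code and does not reflect its options or output format; the function name, the tiny stopword list and the regular expressions are all assumptions made for illustration, and stemming is left out.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for illustration).
STOPWORDS = frozenset({"the", "a", "an", "of", "and", "to", "in", "is"})

# Assumed patterns: a currency symbol followed by digits, and a bare number.
CURRENCY = re.compile(r"^[$€£¥]\d[\d.,]*$")
NUMBER = re.compile(r"^\d[\d.,]*$")


def word_frequencies(text, remove_stopwords=True,
                     remove_currency=True, remove_numbers=True):
    """Count lowercased whitespace-separated tokens, applying optional filters."""
    counts = Counter()
    for raw in text.lower().split():
        # Trim surrounding punctuation so "plan." and "plan" count together.
        token = raw.strip(".,;:!?\"'()[]")
        if not token:
            continue
        if remove_stopwords and token in STOPWORDS:
            continue
        if remove_currency and CURRENCY.match(token):
            continue
        if remove_numbers and NUMBER.match(token):
            continue
        counts[token] += 1
    return counts


if __name__ == "__main__":
    sample = "The cost of the plan is $3.50, and 12 members voted for the plan."
    for word, n in word_frequencies(sample).most_common():
        print(word, n)
```

Each filter is a separate keyword argument so that, for example, numbers can be kept while stopwords are still dropped.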