The Yoshikoder

Yoshikoder is a cross-platform multilingual content analysis program developed as part of the Identity Project at Harvard's Center for International Affairs. This page will always contain the latest version. Contact me (Will Lowe) if you want to be notified of new releases, make a request, complain, or send me a large check.

New Version (December 2005)

There's a new version! Go to http://www.yoshikoder.org to test it out.

Functionality

Yoshikoder allows you to load documents, construct and apply content analysis dictionaries, examine keywords-in-context, and perform basic content analyses, in any language.

In more detail: Yoshikoder works with text documents, whether in plain ASCII, Unicode (e.g. UTF-8), or a national encodings (e.g. Big5 Chinese.) You can construct, view, and save keywords-in-context. You can write content analysis dictionaries can be constructed using PERL-style regular expressions. Yoshikoder provides summaries of documents, either as word frequency tables or according to a content analysis dictionary. You can also compare documents according to word frequency profile or with respect to a content dictionary. Yoshikoder's native file format is XML, so dictionaries and keyword-in-context files are non-proprietary and human readable.

Screenshot

Yoshikoder on OSX (now out of date)

Download

Licensing

Yoshikoder is open-source software, released under the Gnu Public License. This licensing implies, among other things, that Yoshikoder is free for academic use.

Running Yoshikoder

Yoshikoder is packaged as a single Java archive. You need to have a recent version of Java on your machine to run it. If you are a Windows user and you downloaded the executable file, you should be automatically offered the chance to upgrade your Java if it is absent or insufficiently recent. For all others, see below for upgrade instructions.

Mac OSX:

Download and double-click the jar file, or click on the webstart link

Linux:

Type: java -jar Yoshikoder.jar or click on the webstart link.

Windows:

Download the executable file double-click to start, or click on the webstart link.

If you downloaded the java archive, you should also be able to double-click to start. However, Some users find that Winzip unpacks the jar file as if it were a zip archive. To avoid this behavior right-click save-as the jar file onto your desktop. Then right-click on the file and choose the open-with menu item. Choose java from your applications - it may be called 'javaw'. Check the 'remember my choice' box. Double-clicking should now launch the application.

Upgrading Java

Mac OSX:

Mac OSX version 10.3 (Panther) needs no update.

If you clicked on the webstart link, your java installation be checked automatically.

OSX version 10.2 (Jaguar) users may not have a sufficiently recent version of Java. To check: Open the Terminal application (found in the Utilities subfolder of Applications), and type 'java -version', without the quotes. If the version is 1.4.1 then no upgrade is necessary. If the version number is 1.3.1, then update Java using the update mechanism in System Preferences.

Windows:

If you downloaded the executable file, your java installation will be checked automatically, and you may be given the opportunity to upgrade your java at that time.

If you clicked on the webstart link your java installation will also be checked automatically.

To install Java on your machine manually, go to Sun's download page, and choose a suitable download link. If you expect to write in Java, choose Download J2SE SDK. If you only intend to run java programs, choose Download J2SE JRE, which will install only the relevant subpart of the SDK. Then follow the instructions.

Linux:

See the Windows description above. Note that Yoshikoder works with recent versions of IBM's Java (Blackdown).

On the Shoulders of Giants...

Yoshikoder depends on several excellent open-source java libraries viz. the Apache Foundation's Apache XML and POI Projects, and the Gnu Foundation's regular expressions package. Thanks folks.

...Stands a Midget

Despite this exemplary supporting cast, I learnt Java while writing the Yoshikoder, so don't expect much by way of elegant code...