Summary

This page gives access to our code for active learning and text classification, developed by Mauro Maggioni's research group at Duke University.

The basic idea is that expert labeling of documents is expensive. We try to minimize this cost while still labeling as many documents as possible, by diffusing the expert's labels over the document-document graph.
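
To make the idea of diffusing labels concrete, here is a minimal sketch in Python with NumPy. It is not the group's code: it uses a generic clamped label-propagation scheme as a stand-in, and the function name diffuse_labels, its parameters, and the toy six-document graph are all assumptions made for illustration. Each unlabeled document repeatedly averages the class scores of its neighbors, while the expert-labeled documents are held fixed.

```python
import numpy as np


def diffuse_labels(W, labels, n_classes, n_steps=50):
    """Spread a few known labels over a weighted document graph.

    W      : (n, n) symmetric matrix of non-negative similarity weights.
    labels : length-n integer array, class index for expert-labeled
             documents and -1 for unlabeled ones.
    Returns the predicted class per document and the score matrix.
    (Hypothetical helper, not part of the released code.)
    """
    W = np.asarray(W, dtype=float)
    labels = np.asarray(labels)
    n = W.shape[0]

    # Row-normalize W into a random-walk transition matrix P.
    degrees = W.sum(axis=1)
    degrees[degrees == 0] = 1.0  # isolated documents keep an all-zero row
    P = W / degrees[:, None]

    # One-hot scores for the expert-labeled documents, zeros elsewhere.
    known_idx = np.flatnonzero(labels >= 0)
    Y = np.zeros((n, n_classes))
    Y[known_idx, labels[known_idx]] = 1.0

    F = Y.copy()
    for _ in range(n_steps):
        F = P @ F                    # average the neighbors' scores
        F[known_idx] = Y[known_idx]  # clamp the expert labels
    return F.argmax(axis=1), F


# Toy graph (assumed data): documents 0-2 and 3-5 form two tight
# clusters joined by one weak edge between documents 2 and 3.
W = np.zeros((6, 6))
for i, j, w in [(0, 1, 1.0), (1, 2, 1.0), (0, 2, 1.0),
                (3, 4, 1.0), (4, 5, 1.0), (3, 5, 1.0),
                (2, 3, 0.1)]:
    W[i, j] = W[j, i] = w

# The expert labels only two documents: 0 (class 0) and 5 (class 1).
labels = np.array([0, -1, -1, -1, -1, 1])
predicted, scores = diffuse_labels(W, labels, n_classes=2)
print(predicted)
```

With only two expert labels, every document in this toy graph ends up assigned to the class of the labeled document in its own cluster, which is the cost saving the approach is after. Choosing which documents to send to the expert next is the active learning part and is not covered by this sketch.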

Code & Data

This is Eric Monson's current code as of 14 Oct 2010. README.txt files in the various directories explain some of the contents.