Main Page
The Clair library is a suite of open-source Perl modules intended to simplify a number of generic tasks in natural language processing (NLP), information retrieval (IR), and network analysis (NA). Its architecture also allows for external software to be plugged in with very little effort.
- Code - Clairlib comprises over 100 modules covering functionality for a wide range of tasks
- Documentation - full API documentation in PDF and HTML format
Getting Started
- Installation - instructions for installing Perl and Clairlib
- Getting Started - how to get up and running, including trying out Clairlib's examples
Latest Version: Clairlib-Core 1.07 [Jun 2009]
- Added Clair::Network::Spectral for spectral partitioning using the Fiedler vector.
- Made Clairlib independent of MEAD (MEAD is no longer required for Clairlib).
- Added Naive Bayes learning and classification.
- Added tests for feature extraction, learning, and classification.
- Fixed a bug in Clair::Cluster::create_lexical_network().
- Added sampling options to Clair::Cluster.
- Added a "No IDF" option and sampling capabilities to the corpus_to_cos.pl utility.
- Fixed documentation typos.
- Added new tutorials to the documentation.
- Fixed a bug in Clair::Utils::CorpusDownload.
- Added a 'manual weights' option to the make_synth_collection utility.
- Fixed a bug in extract_ngrams.
For full details see the Development page.
More about Clairlib
- Contribute - ways to contribute to Clairlib
- Development - learn about Clairlib development
- FAQ - answers to frequently asked questions
- Clairlib-dev - mailing list for discussion among Clairlib developers (and users)
- People - Clairlib developers and contributors
- Presentation - an introduction to Clairlib (from October 2006)
- Projects - ideas for student projects using Clairlib
- NLP - Wikipedia entry on natural language processing
- If you publish work that uses Clairlib, please acknowledge its creators by citing the following BibTeX entry:
@techreport{Radev&al.07a,
  author = "Radev, Dragomir R. and Hodges, Mark and Fader, Anthony and Joseph, Mark and Gerrish, Joshua and Schaller, Mark and dePeri, Jonathan and Gibson, Bryan",
  title = "CLAIRLIB Documentation v1.07",
  institution = "University of Michigan. Department of Electrical Engineering and Computer Science",
  pdf = "http://tangra.si.umich.edu/~radev/papers/csetr536-07.pdf",
  postscript = "http://tangra.si.umich.edu/~radev/papers/csetr536-07.ps",
  year = "2007",
  number = "CSE-TR-536-07",

