Martin Porter’s stemming algorithm.
This is a refactored version of the original ANSI C implementation, with a command-line interface (CLI) added.
Install from source:
$ make install
CLib:
$ clib install wooorm/stmr.c
Usage: stmr [options] file
Options:
  -h, --help           output usage information
  -v, --version        output version number
  -e, --eval string    output stemmed word
Usage:
# stem a word
$ stmr -e nationalism
# nation
# print stems
$ stmr in.txt
# write stems to out.txt
$ stmr in.txt > out.txt
# stdin and stdout
$ echo "Internationalise" | stmr
# internationalis
MIT © Titus Wormer