Intelligent Systems Lab Amsterdam


Please see the new ISLA website at: http://isla.science.uva.nl

Hungarian stemmers

We provide two stemmers for Hungarian: a light stemmer and a heavy stemmer. The light stemmer is less invasive than the heavy stemmer altering fewer words. It attempts to stem nouns and adjectives. The heavy stemmer also stems verbs and numerals.

The package (click here) contains Snowball, C and Java versions of both stemmers, as well as a read me file.

If you publish results obtained using this resource, please cite this paper:

  • Four Stemmers and a Funeral: Stemming in Hungarian at CLEF 2005, A. Tordai and M. de Rijke. In: C. Peters et al, editors, Proceedings CLEF 2005, to appear. [PDF]

Maintained by Balder ten Cate. Last edited on Mon, 25 Jan 2010 13:39:10 +0100