Customise PoS tags with TreeTagger

It’s possible to add/modify the tagset employed by Treetagger. The solution involves a threefold methodology:
1. Retag a Penn Treebank compliant corpus
2. Train Treetagger on it
3. Used the trained .par file to tag another corpus with TreeTagger

More details on this paper (Gaillat, 2013).

This entry was posted in NLP and tagged . Bookmark the permalink.

Comments are closed.