International Corpus of English
home home

Tagging and Lemmatization

Along with publishing the part-of-speech tagged and lemmatized versions of VOICE, we regard it as essential to provide a detailed documentation of the guiding principles and decisions which have been taken in the process. Such a documentation is especially important with regard to the data at hand, as it often had to be dealt with in novel and unprecedented ways because of its spoken and variable nature. It is, therefore, highly recommended to users of VOICE to familiarize themselves with this manual when working with VOICE POS Online 2.0 and VOICE POS XML 2.0.

The Tagging and Lemmatization Manual can be downloaded here.

The recommended citation for the VOICE Tagging and Lemmatization Manual is:

VOICE Project. 2014. VOICE Part-of-Speech Tagging and Lemmatization Manual. (date of last access).