The English-Norwegian Parallel Corpus: word, lemma and tag lists

The English-Norwegian Parallel Corpus

 

The following lists are meant as guides to searching in the tagged version of the ENPC. Whenever frequencies are included, they are only indicative of the frequency of a particular word, lemma or tag, and cannot and must not be used to draw any (statistical) conclusions about the material. The main reason for this is that many words (tokens) in the material give rise to several lemmas and tags. This again is a consequence of the fact that little or no disambiguation has been performed after tagging.

Words, lemmas and tags in the tagged version of the ENPC

  1. English (original texts only)
  2. Norwegian original texts
  3. Norwegian translated texts

January 2002, Jarle Ebeling

Published July 6, 2010 10:39 AM - Last modified Nov. 12, 2010 2:37 PM