Annotations in the Nordic Dialect Corpus
Chapter by Janne Bondi Johannessen in Handbook of Linguistic Annotation, 2017.
In this chapter I focus on annotation in the Nordic Dialect Corpus, a corpus of dialectal speech from five closely related languages. Two types of annotation are central: the annotation of the speech itself, i.e. transcription, and the annotation of grammatical categories, i.e. tagging. Both are described and discussed, with a special focus on the success, or lack thereof, of some key choices.