Annotations in the Nordic Dialect Corpus

Chapter by Janne Bondi Johannessen in Handbook of Linguistic Annotation, 2017.

In this chapter I focus on annotation in the Nordic Dialect Corpus, a corpus of dialectal speech from five closely related languages. Two types of annotation are central: the annotation of the speech itself, i.e. transcription, and the annotation of grammatical categories, i.e. tagging. Both are described and discussed, with a special focus on the success, or lack thereof, of some key choices.

Access the chapter on the homepage of the Handbook of Linguistic Annotation.

Published Aug. 4, 2017 2:14 PM - Last modified Aug. 18, 2017 3:22 PM