Norwegian version of this page

The NORM Corpus

The NORM Corpus consists of more than 5000 student essays collected in the research project Developing national standards for writing. A tool for teaching and learning (The NORM Project). The essays were written by pupils in school years 3, 4, 6 and 7 in comprehensive school. The corpus contains almost 1.3 million words.

The essays are of different kinds. There are essays from most school subjects, written for different purposes and through different acts of writing. In sum the essays give access to how writing as a key competency is put into practice in Norwegian comprehensive school. All essays are in Norwegian but were written by pupils from a variety of language backgrounds. Some texts are stored in multiple versions, a first draft and an elaborated version developed on the basis of teacher feedback.

Every text is available in three different formats: a scanned version of the original handwritten essay, a transcribed version of the original text (including errors and corrections), and a version corrected to comply with official orthography. Both transcribed versions are searchable.

The texts were collected during the period 2012-2014 and were transcribed by the NORM Project. The corpus was established in collaboration with the Text Laboratory.

The corpus is freely available for purposes of research and teaching. We point out that the texts cannot be used for commercial purposes.

When using the texts it should be referred to the Norm Corpus:

Published June 15, 2017 1:19 PM - Last modified June 26, 2017 3:20 PM