Anders Nøklestad

Senior Engineer - Research Section
Image of Anders Nøklestad
Norwegian version of this page
Room HW 240
Username
Visiting address Niels Henrik Abels vei 36 P. A. Munchs hus 0313 Oslo
Postal address Postboks 1079 Blindern 0316 Oslo

I work at Humit, developing resources and tools for language researchers and students, in particular databases, language technology tools, and web interfaces for searching in corpora (grammatically analysed text or speech). I am also in charge of one of the servers belonging to Humit (previously the Text Laboratory).

I have a master's thesis (hovedfag) in linguistics and a PhD in computational linguistics/language technology, both from UiO.

Tags: Language Technology, Corpora, Digital Humanities, Linguistics, Norwegian Language, IT support for research

Publications

  • Honkapohja, Alpo; Thaisen, Jacob & Nøklestad, Anders (2024). A search tool based on language modelling developed for The Index of Middle English Prose. Open Research Europe. ISSN 2732-5121. 3. doi: 10.12688/openreseurope.16590.1. Full text in Research Archive
  • Haug, Dag Trygve Truslew; Yildirim, Ahmet; Hagen, Kristin & Nøklestad, Anders (2023). Rules and neural nets for morphological tagging of Norwegian - Results and challenges. In Alumäe, Tanel & Fishel, Mark (Ed.), Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa). University of Tartu. ISSN 978-99-1621-999-7. p. 425–435. Full text in Research Archive
  • Mæhlum, Petter; Haug, Dag Trygve Truslew; Jørgensen, Tollef Emil; Kåsen, Andre; Nøklestad, Anders & Rønningstad, Egil [Show all 9 contributors for this article] (2022). NARC – Norwegian Anaphora Resolution Corpus. International Conference on Computational Linguistics (ICCL) (COLING). ISSN 1525-2477. 29(7), p. 48–60. Full text in Research Archive
  • Kåsen, Andre; Hagen, Kristin; Nøklestad, Anders; Priestley, Joel; Solberg, Per Erik & Haug, Dag Trygve Truslew (2022). The Norwegian Dialect Corpus Treebank. In Calzolari, Nicoletta; Béchet, Frédéric; Blache, Philippe; Choukri, Khalid; Cieri, Christopher; Declerck, Thierry; Goggi, Sara; Isahara, Hitoshi; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Odijk, Jan & Piperidis, Stelios (Ed.), Proceedings of the Thirteenth Language Resources and Evaluation Conference. European Language Resources Association. ISSN 979-10-95546-72-6. p. 4827–4832. Full text in Research Archive
  • Lane, Pia; Hagen, Kristin; Nøklestad, Anders & Priestley, Joel (2022). Creating a corpus for Kven, a minority language in Norway. Nordlyd. ISSN 0332-7531. 46(1), p. 159–170. doi: 10.7557/12.6345. Full text in Research Archive
  • Borthen, Kaja; Søfteland, Åshild; Kveen, Perlaug Marie; Karagjosova, Elena & Nøklestad, Anders (2021). Finalpartiklar i norske talemål: Ei undersøking av variasjon knytt til geografi, alder og kjønn. Maal og Minne. ISSN 0024-855X. 113(1), p. 1–63. Full text in Research Archive
  • Søfteland, Åshild; Nøklestad, Anders; Priestley, Joel & Hagen, Kristin (2020). Glossa som forskningsverktøy. Hva folk søker etter og hva resultatene brukes til. Oslo Studies in Language (OSLa). ISSN 1890-9639. 11(2), p. 449–464. doi: 10.5617/osla.8512. Full text in Research Archive
  • Kåsen, Andre; Hagen, Kristin; Johannessen, Janne Bondi; Nøklestad, Anders & Priestley, Joel (2020). Comparing methods for measuring dialect similarity in Norwegian. In Calzolari, Nicoletta; Béchet, Frédéric; Blache, Philippe; Choukri, Khalid; Cieri, Christopher; Declerck, Thierry; Goggi, Sara; Isahara, Hitoshi; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Moreno, Asuncion; Odijk, Jan & Piperidis, Stelios (Ed.), Proceedings of The 12th Language Resources and Evaluation Conference. European Language Resources Association. ISSN 979-10-95546-34-4. p. 5343–5350. Full text in Research Archive
  • Fjeld, Ruth E. Vatvedt; Nøklestad, Anders & Hagen, Kristin (2020). Leksikografisk Bokmålskorpus (LBK) – Bakgrunn og bruk. In Johannessen, Janne Bondi & Hagen, Kristin (Ed.), Leksikografi og korpus. En hyllest til Ruth Vatvedt Fjeld. Universitetet i Oslo. ISSN 9788291398129. p. 101–124. Full text in Research Archive
  • Kåsen, Andre; Hagen, Kristin; Nøklestad, Anders & Priestley, Joel (2019). Tagging a Norwegian Dialect Corpus. In Hartmann, Mareike & Plank, Barbara (Ed.), Proceedings of the 22nd Nordic Conference on Computational Linguistics (NoDaLiDa). Linköping University Electronic Press. ISSN 978-91-7929-995-8. p. 350–355. Full text in Research Archive
  • Lundquist, Bjørn; Larsson, Ida; Westendorp, Maud; Tengesdal, Eirik & Nøklestad, Anders (2019). Nordic Word Order Database: Motivations, methods, material and infrastructure. Nordic Atlas of Language Structures (NALS) Journal. ISSN 2387-2667. 4(1), p. 1–33. doi: 10.5617/nals.7529. Full text in Research Archive
  • Øvrelid, Lilja; Kåsen, Andre; Hagen, Kristin; Solberg, Per Erik; Johannessen, Janne Bondi & Nøklestad, Anders (2018). The LIA Treebank of Spoken Norwegian Dialects. In Calzolari, Nicoletta; Choukri, Khalid; Cieri, Christopher; Declerck, Thierry; Goggi, Sara; Hasida, Koiti; Isahara, Hitoshi; Maegaard, Bente; Mariani, Joseph; Mazo, Hélène; Moreno, Asuncion; Odijk, Jan; Piperidis, Stelios & Tokunaga, Takenobu (Ed.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation. European Language Resources Association. ISSN 979-10-95546-00-9. p. 4482–4488. Full text in Research Archive
  • Nøklestad, Anders; Hagen, Kristin; Johannessen, Janne Bondi; Kosek, Michał & Priestley, Joel (2017). A modernised version of the Glossa corpus search system. In Tiedemann, Jörg (Eds.), Proceedings of the 21st Nordic Conference on Computational Linguistics (NoDaLiDa). Linköping University Electronic Press. ISSN 978-91-7685-601-7. p. 251–254. Full text in Research Archive
  • Bick, Eckhard; Hagen, Kristin & Nøklestad, Anders (2015). Optimizing the Oslo-Bergen Tagger. In Bick, Eckhard & Hagen, Kristin (Ed.), Proceedings of the Workshop on “Constraint Grammar - methods, tools and applications” at NODALIDA 2015, May 11-13, 2015, Institute of the Lithuanian Language, Vilnius, Lithuania. Linköping University Electronic Press. ISSN 978-91-7519-037-2. p. 11–17.
  • Kosek, Michał; Nøklestad, Anders; Priestley, Joel; Hagen, Kristin & Johannessen, Janne Bondi (2015). Visualisation in Speech Corpora: Maps and Waves in the Glossa System. In Grigonytė, Gintarė; Clematide, Simon; Utka, Andrius & Volk, Martin (Ed.), Proceedings of the Workshop on Innovative Corpus Query and Visualization Tools at NODALIDA 2015. Linköping University Electronic Press. ISSN 978-91-7519-035-8. p. 23–31. Full text in Research Archive
  • Kapociute-Dzikiene, Jurgita; Nøklestad, Anders; Johannessen, Janne Bondi & Krupavicius, Algis (2013). Exploring Features for Named Entity Recognition in Lithuanian Text Corpus. In Oepen, Stephan; Hagen, Kristin & Johannessen, Janne Bondi (Ed.), Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013). Linköping University Electronic Press. ISSN 978-91-7519-589-6. p. 73–88. Full text in Research Archive
  • Johannessen, Janne Bondi; Priestley, Joel; Hagen, Kristin; Nøklestad, Anders & Lynum, Andre (2012). The Nordic Dialect Corpus. In Calzolari, Nicoletta; Choukri, Khalid; Declerck, Thierry; Ugur Dogan, Mehmet; Maegaard, Bente; Mariani, Joseph & Odijk, Jan (Ed.), Proceedings of the Eighth International Conference on Language Resources and Evaluation. European Language Resources Association. ISSN 978-2-9517408-7-7. p. 3388–3391. Full text in Research Archive
  • Johannessen, Janne Bondi; Hagen, Kristin; Lynum, André & Nøklestad, Anders (2012). OBT-stat A combined rule-based and statistical tagger. In Andersen, Gisle (Eds.), Exploring newspaper language : using the web to create and investigate a large corpus of modern Norwegian. John Benjamins Publishing Company. ISSN 978-90-272-0354-0. p. 51–65. doi: 10.1075/scl.49.03joh.
  • Lynum, Andre; Hagen, Kristin; Johannessen, Janne Bondi & Nøklestad, Anders (2011). OBT+Stat: Evaluation of a combined CG and statistical tagger. NEALT Proceedings Series. ISSN 1736-8197. 14, p. 26–34.
  • Hagen, Kristin & Nøklestad, Anders (2010). Bruk av et norsk leksikon til tagging og andre språkteknologiske formål. LexicoNordica. ISSN 0805-2735. p. 55–72.
  • Johannessen, Janne Bondi; Hagen, Kristin; Nøklestad, Anders & Priestley, Joel (2010). Enhancing Language Resources with Maps. In Calzolari, Nicoletta; Choukri, Khalid; Maegaard, Bente; Mariani, Joseph; Odijk, Jan; Piperidis, Stelios; Rosner, Mike & Tapias, Daniel (Ed.), Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). European Language Resources Association. ISSN 2-9517408-6-7. p. 1081–1088. Full text in Research Archive
  • Johannessen, Janne Bondi; Priestley, Joel & Nøklestad, Anders (2010). A MultilingualSpeech Resource: The Nordic Dialect Corpus. In Otoguro, Ryo; Ishikawa, Kiyoshi; Umemoto, Hiroshi; Yoshimoto, Kei & Harada, Yasunari (Ed.), Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation. Waseda University. ISSN 978-4-905166-00-9. p. 749–758. Full text in Research Archive
  • Lindstad, Arne Martinus; Nøklestad, Anders; Johannessen, Janne Bondi & Vangsnes, Øystein Alexander (2009). The Nordic Dialect Database: Mapping Microsyntactic Variation in the Scandinavian Languages. NEALT Proceedings Series. ISSN 1736-8197. 4.
  • Johannessen, Janne Bondi; Nygaard, Lars; Priestley, Joel & Nøklestad, Anders (2008). Glossa: a Multilingual, Multimodal, Configurable User Interface. In Calzolari, Nicoletta (Eds.), Proceedings of the 6th International Conference on Language Resources and Evaluation. European Language Resources Association. ISSN 2-9517408-4-0. Full text in Research Archive
  • Søfteland, Åshild & Nøklestad, Anders (2008). "Manuell morfologisk tagging av NoTa-materialet med støtte fra en statistisk tagger" . In Johannessen, Janne Bondi & Hagen, Kristin (Ed.), Språk i Oslo. Ny forskning omkring talespråk. Novus Forlag. ISSN 978-82-7099-471-7. p. 226–234.
  • Nøklestad, Anders & Søfteland, Åshild (2007). "Tagging a Norwegian speech corpus". In Nivre, Joakim; Kaalep, Heiki-Jaan; Muischnek, Kadri & Koit, Mare (Ed.), NODALIDA 2007 PROCEEDINGS. University of Tartu. ISSN 978-9985-4-0513-0. p. 245–248.
  • Johannessen, Janne Bondi; Hagen, Kristin; Haaland, Åsne ; Nøklestad, Anders; Jónsdottir, Andra Björk & Kokkinakis, Dimitris [Show all 9 contributors for this article] (2005). Named Entity Recognition for the Mainland Scandinavian Languages. Literary & Linguistic Computing. ISSN 0268-1145. 20(1), p. 91–102.
  • Nøklestad, Anders (2005). Semi-unsupervised PP attachment disambiguation for Norwegian. Archives of Control Sciences. ISSN 0004-072X. 15(1), p. 385–396.
  • Nøklestad, Anders (2001). A Connectionist Model of Past Tense Acquisition in Norwegian, A Cognitive Approach to the Verb: Morphological and Constructional Perspectives. De Gruyter Mouton. ISSN 3-11-017031-0. p. 165–187.

View all works in Cristin

  • Haug, Dag Trygve Truslew; Yildirim, Ahmet; Nøklestad, Anders & Kristen, Hagen (2023). Rules and neural nets for morphological tagging of Norwegian - Results and challenges.
  • Larsson, Ida; Lundquist, Bjørn; Westendorp, Maud; Nøklestad, Anders & Tengesdal, Eirik (2020). The Nordic Word Order Database.
  • Golden, Anne; Nøklestad, Anders & Johansson, Sofie (2019). The vocabulary of the Nordic heritage speakers in the US. An attempt of categorization.
  • Lundquist, Bjørn; Larsson, Ida; Westendorp, Maud; Tengesdal, Eirik & Nøklestad, Anders (2019). Presenting the Nordic Word Order Database.
  • Borthen, Kaja; Nøklestad, Anders; Søfteland, Åshild; Kveen, Perlaug Marie & Karagjosova, Elena (2018). Database for etterstilte småord i norsk (Småord-databasen).
  • Johannessen, Janne Bondi; Askeland, Anne Renette; Hagen, Kristin; Håberg, Live; Jensen, Bård Uri & Nøklestad, Anders [Show all 8 contributors for this article] (2018). Utfordringa med fonetisk transkripsjon av dialekter i den digitale tidsalderen: Oslo-translitteratoren.
  • Nøklestad, Anders; Hagen, Kristin; Johannessen, Janne Bondi; Kosek, Michał & Priestley, Joel (2017). A Modernised Version of the Glossa Corpus Search System.
  • Johannessen, Janne Bondi; Askeland, Anne Renette; Hagen, Kristin; Håberg, Live; Kristoffersen, Gjert & Nøklestad, Anders [Show all 9 contributors for this article] (2017). Hundre nye dialektordlister.
  • Søfteland, Åshild & Nøklestad, Anders (2016). Korpus-workshop.
  • Johannessen, Janne Bondi; Vangsnes, Øystein A; Lundquist, Bjørn; Larsson, Ida; Bentzen, Kristine & Garbacz, Piotr [Show all 10 contributors for this article] (2014). Nye isoglosser illustrert i det nye nettstedet for nordisk språk: NALS – Nordic Atlas of Language Structures (Online).
  • Priestley, Joel; Johannessen, Janne Bondi; Hagen, Kristin; Nøklestad, Anders & Lynum, André (2012). Maps as a central linguistic research tool.
  • Johannessen, Janne Bondi; Priestley, Joel; Hagen, Kristin; Nøklestad, Anders & Lynum, André (2012). The Nordic Dialect Corpus.
  • Johannessen, Janne Bondi & Nøklestad, Anders (2010). Recent developments in the Nordic Dialect Corpus and the Nordic Syntactic Judgments Database.
  • Johannessen, Janne Bondi; Hagen, Kristin; Nøklestad, Anders & Priestley, Joel (2010). Enhancing Language Resources with Maps.
  • Nøklestad, Anders (2010). Bruk av et norsk leksikon til tagging og andre språkteknologiske formål.
  • Johannessen, Janne Bondi; Nøklestad, Anders & Priestley, Joel (2007). Developing multi-Scandinavian word-lists for multi-Scandinavian texts.
  • Johannessen, Janne Bondi; Hagen, Kristin; Laake, Signe; Lindstad, Arne Martinus; Vangsnes, Øystein A. & Åfarli, Tor A. [Show all 7 contributors for this article] (2007). Dialektkorpus - presentasjon av prosjekt, metode, innsamling og materiale.
  • Søfteland, Åshild & Nøklestad, Anders (2006). ”Manuell morfologisk tagging av NoTa-materialet med støtte fra en statistisk tagger”.
  • Nøklestad, Anders (2005). Memory-based PP Attachment Disambiguation for Norwegian.
  • Samdal, Gunn Inger Lyse & Nøklestad, Anders (2005). A distributional or a translational basis for data-driven sense discrimination?
  • Nøklestad, Anders; Johansson, Christer & van den Bosch, Antal (2004). Pronominal anaphora resolution in Norwegian using TiMBL and z-scores.
  • Nøklestad, Anders (2004). Memory-based Classification of Proper Names in Norwegian.
  • Johannessen, Janne Bondi; Nøklestad, Anders; Hagen, Kristin & Lindstad, Arne Martinus (2000). Det åpne laboratoriet. [Newspaper]. Apollon.
  • Johannessen, Janne Bondi; Nøklestad, Anders & Hagen, Kristin (2000). A Web-Based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts.
  • Johannessen, Janne Bondi; Nøklestad, Anders & Hagen, Kristin (2000). A Web-Based Advanced and User Friendly System: The Oslo Corpus of Tagged Norwegian Texts.
  • Johannessen, Janne Bondi & Nøklestad, Anders (1999). Tavle-analyse ut - data-analyse inn. [Newspaper]. Aftenposten.
  • Johannessen, Janne Bondi & Nøklestad, Anders (1999). Mot et maksimalt brukervennlig korpus.
  • Hagen, Kristin; Johannessen, Janne Bondi & Nøklestad, Anders (1999). The shortcomings of a tagger.
  • Johannessen, Janne Bondi & Nøklestad, Anders (1999). Oslo-korpuset av taggede, norske tekster.
  • Johannessen, Janne Bondi & Nøklestad, Anders (1999). Oslo-korpuset av taggede, norske tekster.
  • Hagen, Kristin; Nøklestad, Anders & Johannessen, Janne Bondi (1998). A Constraint-based Tagger for Norwegian.
  • Nøklestad, Anders (2009). A Machine Learning Approach to Anaphora Resolution Including Named Entity Recognition, PP Attachment Disambiguation, and Animacy Detection. Unipub forlag. ISSN 0806-3222.

View all works in Cristin

Published Dec. 14, 2010 3:20 PM - Last modified Nov. 17, 2023 9:49 AM

Projects