Norwegian version of this page

Fakespeak – the language of fake news

Linguistic cues in fake news may be the key to its detection. 

The picture shows the words real and fake.

Photo: Silje Susanne Alvestad.

About the project

Fake news is defined as "news" items in which case the author knows that they are false and intends to deceive. The lion's share of the research on the detection of fake news is conducted by computer scientists alone.

However, linguists have shown that the linguistic features of a text vary according to its purpose. Thus, the language of fake news may be the key to its detection. This is the background of the linguistics-driven project "Fakespeak - the language of fake news. Fake news detection based on linguistic cues". The project involves a core team of linguists and computer scientists based in Norway and the UK.

Methods

The linguists will seek to reveal the grammatical and stylistic features of the language of fake news, referred to as "Fakespeak", in English, Norwegian and Russian.

To achieve this goal they will first build, and make use of existing, corpora of fake and real news from various online media outlets in all three languages, and then subject the datasets to thorough linguistic analyses.

The fake and genuine articles that we will compare will be written by one and the same author. This is to control for several potential sources of error. The linguists will apply methods and draw on insights from corpus linguistics, computational linguistics, applied linguistics, including forensic linguistics, as well as pragmatics and rhetoric.

Taking the linguists' findings as their point of departure, along with existing fake news detection systems, the computer scientists will seek to improve these systems by automating the defining features of Fakespeak.

Objectives

The overall aim of the project is to enable fake news detection systems to discover and flag potentially harmful fake news items in a more accurate, efficient and timely manner than offered by current state-of-the-art systems.

By automating all and only the features of Fakespeak, the project team will enable the systems to detect and flag only deliberate disinformation, excluding, for example, (inadvertent) misinformation, satirical texts, parody, and texts reflecting a certain set of opinions. Thus, the project will take societal safety and security into consideration while at the same time guarding the freedom of speech.

Financing

The Research Council of Norway, project-ID 302573.

Duration

2020-2025.

Cooperation

Events

The annual Fakespeak workshop 2022

Venue: PAM 389 (This room only takes 16 people, but the whole workshop can also be followed via Zoom.)

Zoom links: 

Topic: The annual Fakespeak workshop, Day 1

Time: Nov 14, 2022 09:15 AM Oslo

Join Zoom Meeting
  • https://uio.zoom.us/j/64430865176?pwd=QnU2cVVaTTRZMGJMTEFraVVWOUZCZz09
  • Meeting ID: 644 3086 5176
  • Passcode: 830664

Topic: The annual Fakespeak workshop, Day 2

Time: Nov 15, 2022 09:15 AM Oslo

Join Zoom Meeting
  • https://uio.zoom.us/j/61103380184?pwd=bVNkTnZKQlIrZHBpSDFLbXYrV2xRZz09
  • Meeting ID: 611 0338 0184
  • Passcode: 776473

Workshop programme

Monday, November 14

09:15 – 09:20 Opening

09:20 – 09:55 The Fakespeak project

Introduction, Silje S. Alvestad, ILOS, UiO

The English part of the project, Nele Põldvere, ILOS, UiO and Zia Uddin, SINTEF, Oslo

Collaborating projects:

10:00 – 10:30 “Tech companies’ approaches to fake news”, Bente Kalsnes, the SCAM project, Kristiania University College  

10:30 - 11:00 "Navigating social media. experiences from PAR-TS (Pandemic rhetoric, trust and social media)", Jannicke Fiskvik, SINTEF, Trondheim (via Zoom)

11:00 – 11:30 Break, lunch for project members and presenters

Industrial partners:

11:30 – 12:00 “How the Norwegian Media Authority works to fight fake news”, Pernille Huseby, Director of Communications and Consulting at the Norwegian Media Authority (via Zoom)

12:00 – 12:30 “A story of a team of rivals: a historical collaboration”, Helje Solberg, News Director in NRK and Chair of Faktisk

12:30 – 13:00 "Covid, conspiracy and war: thoughts on the challenges ahead", Kristoffer Egeberg, Chief Editor of Faktisk

13:00 – 13:30 Break

Computer science:

13:30 – 14:00 “Challenges and Opportunities in Explainable Fact-Checking”, Vinay J. Setty, University of Stavanger (via Zoom)

14:00 – 14:30 “Content-based Fake News Detection with Logical Tsetlin Machine Rules”, Ole Christoffer Granmo, University of Agder (via Zoom)

Tuesday, November 15

09:15 – 10:00 The Fakespeak project, ctd.

09:15 – 09:35  The Norwegian part of the project, Aleena Thomas, SINTEF, Oslo and Silje S. Alvestad, ILOS, UiO  

09:35 – 09:55 The Russian part of the project, Elizaveta Kibisova and Silje S. Alvestad, ILOS, UiO

09:55 – 10:00 Break

Media science:

10:00 – 10:30 “The ritual function of propaganda: Observations about language in Russian collective trolling of Norway”, Johanne Berge Kalsaas, University of Bergen (via Zoom)

Industrial partner:

10:30 – 11:00 “The role of fact-checking in defending Ukraine against Russian aggression” Yevhen Fedchenko, Editor in Chief of StopFake.org (via Zoom)

Computer science:

11:00 – 11:30 “Technologies serving the needs of verification”, Nikos Sarris, CERTH (via Zoom)

11:30 – 12:00 Break, lunch for project members and presenters

12:00 – 14:30 Linguistics

12:00 – 12:30 “Exploring hybrid information operations online: how the Russian Internet Research Agency weaponised text and images”, William Dance, Lancaster University (via Zoom)

12:30 – 13:00 “Lying during speaking and writing: evidence from studying the process in experimentally collected narratives”, Victoria Johansson, Högskolan Kristianstad & Lund University) and Kajsa Gullberg, Lund University (via Zoom)

13:00 – 13:30 “The functional domain of epistemicity in language”, Henrik Bergqvist, Stockholm University (via Zoom)

Collaborating project:

13:30 – 14:00 “New linguistic methods applied to Russian political discourse”, Laura A. Janda, THREAT DEFUSER, University of Tromsø (via Zoom)

14:00 – 14:30 “A taxonomy of fake news for linguistic analysis”, Jack Grieve, University of Birmingham (via Zoom)

14:30 – 14:35 Closing

Start-up workshop, February 16-17

Programme

Tuesday, February 16: 

  • 9:15-9:45 Edson C. Tandoc Jr. Scholarly definitions of fake news
  • 9:50-10:20 Sharon Levy On Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection, and related works from William Wang’s lab
  • 10:30-11:00 Geir Hågen Karlsen Influence operations in social and other kinds of media, political communication, one-sided history writing etc. The case of Russia
  • 12:00-12:45 Industrial collaboration partners
  • 12:00-12:20 NTB (Geir Terje Ruud/Sarah Sørheim)
  • 12:20-12:45 Faktisk.no (Kristoffer Egeberg)

Wednesday, February 17:

  • 9:00-9:30 Maite Taboada The language of fake news and misinformation
  • 9:30-10:00 Helena Woodfield and Jack Grieve The language of fake news. Corpus studies
  • 10:15- Potentially collaborating projects
  • 10:15-10:40 PAR-TS (Tor Olav Grøtan)
  • 10:40-11:00 Threat-defuser (Laura Janda)
  • 11:00-11:20 SCAM (Bente Kalsnes)
Published June 17, 2020 11:55 AM - Last modified Nov. 11, 2022 1:07 PM