Fakespeak – the language of fake news
Linguistic cues in fake news may be the key to its detection.
Photo: Silje Susanne Alvestad.
About the project
Fake news is defined as "news" items in which case the author knows that they are false and intends to deceive. The lion's share of the research on the detection of fake news is conducted by computer scientists alone.
However, linguists have shown that the linguistic features of a text vary according to its purpose. Thus, the language of fake news may be the key to its detection. This is the background of the linguistics-driven project "Fakespeak - the language of fake news. Fake news detection based on linguistic cues". The project involves a core team of linguists and computer scientists based in Norway and the UK.
The linguists will seek to reveal the grammatical and stylistic features of the language of fake news, referred to as "Fakespeak", in English, Norwegian and Russian.
To achieve this goal they will first build, and make use of existing, corpora of fake and real news from various online media outlets in all three languages, and then subject the datasets to thorough linguistic analyses.
The fake and genuine articles that we will compare will be written by one and the same author. This is to control for several potential sources of error. The linguists will apply methods and draw on insights from corpus linguistics, computational linguistics, applied linguistics, including forensic linguistics, as well as pragmatics and rhetoric.
Taking the linguists' findings as their point of departure, along with existing fake news detection systems, the computer scientists will seek to improve these systems by automating the defining features of Fakespeak.
The overall aim of the project is to enable fake news detection systems to discover and flag potentially harmful fake news items in a more accurate, efficient and timely manner than offered by current state-of-the-art systems.
By automating all and only the features of Fakespeak, the project team will enable the systems to detect and flag only deliberate disinformation, excluding, for example, (inadvertent) misinformation, satirical texts, parody, and texts reflecting a certain set of opinions. Thus, the project will take societal safety and security into consideration while at the same time guarding the freedom of speech.
The Research Council of Norway, project-ID 302573.
The annual Fakespeak workshop 2022
Venue: PAM 389 (This room only takes 16 people, but the whole workshop can also be followed via Zoom.)
Topic: The annual Fakespeak workshop, Day 1
Time: Nov 14, 2022 09:15 AM Oslo
Join Zoom Meeting
- Meeting ID: 644 3086 5176
- Passcode: 830664
Topic: The annual Fakespeak workshop, Day 2
Time: Nov 15, 2022 09:15 AM Oslo
Join Zoom Meeting
- Meeting ID: 611 0338 0184
- Passcode: 776473
Monday, November 14
09:15 – 09:20 Opening
09:20 – 09:55 The Fakespeak project
Introduction, Silje S. Alvestad, ILOS, UiO
The English part of the project, Nele Põldvere, ILOS, UiO and Zia Uddin, SINTEF, Oslo
10:00 – 10:30 “Tech companies’ approaches to fake news”, Bente Kalsnes, the SCAM project, Kristiania University College
10:30 - 11:00 "Navigating social media. experiences from PAR-TS (Pandemic rhetoric, trust and social media)", Jannicke Fiskvik, SINTEF, Trondheim (via Zoom)
11:00 – 11:30 Break, lunch for project members and presenters
11:30 – 12:00 “How the Norwegian Media Authority works to fight fake news”, Pernille Huseby, Director of Communications and Consulting at the Norwegian Media Authority (via Zoom)
12:00 – 12:30 “A story of a team of rivals: a historical collaboration”, Helje Solberg, News Director in NRK and Chair of Faktisk
12:30 – 13:00 "Covid, conspiracy and war: thoughts on the challenges ahead", Kristoffer Egeberg, Chief Editor of Faktisk
13:00 – 13:30 Break
13:30 – 14:00 “Challenges and Opportunities in Explainable Fact-Checking”, Vinay J. Setty, University of Stavanger (via Zoom)
14:00 – 14:30 “Content-based Fake News Detection with Logical Tsetlin Machine Rules”, Ole Christoffer Granmo, University of Agder (via Zoom)
Tuesday, November 15
09:15 – 10:00 The Fakespeak project, ctd.
09:15 – 09:35 The Norwegian part of the project, Aleena Thomas, SINTEF, Oslo and Silje S. Alvestad, ILOS, UiO
09:35 – 09:55 The Russian part of the project, Elizaveta Kibisova and Silje S. Alvestad, ILOS, UiO
09:55 – 10:00 Break
10:00 – 10:30 “The ritual function of propaganda: Observations about language in Russian collective trolling of Norway”, Johanne Berge Kalsaas, University of Bergen (via Zoom)
10:30 – 11:00 “The role of fact-checking in defending Ukraine against Russian aggression” Yevhen Fedchenko, Editor in Chief of StopFake.org (via Zoom)
11:00 – 11:30 “Technologies serving the needs of verification”, Nikos Sarris, CERTH (via Zoom)
11:30 – 12:00 Break, lunch for project members and presenters
12:00 – 14:30 Linguistics
12:00 – 12:30 “Exploring hybrid information operations online: how the Russian Internet Research Agency weaponised text and images”, William Dance, Lancaster University (via Zoom)
12:30 – 13:00 “Lying during speaking and writing: evidence from studying the process in experimentally collected narratives”, Victoria Johansson, Högskolan Kristianstad & Lund University) and Kajsa Gullberg, Lund University (via Zoom)
13:00 – 13:30 “The functional domain of epistemicity in language”, Henrik Bergqvist, Stockholm University (via Zoom)
13:30 – 14:00 “New linguistic methods applied to Russian political discourse”, Laura A. Janda, THREAT DEFUSER, University of Tromsø (via Zoom)
14:00 – 14:30 “A taxonomy of fake news for linguistic analysis”, Jack Grieve, University of Birmingham (via Zoom)
14:30 – 14:35 Closing
Start-up workshop, February 16-17
Tuesday, February 16:
- 9:15-9:45 Edson C. Tandoc Jr. Scholarly definitions of fake news
- 9:50-10:20 Sharon Levy On Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection, and related works from William Wang’s lab
- 10:30-11:00 Geir Hågen Karlsen Influence operations in social and other kinds of media, political communication, one-sided history writing etc. The case of Russia
- 12:00-12:45 Industrial collaboration partners
- 12:00-12:20 NTB (Geir Terje Ruud/Sarah Sørheim)
- 12:20-12:45 Faktisk.no (Kristoffer Egeberg)
Wednesday, February 17:
- 9:00-9:30 Maite Taboada The language of fake news and misinformation
- 9:30-10:00 Helena Woodfield and Jack Grieve The language of fake news. Corpus studies
- 10:15- Potentially collaborating projects
- 10:15-10:40 PAR-TS (Tor Olav Grøtan)
- 10:40-11:00 Threat-defuser (Laura Janda)
- 11:00-11:20 SCAM (Bente Kalsnes)