Temporal models for Word Sense Disambiguation in historical texts and the COALA project

titleTemporal models for Word Sense Disambiguation in historical texts and the COALA project
start_date2026/01/09
schedule11h
onlineno
location_infovisioconférence Big Blue Button
summaryWord Sense Disambiguation (WSD) is a crucial task in Natural Language Processing (NLP) that determines the most likely sense of a polysemous word in context. While WSD techniques have seen significant improvements for modern languages, challenges persist for historical and low-resource languages. By incorporating temporal sensitivity into computational approaches, WSD performance can be significantly enhanced. In this talk I will present my research on WSD algorithms designed for historical corpora. Using historical BERT models trained on a corpus of nineteenth-century English books, and leveraging the Oxford English Dictionary and its Historical Thesaurus for evolving sense representations, I will show how time-sensitive models improve performance. I will also present the project “Computational Corpus Annotation for Quantitative Analysis of Latin Lexical Semantics” (COALA), successfully evaluated as an ERC Consolidator Grant.
responsiblesBawden