Modelling the Information Density of Event Sequences in Texts (A03)

Subject Area General and Comparative Linguistics, Experimental Linguistics, Typology, Non-European Languages
Term from 2014 to 2022
Project identifier Deutsche Forschungsgemeinschaft (DFG) - Project number 232722074
 

Project Description

Project A3 aims at collecting formalized knowledge about prototypical sequences of events – script knowledge – from data, and using it to improve algorithms for natural language processing and our understanding of linguistic encoding choice and interpretation in human communication. The project will develop methods for learning scripts with wide coverage from unannotated texts and extend the representations of script events with information about their preconditions and effects to keep track of causal connections between events. These deeper and wider-coverage script models will be applied to various natural language processing tasks, and used to model pragmatic interpretations; we will use and extend the Rational Speech Act (RSA) model as a framework for modelling pragmatic inferences and explore how the RSA model can be related to existing notions used in the SFB, specifically the UID hypothesis.
DFG Programme Collaborative Research Centres
Subproject of SFB 1102:  Information Density and Linguistic Encoding
Applicant Institution Universität des Saarlandes
Project Heads Professorin Dr. Vera Demberg; Professor Dr. Alexander Koller; Professor Dr. Manfred Pinkal, until 6/2018; Dr. Stefan Thater