Options
Shallow Discourse Parsing for Open Information Extraction and Text Simplification
Type
conference paper
Date Issued
2022-10
Author(s)
Abstract (De)
We present a discourse-aware text simplification (TS) approach that recursively splits and rephrases complex English sentences into a semantic hierarchy of simplified sentences. Using a set of linguistically principled transformation patterns, sentences are converted into a hierarchical representation in the form of core sentences and accompanying contexts that are linked via rhetorical relations. As opposed to previously proposed sentence splitting approaches, which commonly do not take into account discourse-level aspects, our TS approach preserves the semantic relationship of the decomposed constituents in the output. A comparative analysis with the annotations contained in RST-DT shows that we capture the contextual hierarchy between the split sentences with a precision of 89% and reach an average precision of 69% for the classification of the rhetorical relations that hold between them. Moreover, an integration into state-of-the-art Open Information Extraction (IE) systems reveals that when applying our TS approach as a pre-processing step, the generated relational tuples are enriched with additional meta information, resulting in a novel lightweight semantic representation for the task of Open IE.
Language
English
HSG Classification
contribution to scientific community
HSG Profile Area
None
Book title
Proceedings of the 3rd Workshop on Computational Approaches to Discourse
Publisher
International Conference on Computational Linguistics
Publisher place
Gyeongju, Republic of Korea and Online
Start page
64
End page
76
Pages
13
Event Title
CODI
Event Location
Gyeongju, Republic of Korea and Online
Official URL
Subject(s)
Division(s)
Eprints ID
269520
File(s)
Loading...
open access
Name
2022.codi-1.9.pdf
Size
356.27 KB
Format
Adobe PDF
Checksum (MD5)
848eb91a17d5599e0f966471ae55a1e8