Now showing 1 - 10 of 10
No Thumbnail Available
Publication

A Canonical Context-Preserving Representation for Open IE: Extracting Semantically Typed Relational Tuples from Complex Sentences

2023-05-23 , Niklaus, Christina Marianne , Cetto, Matthias , Freitas, André , Handschuh, Siegfried

Modern systems that deal with inference in texts need automatized methods to extract meaning representations (MRs) from texts at scale. Open Information Extraction (IE) is a prominent way of extracting all potential relations from a given text in a comprehensive manner. Previous work in this area has mainly focused on the extraction of isolated relational tuples. Ignoring the cohesive nature of texts where important contextual information is spread across clauses or sentences, state-of-the- art Open IE approaches are thus prone to generating a loose arrangement of tuples that lack the expressiveness needed to infer the true meaning of complex assertions. To overcome this limitation, we present a method that allows existing Open IE systems to enrich their output with additional meta information. By leveraging the semantic hierarchy of minimal propositions generated by the discourse-aware Text Simplification (TS) approach presented in Niklaus et al. (2019), we propose a mechanism to extract semantically typed relational tuples from complex source sentences. Based on this novel type of output, we introduce a lightweight semantic representation for Open IE in the form of normalized and context-preserving relational tuples. It extends the shallow semantic representation of state-of-the-art approaches in the form of predicate-argument structures by capturing intra-sentential rhetorical structures and hierarchical relationships between the relational tuples. In that way, the semantic context of the extracted tuples is preserved, resulting in more informative and coherent predicate-argument structures which are easier to interpret. In addition, in a comparative analysis, we show that the semantic hierarchy of minimal propositions benefits Open IE approaches in a second dimension: the canonical structure of the simplified sentences is easier to process and analyze, and thus facilitates the extraction of relational tuples, resulting in an improved precision (up to 32%) and recall (up to 30%) of the extracted relations on a large benchmark corpus.

No Thumbnail Available
Publication

AL: An Adaptive Learning Support System for Argumentation Skills

2020-04 , Wambsganss, Thiemo , Niklaus, Christina , Cetto, Matthias , Söllner, Matthias , Handschuh, Siegfried , Leimeister, Jan Marco

Recent advances in Natural Language Processing (NLP) bear the opportunity to analyze the argumentation quality of texts. This can be leveraged to provide students with individual and adaptive feedback in their personal learning journey. To test if individual feedback on students' argumentation will help them to write more convincing texts, we developed AL, an adaptive IT tool that provides students with feedback on the argumentation structure of a given text. We compared AL with 54 students to a proven argumentation support tool. We found students using AL wrote more convincing texts with better formal quality of argumentation compared to the ones using the traditional approach. The measured technology acceptance provided promising results to use this tool as a feedback application in different learning settings. The results suggest that learning applications based on NLP may have a beneficial use for developing better writing and reasoning for students in traditional learning settings.

No Thumbnail Available
Publication

Graphene: Semantically-Linked Propositions in Open Information Extraction

2018-08 , Cetto, Matthias , Niklaus, Christina , Freitas, Andre , Handschuh, Siegfried

No Thumbnail Available
Publication

Mathematical Foundations of Data Science

2023-03-13 , Hrycej, Tomas , Bermeitinger, Bernhard , Cetto, Matthias , Handschuh, Siegfried

This textbook aims to point out the most important principles of data analysis from the mathematical point of view. Specifically, it selected these questions for exploring: Which are the principles necessary to understand the implications of an application, and which are necessary to understand the conditions for the success of methods used? Theory is presented only to the degree necessary to apply it properly, striving for the balance between excessive complexity and oversimplification. Its primary focus is on principles crucial for application success. Topics and features: Focuses on approaches supported by mathematical arguments, rather than sole computing experiences Investigates conditions under which numerical algorithms used in data science operate, and what performance can be expected from them Considers key data science problems: problem formulation including optimality measure; learning and generalization in relationships to training set size and number of free parameters; and convergence of numerical algorithms Examines original mathematical disciplines (statistics, numerical mathematics, system theory) as they are specifically relevant to a given problem Addresses the trade-off between model size and volume of data available for its identification and its consequences for model parametrization Investigates the mathematical principles involves with natural language processing and computer vision Keeps subject coverage intentionally compact, focusing on key issues of each topic to encourage full comprehension of the entire book Although this core textbook aims directly at students of computer science and/or data science, it will be of real appeal, too, to researchers in the field who want to gain a proper understanding of the mathematical foundations “beyond” the sole computing experience.

No Thumbnail Available
Publication

The Textbook Learns to Talk: How to Design Chatbot-Mediated Learning to Foster Collaborative High-Order Learning?

2021-11-09 , Burkhard, Michael , Seufert, Sabine , Cetto, Matthias , Handschuh, Siegfried

No Thumbnail Available
Publication

DisSim: A Discourse-Aware Syntactic Text Simplification Framework for English and German

2019 , Niklaus, Christina , Cetto, Matthias , Freitas, André , Handschuh, Siegfried

We introduce DisSim, a discourse-aware sentence splitting framework for English and German whose goal is to transform syntactically complex sentences into an intermediate representation that presents a simple and more regular structure which is easier to process for downstream semantic applications. For this purpose, we turn input sentences into a two-layered semantic hierarchy in the form of core facts and accompanying contexts, while identifying the rhetorical relations that hold between them. In that way, we preserve the coherence structure of the input and, hence, its interpretability for downstream tasks.

No Thumbnail Available
Publication

Graphene: A Context-Preserving Open Information Extraction System

2018-08 , Cetto, Matthias , Niklaus, Christina , Freitas, André , Handschuh, Siegfried

We introduce Graphene, an Open IE system whose goal is to generate accurate, meaningful and complete propositions that may facilitate a variety of downstream semantic applications. For this purpose, we transform syntactically complex input sentences into clean, compact structures in the form of core facts and accompanying contexts, while identifying the rhetorical relations that hold between them in order to maintain their semantic relationship. In that way, we preserve the context of the relational tuples extracted from a source sentence, generating a novel lightweight semantic representation for Open IE that enhances the expressiveness of the extracted propositions.

No Thumbnail Available
Publication

Educational Chatbots for Collaborative Learing: Results of a Design Experiment in a Middle School

2022-11-09 , Burkhard, Michael , Seufert, Sabine , Cetto, Matthias , Handschuh, Siegfried

Educational chatbots promise many benefits for teaching and learning. Although chatbot use cases in this research field are rapidly growing, most studies focus on individual users rather than on collaborative group settings. To address this issue, this paper investigates how chatbot-mediated learning can be designed to foster middle school students in team-based assignments. Using an educational design research approach, quality indicators of educational chatbots were derived from the literature, which served as a guideline for the development of the chatbot Tubo (meaning tutoring bot). Tubo is part of a web-based team learning environment in which students can chat with each other and collaboratively work on their group assignments. As a team member and tutor of each group, Tubo guides the students through the learning journey by different scaffolding elements and helps with content-related questions the students have. As part of a first design cycle, the chatbot application was tested with a school class of a technical vocational school in Switzerland. The received feedback suggests that the approach of team-based learning with chatbots has a lot of potential from the students' and teachers' point of view. However, the role distribution of the individual group members may have to be further specified to address the different needs of autonomous as well as more control-oriented students.

No Thumbnail Available
Publication

Transforming Complex Sentences into a Semantic Hierarchy

2019-07 , Niklaus, Christina , Cetto, Matthias , Freitas, André , Handschuh, Siegfried

We present an approach for recursively splitting and rephrasing complex English sentences into a novel semantic hierarchy of simplified sentences, with each of them presenting a more regular structure that may facilitate a wide variety of artificial intelligence tasks, such as machine translation (MT) or information extraction (IE). Using a set of hand-crafted transformation rules, input sentences are recursively transformed into a two-layered hierarchical representation in the form of core sentences and accompanying contexts that are linked via rhetorical relations. In this way, the semantic relationship of the decomposed constituents is preserved in the output, maintaining its interpretability for downstream applications. Both a thorough manual analysis and automatic evaluation across three datasets from two different domains demonstrate that the proposed syntactic simplification approach outperforms the state of the art in structural text simplification. Moreover, an extrinsic evaluation shows that when applying our framework as a preprocessing step the performance of state-of-the-art Open IE systems can be improved by up to 346% in precision and 52% in recall. To enable reproducible research, all code is provided online.

No Thumbnail Available
Publication

A Survey on Open Information Extraction

2018-08 , Niklaus, Christina , Cetto, Matthias , Freitas, André , Handschuh, Siegfried