Normalization Techniques For Improving The Performance Of Knowledge Graph Creation Pipelines

Torabinejad, Mohammad

Original publication

Torabinejad, Mohammad: Normalization Techniques For Improving The Performance Of Knowledge Graph Creation Pipelines. Hannover: Gottfried Wilhelm Leibniz Universität Hannover, Master's thesis, 2020, X, 61 pp. DOI: https://doi.org/10.15488/10081

Name: Normalization_tec ...

Size: 5.070Mb

Format: PDF

Abstract:
With the rapid growth of data on the web, the demand for discovering information within data, and subsequently for exploiting knowledge graphs, is rising faster than commonly expected. Data integration systems can help meet this demand because they transform data from various sources and of different volumes. To this end, a data integration system uses mapping rules, specified in a language such as RML, to integrate data collected from various data sources into a knowledge graph. However, large data sources may suffer from various data quality issues, redundancy being one of them. In this regard, the Semantic Web community contributes to Knowledge Engineering with techniques for creating knowledge graphs efficiently. This thesis tackles the creation of knowledge graphs in the presence of data sources with redundant data and proposes a novel normalization theory to solve this problem. The theory covers not only the characteristics of the data sources but also the mapping rules used to integrate them into a knowledge graph. Based on this theory, three normal forms are proposed, together with an algorithm for transforming mapping rules and data sources into these normal forms. The performance of the proposed approach is evaluated on different testbeds composed of real-world and synthetic data. The observed results suggest that the proposed techniques can dramatically reduce the execution time of knowledge graph creation. The normalization theory of this thesis therefore contributes to the repertoire of tools that facilitate the creation of knowledge graphs at scale.
License terms:	German copyright law applies. The document may be used free of charge for personal use, but may not be made available on the Internet or passed on to third parties.
Publication type:	MasterThesis
Publication status:	publishedVersion
First published:	2020
Keywords (German):	Normalisierung, Mapping-Regeln, Wissensdatenbank, Informationsintegration, Datenbank
Keywords (English):	Database, Normalization, Mapping rules, Knowledge Graph, Data Integration System
Subject classification (DDC):	004 | Computer science
Controlled keywords:	Database, Information Retrieval, Semantic network, Network, Knowledge-based system

This publication appears in collection(s):

Faculty of Electrical Engineering and Computer Science
Open-access publications from the Faculty of Electrical Engineering and Computer Science
