Information Retrieval Service Aspects of the Open Research Knowledge Graph

Arab Oghli, Omar

Information Retrieval Service Aspects of the Open Research Knowledge Graph

Services

Deutsch English

Über das Repositorium Suchen und Entdecken Publizieren

Startseite
→
Fakultäten
→
Fakultät für Elektrotechnik und Informatik
→
Dokumentanzeige

Downloadstatistik des Dokuments (Auswertung nach COUNTER):

Arab Oghli, Omar: Information Retrieval Service Aspects of the Open Research Knowledge Graph. Hannover : Gottfried Wilhelm Leibniz Universität, Master Thesis, 2022, 85 S. DOI: https://doi.org/10.15488/11834

Zeitraum, für den die Download-Zahlen angezeigt werden:

Summe der Downloads: 591

Verteilung der Downloads über den gewählten Zeitraum
Herkunft der Downloads nach Ländern

zurück zum Einzeltitelnachweis (Ansicht Nutzungsstatistik schließen)

NameMSc_Thesis_Omar_A ...

Größe3,68 MB

FormatAdobe PDF

Öffnen

Zusammenfassung:
Information Retrieval (IR) takes a fresh perspective in the context of the next-generation digital libraries such as the Open Research Knowledge Graph (ORKG). As scholarly digital libraries evolve from document-based to knowledge-graph-based representations of content, there is a need for their information technology services to suitably adapt as well. The ORKG enables a structured representation of scholarly contributions data as RDF triples - in turn, it fosters FAIR (Findable, Accessible, Interoperable, and Reusable) scholarly contributions. This thesis has practically examined three different IR service aspects in the ORKG with the aim to help users: (i) easily find and compare relevant scholarly contributions; and (ii) structure new contributions in a manner consistent to the existing ORKG knowledge base of structured contributions. In the first part, it will evaluate and enhance the performance of the default ORKG “Contributions Similarity Service.” An optimal representation of contributions as documents obtains better retrieval performance of the BM25 algorithm in Elasticsearch. To achieve this, evaluation datasets were created and the contributions search index reinitialized with the new documents. In its second part, this thesis will introduce a “Templates Recommendation Service.” Two approaches were tested. A supervised approach with a Natural Language Inference (NLI) objective that tries to infer a contribution template for a given paper if one exists or none. And an unsupervised approach based on search that tries to return the most relevant template for a queried paper. Our experiments favoring ease of practical installation resulted in the conclusion that the unsupervised approach was better suited to the task. In a third and final part, a “Grouped Predicates Recommendation Service” will be introduced. Inspired from prior work, the service implements K-Means clustering with an IR spin. Similar structured papers are grouped, their in-cluster predicate groups computed, and new papers are semantified based on the predicate groups of the most similar cluster. The resulting micro-averaged F-measure of 65.5% using TF-IDF vectors has shown a sufficient homogeneity in the clusters.
Lizenzbestimmungen:	CC BY-NC 3.0 DE
Publikationstyp:	MasterThesis
Publikationsstatus:	publishedVersion
Erstveröffentlichung:	2022-01-14
Die Publikation erscheint in Sammlung(en):	Fakultät für Elektrotechnik und Informatik

nach oben

Verteilung der Downloads über den gewählten Zeitraum:

nach oben

Herkunft der Downloads nach Ländern:

Pos.	Land		Downloads
Pos.	Land		Anzahl	Proz.
1		Germany	322	54,48%
2		United States	54	9,14%
3		Netherlands	27	4,57%
4		Russian Federation	21	3,55%
5		Czech Republic	20	3,38%
6		Canada	13	2,20%
7		China	12	2,03%
8		Vietnam	11	1,86%
9		No geo information available	11	1,86%
10		Pakistan	6	1,02%
		andere	94	15,91%

nach oben

Weitere Download-Zahlen und Ranglisten:

Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Suche im Repositorium

Durchblättern

Gesamter Bestand
Diese Sammlung

Information Retrieval Service Aspects of the Open Research Knowledge Graph

Downloadstatistik des Dokuments (Auswertung nach COUNTER):

Zeitraum, für den die Download-Zahlen angezeigt werden:

Summe der Downloads: 591

Verteilung der Downloads über den gewählten Zeitraum:

Herkunft der Downloads nach Ländern:

Weitere Download-Zahlen und Ranglisten:

Suche im Repositorium

Durchblättern

Gesamter Bestand

Diese Sammlung