FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example

Vogt, Lars

FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example

Services

Deutsch English

Über das Repositorium Suchen und Entdecken Publizieren

Startseite
→
Weitere Einrichtungen
→
Zentrale Einrichtungen
→
Dokumentanzeige

Downloadstatistik des Dokuments (Auswertung nach COUNTER):

Vogt, L.: FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example. In: Journal of biomedical semantics 12 (2021), 20. DOI: https://doi.org/10.1186/s13326-021-00254-0

Version im Repositorium

Zum Zitieren der Version im Repositorium verwenden Sie bitte diesen DOI: https://doi.org/10.15488/12211

Zeitraum, für den die Download-Zahlen angezeigt werden:

Summe der Downloads: 99

Verteilung der Downloads über den gewählten Zeitraum
Herkunft der Downloads nach Ländern

zurück zum Einzeltitelnachweis (Ansicht Nutzungsstatistik schließen)

NameFAIR_data_represe ...

Größe3,29 MB

FormatAdobe PDF

Öffnen

Zusammenfassung:
Background: The size, velocity, and heterogeneity of Big Data outclasses conventional data management tools and requires data and metadata to be fully machine-actionable (i.e., eScience-compliant) and thus findable, accessible, interoperable, and reusable (FAIR). This can be achieved by using ontologies and through representing them as semantic graphs. Here, we discuss two different semantic graph approaches of representing empirical data and metadata in a knowledge graph, with phenotype descriptions as an example. Almost all phenotype descriptions are still being published as unstructured natural language texts, with far-reaching consequences for their FAIRness, substantially impeding their overall usability within the life sciences. However, with an increasing amount of anatomy ontologies becoming available and semantic applications emerging, a solution to this problem becomes available. Researchers are starting to document and communicate phenotype descriptions through the Web in the form of highly formalized and structured semantic graphs that use ontology terms and Uniform Resource Identifiers (URIs) to circumvent the problems connected with unstructured texts. Results: Using phenotype descriptions as an example, we compare and evaluate two basic representations of empirical data and their accompanying metadata in the form of semantic graphs: the class-based TBox semantic graph approach called Semantic Phenotype and the instance-based ABox semantic graph approach called Phenotype Knowledge Graph. Their main difference is that only the ABox approach allows for identifying every individual part and property mentioned in the description in a knowledge graph. This technical difference results in substantial practical consequences that significantly affect the overall usability of empirical data. The consequences affect findability, accessibility, and explorability of empirical data as well as their comparability, expandability, universal usability and reusability, and overall machine-actionability. Moreover, TBox semantic graphs often require querying under entailment regimes, which is computationally more complex. Conclusions: We conclude that, from a conceptual point of view, the advantages of the instance-based ABox semantic graph approach outweigh its shortcomings and outweigh the advantages of the class-based TBox semantic graph approach. Therefore, we recommend the instance-based ABox approach as a FAIR approach for documenting and communicating empirical data and metadata in a knowledge graph.
Lizenzbestimmungen:	CC BY 4.0 Unported
Publikationstyp:	Article
Publikationsstatus:	publishedVersion
Erstveröffentlichung:	2021
Die Publikation erscheint in Sammlung(en):	Zentrale Einrichtungen

nach oben

Verteilung der Downloads über den gewählten Zeitraum:

nach oben

Herkunft der Downloads nach Ländern:

Pos.	Land		Downloads
Pos.	Land		Anzahl	Proz.
1		Germany	38	38,38%
2		United States	20	20,20%
3		No geo information available	18	18,18%
4		United Kingdom	8	8,08%
5		China	5	5,05%
6		Botswana	2	2,02%
7		Zimbabwe	1	1,01%
8		Vietnam	1	1,01%
9		Ukraine	1	1,01%
10		Switzerland	1	1,01%
		andere	4	4,04%

nach oben

Weitere Download-Zahlen und Ranglisten:

Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Suche im Repositorium

Durchblättern

Gesamter Bestand
Diese Sammlung

FAIR data representation in times of eScience: a comparison of instance-based and class-based semantic representations of empirical data using phenotype descriptions as example

Downloadstatistik des Dokuments (Auswertung nach COUNTER):

Version im Repositorium

Zeitraum, für den die Download-Zahlen angezeigt werden:

Summe der Downloads: 99

Verteilung der Downloads über den gewählten Zeitraum:

Herkunft der Downloads nach Ländern:

Weitere Download-Zahlen und Ranglisten:

Suche im Repositorium

Durchblättern

Gesamter Bestand

Diese Sammlung