Retrieval, Crawling and Fusion of Entity-centric Data on the Web

Download statistics - Document (COUNTER):

Dietze, S.: Retrieval, Crawling and Fusion of Entity-centric Data on the Web. In: Cali, A.; Gorgan, D.; Ugarte, M. (Eds.): Semantic keyword-based search on structured data sources. Berlin ; Heidelberg : Springer, 2017 (Lecture notes in computer science ; 10151), S. 3-16. DOI: https://doi.org/10.1007/978-3-319-53640-8_1

Repository version

To cite the version in the repository, please use this identifier: https://doi.org/10.15488/1258

Selected time period:

year: 
month: 

Sum total of downloads: 540




Thumbnail
Abstract: 
While the Web of (entity-centric) data has seen tremendous growth over the past years, take-up and re-use is still limited. Data vary heavily with respect to their scale, quality, coverage or dynamics, what poses challenges for tasks such as entity retrieval or search. This chapter provides an overview of approaches to deal with the increasing heterogeneity of Web data. On the one hand, recommendation, linking, profiling and retrieval can provide efficient means to enable discovery and search of entity-centric data, specifically when dealing with traditional knowledge graphs and linked data. On the other hand, embedded markup such as Microdata and RDFa has emerged a novel, Web-scale source of entitycentric knowledge. While markup has seen increasing adoption over the last few years, driven by initiatives such as schema.org, it constitutes an increasingly important source of entity-centric data on the Web, being in the same order of magnitude as the Web itself with regards to dynamics and scale. To this end, markup data lends itself as a data source for aiding tasks such as knowledge base augmentation, where data fusion techniques are required to address the inherent characteristics of markup data, such as its redundancy, heterogeneity and lack of links. Future directions are concerned with the exploitation of the complementary nature of markup data and traditional knowledge graphs. The final publication is available at Springer via http://dx.doi.org/ 10.1007/978-3-319-53640-8_1.
License of this version: Es gilt deutsches Urheberrecht. Das Dokument darf zum eigenen Gebrauch kostenfrei genutzt, aber nicht im Internet bereitgestellt oder an Außenstehende weitergegeben werden.
Document Type: BookPart
Publishing status: acceptedVersion
Issue Date: 2017
Appears in Collections:Forschungszentren

distribution of downloads over the selected time period:

downloads by country:

pos. country downloads
total perc.
1 image of flag of Germany Germany 282 52.22%
2 image of flag of Algeria Algeria 81 15.00%
3 image of flag of United States United States 35 6.48%
4 image of flag of China China 33 6.11%
5 image of flag of Russian Federation Russian Federation 12 2.22%
6 image of flag of France France 10 1.85%
7 image of flag of Italy Italy 9 1.67%
8 image of flag of United Kingdom United Kingdom 9 1.67%
9 image of flag of No geo information available No geo information available 8 1.48%
10 image of flag of India India 6 1.11%
    other countries 55 10.19%

Further download figures and rankings:


Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Search the repository


Browse