Zur Kurzanzeige

dc.identifier.uri http://dx.doi.org/10.15488/8793
dc.identifier.uri https://www.repo.uni-hannover.de/handle/123456789/8846
dc.contributor.author Holzmann, Helge
dc.contributor.author Anand, Avishek
dc.contributor.author Khosla, Megha
dc.date.accessioned 2019-12-11T16:12:41Z
dc.date.available 2019-12-11T16:12:41Z
dc.date.issued 2019
dc.identifier.citation Holzmann, H.; Anand, A.; Khosla, M.: Estimating PageRank deviations in crawled graphs. In: Applied Network Science 4 (2019), Nr. 1, 86. DOI: https://doi.org/10.1007/s41109-019-0201-9
dc.description.abstract Most real-world graphs collected from the Web like Web graphs and social network graphs are partially discovered or crawled. This leads to inaccurate estimates of graph properties based on link analysis such as PageRank. In this paper we focus on studying such deviations in ordering/ranking imposed by PageRank over crawled graphs. We first show that deviations in rankings induced by PageRank are indeed possible. We measure how much a ranking, induced by PageRank, on an input graph could deviate from the original unseen graph. More importantly, we are interested in conceiving a measure that approximates the rank correlation among them without any knowledge of the original graph. To this extent we formulate the HAK measure that is based on computing the impact redistribution of PageRank according to the local graph structure. We further propose an algorithm that identifies connected subgraphs over the input graph for which the relative ordering is preserved. Finally, we perform extensive experiments on both real-world Web and social network graphs with more than 100M vertices and 10B edges as well as synthetic graphs to showcase the utility of HAK and our High-fidelity Component Selection approach. eng
dc.language.iso eng
dc.publisher Heidelberg : Springer
dc.relation.ispartofseries Applied Network Science 4 (2019), Nr. 1
dc.rights CC BY 4.0 Unported
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.subject Crawls eng
dc.subject PageRank eng
dc.subject Ranking deviations eng
dc.subject.ddc 004 | Informatik ger
dc.title Estimating PageRank deviations in crawled graphs
dc.type Article
dc.type Text
dc.relation.issn 23648228
dc.relation.doi https://doi.org/10.1007/s41109-019-0201-9
dc.bibliographicCitation.volume 4
dc.bibliographicCitation.firstPage 86
dc.description.version publishedVersion
tib.accessRights frei zug�nglich


Die Publikation erscheint in Sammlung(en):

Zur Kurzanzeige

 

Suche im Repositorium


Durchblättern

Mein Nutzer/innenkonto

Nutzungsstatistiken