dc.identifier.uri |
https://www.repo.uni-hannover.de/handle/123456789/12624 |
|
dc.identifier.uri |
https://doi.org/10.15488/12525 |
|
dc.contributor.author |
Wang, Yumeng
|
eng |
dc.contributor.other |
L3S Research Center |
|
dc.date.accessioned |
2022-07-18T10:41:34Z |
|
dc.date.available |
2022-07-18T10:41:34Z |
|
dc.date.issued |
2022 |
|
dc.identifier.citation |
Wang, Yumeng: Global Triggers for Attacking and Analyzing Ranking Models. Hannover : Gottfried Wilhelm Leibniz Universität Hannover, Institut für Verteilte Systeme, Master Thesis, 2022, VII, 70 S. DOI: https://doi.org/10.15488/12525 |
eng |
dc.description.abstract |
Text ranking models based on BERT are now well established for a wide range of pas-
sage and document ranking tasks. However, the robustness of BERT-based ranking
models under adversarial attack is under-explored. In this work, we argue that BERT-
rankers are vulnerable to adversarial attacks targeting retrieved documents given a
query.
We propose algorithms for generating adversarial perturbation of documents locally
to individual queries or globally across the dataset using gradient-based optimization
methods. The aim of our algorithms is to add a small number of tokens to a highly
relevant or non-relevant document to cause a significant rank demotion or promotion.
Our experiments show that a few number of tokens can already change the document
rank by a large margin. Besides, we find that BERT-rankers heavily rely on the docu-
ment start/head for relevance prediction, making the initial part of the document more
susceptible to adversarial attacks.
More interestingly, our statistical analysis finds a small set of recurring adversar-
ial tokens that when concatenated to documents result in successful rank demo-
tion/promotion of any relevant/non-relevant document respectively. Finally, our ad-
versarial tokens also show particular topic preferences within and across datasets,
exposing potential biases from BERT pre-training or downstream datasets. |
eng |
dc.language.iso |
eng |
eng |
dc.publisher |
Hannover : Gottfried Wilhelm Leibniz Universität |
|
dc.rights |
CC BY 3.0 DE |
eng |
dc.rights.uri |
http://creativecommons.org/licenses/by/3.0/de/ |
eng |
dc.subject |
ranking |
eng |
dc.subject |
BERT |
eng |
dc.subject |
neural networks |
eng |
dc.subject |
neural network |
eng |
dc.subject |
biases |
eng |
dc.subject.ddc |
004 | Informatik
|
eng |
dc.title |
Global Triggers for Attacking and Analyzing Ranking Models |
eng |
dc.type |
MasterThesis |
eng |
dc.type |
Text |
eng |
dcterms.extent |
VII, 70 S. |
|
dc.description.version |
publishedVersion |
eng |
tib.accessRights |
frei zug�nglich |
eng |