Text ranking models based on BERT are now well established for a wide range of passage and document ranking tasks. However, the robustness of BERT-based ranking models under adversarial attack remains under-explored. In this work, we argue that BERT-rankers are vulnerable to adversarial attacks that target the documents retrieved for a query.
We propose algorithms that generate adversarial perturbations of documents, either locally for individual queries or globally across an entire dataset, using gradient-based optimization. The goal of our algorithms is to add a small number of tokens to a highly relevant or non-relevant document so as to cause a significant rank demotion or promotion, respectively.
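As a minimal illustration of the idea behind gradient-guided token selection (not the paper's actual attack), the sketch below uses a toy linear relevance scorer over mean token embeddings: the gradient of the score with respect to an appended token's embedding tells us, to first order, which vocabulary token most demotes or promotes the document. All names (`E`, `score`, `pick_adversarial_token`) are hypothetical and chosen for this example only.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB, DIM = 50, 8
E = rng.normal(size=(VOCAB, DIM))   # toy token embedding table
w = rng.normal(size=DIM)            # toy linear relevance scorer weights

def score(token_ids):
    """Toy relevance score: mean token embedding projected onto w."""
    return float(E[token_ids].mean(axis=0) @ w)

def pick_adversarial_token(token_ids, demote=True):
    """Pick one token to append, guided by the score gradient.

    For this linear scorer the gradient of the score w.r.t. an
    appended embedding is w / (n + 1), so appending token t changes
    the score by (E[t] @ w - mean @ w) / (n + 1).  We choose the
    token whose first-order effect most demotes (or promotes) rank.
    """
    n = len(token_ids)
    grad = w / (n + 1)              # d(score) / d(appended embedding)
    deltas = E @ grad               # first-order score change per token
    return int(np.argmin(deltas) if demote else np.argmax(deltas))

doc = list(rng.integers(0, VOCAB, size=10))
before = score(doc)
adv = pick_adversarial_token(doc, demote=True)
after = score(doc + [adv])          # score drops after appending the token
```

In the actual attack setting, the scorer is a full BERT ranker and the gradient is taken through the model, but the selection principle — rank candidate tokens by the inner product of their embeddings with the score gradient — is the same.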
Our experiments show that just a few tokens can already change a document's rank by a large margin. Moreover, we find that BERT-rankers rely heavily on the beginning of a document for relevance prediction, making its initial part especially susceptible to adversarial attacks.
More interestingly, our statistical analysis uncovers a small set of recurring adversarial tokens that, when concatenated to documents, reliably demote relevant documents or promote non-relevant ones. Finally, these adversarial tokens also exhibit distinct topic preferences within and across datasets, exposing potential biases inherited from BERT pre-training or the downstream datasets.