With the advancement of information retrieval (IR) technologies, robustness is increasingly attracting attention. When deploying technology into practice, we consider not only its average performance under normal conditions but, more importantly, its ability to maintain functionality across a variety of exceptional situations. In recent years, the research on IR robustness covers theory, evaluation, methodology, and application, and all of them show a growing trend.
The purpose of this workshop is to systematize the latest results of each research aspect, to foster comprehensive communication within this niche domain while also bridging robust IR research with the broader community, and to promote further future development of robust IR. To avoid the one-sided talk of mini-conferences, this workshop adopts a highly interactive format, including round-table and panel discussion sessions, to encourage active participation and meaningful exchange among attendees.
Time | Section | Presenter |
---|---|---|
13:30 - 13:50 | Section 1: Introduction | Maarten de Rijke |
13:50 - 14:10 | Section 2: Preliminaries | Yu-An Liu |
14:10 - 15:00 | Section 3: Adversarial robustness | Yu-An Liu |
15:00 - 15:30 | 30min coffee break | |
15:30 - 16:20 | Section 4: Out-of-distribution robustness | Yu-An Liu |
16:20 - 16:30 | Section 5: Robust IR in the age of LLMs | Yu-An Liu |
16:30 - 16:50 | Section 6: Challenges and future directions | Maarten de Rijke |
16:50 - 17:00 | Q & A | All |
A curated list of papers related to robustness in IR can be found at Awesome Robustness in Information Retrieval.
The tutorial extensively covers papers highlighted in bold.
Adversarial retrieval attack
Adversarial ranking attack
Topic-oriented adversarial retrieval/ranking attack
Surrogate model training
Pre-defined position
Output-guided position
Gradient-guided position
Word substitution
Trigger sentence
Multi-granular
Encoding error
Grammatical error
Static: greedy search
Dynamic: reinforcement learning
Data augmentation
Traditional adversarial training
Theory-guided adversarial training
Certified robustness
Perplexity-based detection
Language-based detection
Learning-based detection
Data augmentation
Domain modeling
Architectural modifications
Scaling up the model capacity
Continual learning for dense retrieval
Continual learning for generative retrieval
Self-teaching
Contrastive learning
Hybrid training
@inproceedings{liu2024robust,
author = {Liu, Yu-An and Zhang, Ruqing and Guo, Jiafeng and de Rijke, Maarten},
title = {Robust Information Retrieval},
year = {2024},
booktitle = {SIGIR},
}