Image Retrieval for Arguments 2023

Synopsis
Data
Evaluation
Submission
Results
Related Work
Task Committee

Synopsis

Task: Given a controversial topic, the task is to retrieve images (from web pages) for each stance (pro/con) that show support for that stance.
Input: [topics] [data]
Submission: [submit]

Data

This task uses a focused crawl of about 56K images (and associated web pages) as document collection. The crawl is created from several image search queries for each of the 50 topics. The relevance judgments of last year are available for training supervised approaches. The output of last year's approaches can be inspected on images.args.me. Registered teams were invited in 2022 to submit search queries that were then used to extend the crawl. [data]

Evaluation

Like last year, systems are evaluated on 50 topics by the ratio of images among the 20 retrieved images for each topic (10 images for each stance) that are relevant for the respective topic and stance. [topics]

Submission

This task uses TIRA for submissions, which allows for both run file upload and Docker image submission. For each topic and stance, include 10 retrieved images. Each team can submit up to 5 different runs.

The submission format adapts the standard TREC format. Each line corresponds to an image retrieved for some topic and stance at a certain rank, making a run file 1000 lines long (50 topics, 2 stances, 10 ranks). Each line contains the following fields, separated by single whitespaces:

The topic number (51 to 100).
The stance ("PRO" or "CON").
The image's ID (corresponds to the name of the image's directory in the collection; always 17 characters long and starts with "I").
The rank (1 to 10 in increasing order per topic and stance). Not used in this year's evaluation.
A score (integer or floating point; non-increasing per topic and stance). Not used in this year's evaluation.
A tag that identifies your group and the method you used to produce the run.

For example:

1 PRO I000330ba4ea0ad13 1 17.89 myGroupMyMethod
1 PRO I0005e6fe00ea17fd 2 16.43 myGroupMyMethod
...
1 CON I0009d5f038fe6f2e 1 15.89 myGroupMyMethod
1 CON I000f34bd3f8cb030 2 14.43 myGroupMyMethod
...

TIRA Tutorial and TIRA Baselines

We provide a TIRA tutorial that provides baselines that can be executed in TIRA at https://github.com/touche-webis-de/touche-code/tree/main/clef23.

In case of problems or questions concerning TIRA, please use the TIRA forum.

Results

Results
Team	Tag	Precision@10
		On topic	Argumentative	On stance
Neville Longbottom	clip_chatgpt_args.raw	0.785	0.338	0.222
Neville Longbottom	clip_chatgpt_args.debater	0.684	0.341	0.216
Hikaru Sulu	Keywords	0.664	0.350	0.185
Hikaru Sulu	Topic-title	0.770	0.335	0.179
Neville Longbottom	bm25_chatgpt_args.raw	0.572	0.274	0.166
Jean-Luc Picard	No stance detection (0)	0.523	0.292	0.162
Neville Longbottom	bm25_chatgpt_args.diff	0.442	0.240	0.150
Jean-Luc Picard	Text and image text stance detection (3)	0.502	0.272	0.144
Jean-Luc Picard	BM25 Baseline (-1)	0.536	0.268	0.141
Jean-Luc Picard	Text stance detection (1)	0.498	0.262	0.136
Neville Longbottom	bm25_chatgpt_args.debater	0.416	0.201	0.128
Minsc	Aramis	0.376	0.194	0.102
Jean-Luc Picard	Image text stance detection (2)	0.369	0.196	0.098

Related Work

Johannes Kiesel, Nico Reichenbach, Benno Stein, and Martin Potthast. Image Retrieval for Arguments Using Stance-Aware Query Expansion. 8th Workshop on Argument Mining (ArgMining 2021) at EMNLP, November 2021.
Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov, and Giovanni Da San Martino. SemEval-2021 Task 6: Detection of Persuasion Techniques in Texts and Images. 15th International Workshop on Semantic Evaluation (SemEval 2021), August 2021.
Keiji Yanai. Image collector III: a web image-gathering system with bag-of-keypoints. 16th International Conference on World Wide Web (WWW 2007), May 2007.

Task Committee

Johannes Kiesel

GESIS - Leibniz Institute for the Social Sciences

Maik Fröbe

Friedrich Schiller University Jena

Martin Potthast

University of Kassel, hessian.AI, and ScaDS.AI

Benno Stein

Bauhaus-Universität Weimar