Programme

9:00-9:20Introduction
9:20-10:30Remote presentations (Workshop and Shared Task)
10:30-11:00Break
11:00-12:30Keynote talk: Jose Camacho-Collados
12:30-14:00Lunch break
14:00-15:30Poster session
15:30-16:00Break
16:00-17:00Panel discussion
17:00-17:30Closing remarks

Accepted papers

  • Non-directive corpus annotation to reveal individual perspectives with underspecified guidelines: the case of mental workload
    Iuliia Arsenteva, Caroline Dubois, Philippe Le Goff, Ludovic Tanguy and Sylvie Plantin
  • DeliChess: A Multi-party Dialogue Dataset for Deliberation in Chess Problem Solving
    Xiaochen Zhu, Georgi Karadzhov, Tom Stafford and Andreas Vlachos
  • Calibration as a Proxy for Fairness and Efficiency in a Perspectivist Ensemble Approach to Irony Detection
    Samuel Jesus, Guilherme Dal bianco, Wanderlei Soares, Valerio Basile and Marcos André Gonçalves
  • SAGE: Steering Dialog Generation with Future-Aware State-Action Augmentation
    Yizhe Zhang and Navdeep Jaitly
  • Hypernetworks for Perspectivist Adaptation
    Daniil Ignatev, Denis Paperno and Massimo Poesio
  • Aligning NLP Models with Target Population Perspectives using PAIR: Population-Aligned Instance Replication
    Stephanie Eckman, Bolei Ma, Christoph Kern, Rob Chew, Barbara Plank and Frauke Kreuter
  • Weak Ensemble Learning from Multiple Annotators for Subjective Text Classification
    Ziyi Huang, Nishanthi Rupika Abeynayake and Xia Cui
  • Revisiting Active Learning under (Human) Label Variation
    Cornelia Gruber, Helen Alber, Bernd Bischl, Goeran Kauermann, Barbara Plank and Matthias Assenmacher
  • Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement
    Gavin Abercrombie, Tanvi Dinkar, Amanda Cercas Curry, Dirk Hovy and Verena Rieser
  • Balancing Quality and Variation: Spam Filtering Distorts Data Label Distributions
    Eve Fleisig, Matthias Orlikowski, Philipp Cimiano and Dan Klein
  • From Disagreement to Understanding: The Case for Ambiguity Detection in NLI
    Chathuri Jayaweera and Bonnie J. Dorr
  • Towards a Perspectivist Understanding of Irony through Rhetorical Figures
    Pier Felice Balestrucci, Michael Oliverio, Elisa Chierchiello, Eliana Di Palma, Luca Anselma, Valerio Basile, Cristina Bosco, Alessandro Mazzei and Viviana Patti
  • CINEMETRIC: A Framework for Multi-Perspective Evaluation of Conversational Agents using Human-AI Collaboration
    Vahid Sadiri Javadi, Zain Ul Abedin and Lucie Flek
  • A Disaggregated Dataset on English Offensiveness Containing Spans
    Pia Pachinger, Janis Marc Goldzycher, Anna Maria Planitzer, Julia Neidhardt and Allan Hanbury

Invited talk

Jose Camacho-Collados

“Cultural Awareness in Multilingual Language Models: A Perspectivist Personal Perspective”

Abstract

Language models have become ubiquitous in NLP and beyond. In particular, the new wave of large language models (LLMs) are increasingly used to communicate and solve practical problems in many languages and countries, and by an increasingly diverse set of users. However, even though there is no doubt that these models open up plenty of opportunities, there are important issues and research questions that arise when it comes to LLMs and their application in different languages and cultures. For instance, the language coverage in language models drastically decreases for less-resourced languages and as such, their performance. And not only the general performance is affected, but general-purpose LLMs may be implicitly biased to specific cultures and languages depending on their underlying training data.

In this talk, I will discuss how language models reflect on cultural diversity, including potential shortcomings and how language coverage and cultural awareness may be intrinsically intertwined. I will also share some lessons learned based on recent research in this area – in particular, I will focus on the development of BLEnD, a large effort to develop a cultural benchmark of everyday knowledge for dozens of languages and countries.


Jose Camacho-Collados is a UKRI Future Leaders Fellow and Professor at the School of Computer Science of Cardiff University, where he co-founded the Cardiff Natural Language Processing group (Cardiff NLP). Before joining Cardiff University, he completed his PhD in Sapienza University of Rome and was a Google AI PhD Fellow.

Jose has worked in multiple NLP areas with a particular focus on semantics, multilinguality and computational social science with an interdisciplinary perspective. In this area, he has been developing specialised and efficient NLP models for social media applications, such as TweetNLP and related efforts. His work has received several recognitions, including awards at top NLP conferences, or the 2023 AIJ Prominent Paper Award. He is also the co-author of the “Embeddings in Natural Language Processing” book.