Papers
arxiv:2310.14757

SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research

Published on Oct 23, 2023
Authors:
,
,
,
,
,
,
,

Abstract

A unified benchmark called SuperTweetEval is introduced to evaluate NLP models on social media tasks, revealing that social media remains a challenging domain despite recent language modeling advancements.

AI-generated summary

Despite its relevance, the maturity of NLP for social media pales in comparison with general-purpose models, metrics and benchmarks. This fragmented landscape makes it hard for the community to know, for instance, given a task, which is the best performing model and how it compares with others. To alleviate this issue, we introduce a unified benchmark for NLP evaluation in social media, SuperTweetEval, which includes a heterogeneous set of tasks and datasets combined, adapted and constructed from scratch. We benchmarked the performance of a wide range of models on SuperTweetEval and our results suggest that, despite the recent advances in language modelling, social media remains challenging.

Community

Sign up or log in to comment

Models citing this paper 19

Browse 19 models citing this paper

Datasets citing this paper 1

Spaces citing this paper 3

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.