Article Dans Une Revue Computing Année : 2025

Conversation-Based Multimodal Abuse Detection Through Text and Graph Embeddings

Noé Cecillon
Vincent Labatut

Résumé

Abusive behavior is common on online social networks, and forces the hosts of such platforms to find new solutions to address this problem. Various methods have been proposed to automate this task in the past decade. Most of them rely on the exchanged content, but ignore the structure and dynamics of the conversation, which could provide some relevant information. In this article, we propose to use representation learning methods to automatically produce embeddings of this textual content and of the conversational graphs depicting message exchanges. While the latter could be enhanced by including additional information on top of the raw conversational structure, no method currently exists to learn whole-graph representations using simultaneously edge directions, weights, signs, and vertex attributes. We propose two such methods to fill this gap in the literature. We experiment with 5 textual and 13 graph embedding methods, and apply them to a dataset of online messages annotated for abuse detection. Our best results achieve an F -measure of 81.02 using text alone and 80.61 using graphs alone. We also combine both modalities of information (text and graphs) through three fusion strategies, and show that this strongly improves abuse detection performance, increasing the F -measure to 87.06. Finally, we identify which specific engineered features are captured by the embedding methods under consideration. These features have clear interpretations and help explain what information the representation learning methods deem discriminative.

Fichier principal
Vignette du fichier
Cecillon2024_revised2.pdf (666.5 Ko) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04993092 , version 1 (16-03-2025)
hal-04993092 , version 2 (02-05-2025)

Licence

Identifiants

Citer

Noé Cecillon, Vincent Labatut, Richard Dufour. Conversation-Based Multimodal Abuse Detection Through Text and Graph Embeddings. Computing, 2025, 107, pp.124. ⟨10.1007/s00607-025-01463-6⟩. ⟨hal-04993092v2⟩
113 Consultations
51 Téléchargements

Altmetric

Partager

  • More