EmoAtlas: An emotional network analyzer of texts that merges psychological lexicons, artificial intelligence, and network science

Vilella, Salvatore; Ruffo, Giancarlo
2025-01-01

Abstract

We introduce EmoAtlas, a computational library/framework that extracts emotions and syntactic/semantic word associations from texts. EmoAtlas combines interpretable artificial intelligence (AI) for syntactic parsing in 18 languages with psychologically validated lexicons for detecting the eight emotions in Plutchik's theory. We show that EmoAtlas can match or surpass transformer-based natural language processing techniques, such as BERT or large language models like ChatGPT 3.5 or LLaMAntino, in detecting emotions from Italian and English online posts and news articles (e.g., achieving 85.6% accuracy in detecting anger in posts, versus 68.8% for ChatGPT and 89.9% for BERT). EmoAtlas offers important advantages in speed and requires no fine-tuning, e.g., it runs 12 times faster than BERT on the same data. Testing EmoAtlas and easily trainable transformers on a psychometric task, namely reproducing human creativity ratings for 1071 short texts, we find that EmoAtlas and BERT obtain equivalent predictive power (fourfold cross-validation, ρ ≈ 0.495, p < 10⁻⁴). Combining BERT's semantic features with EmoAtlas' emotional/syntactic networks of words substantially improves the estimation of the stories' creativity ratings (ρ = 0.628, p < 10⁻⁴). This indicates an interplay between the creativity of narratives and their semantic, emotional, and syntactic structure. Via interpretable emotional profiles and syntactic networks, EmoAtlas can also quantify how emotions are channeled through specific words in texts, e.g., how customers framed their ideas and emotions towards "beds" in hotel reviews. We release EmoAtlas as a standalone "text as data" computational tool and discuss its impact in extracting interpretable and reproducible insights from texts.
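
As a rough illustration of the lexicon-based side of this approach, the sketch below counts how many words in a text are linked to each of Plutchik's eight emotions. It is not EmoAtlas' API or method: the lexicon `TOY_LEXICON`, the function `emotion_profile`, and the example sentence are hypothetical and chosen only for illustration, and the sketch omits the syntactic parsing and the validated lexicons the abstract describes.

```python
import re

# Plutchik's eight basic emotions.
EMOTIONS = ("anger", "anticipation", "disgust", "fear",
            "joy", "sadness", "surprise", "trust")

# Hypothetical toy lexicon, for illustration only.
# It is NOT the psychologically validated lexicon used by EmoAtlas.
TOY_LEXICON = {
    "comfortable": {"joy", "trust"},
    "clean": {"trust"},
    "dirty": {"disgust"},
    "noisy": {"anger"},
    "broken": {"sadness", "anger"},
    "unexpected": {"surprise"},
}


def emotion_profile(text, lexicon=TOY_LEXICON):
    """Count, for each emotion, how many tokens in `text` are linked to it."""
    # Lowercase the text and keep alphabetic tokens (a deliberately crude tokenizer).
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = {emotion: 0 for emotion in EMOTIONS}
    for token in tokens:
        # Words absent from the lexicon contribute to no emotion.
        for emotion in lexicon.get(token, ()):
            counts[emotion] += 1
    return counts


profile = emotion_profile(
    "The bed was comfortable but the room was dirty and noisy."
)
```

Raw counts like these depend on text length, so a practical tool would normalize or statistically test them before drawing conclusions; the sketch leaves that step out.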
Files in this record:
s13428-024-02553-7.pdf

File under embargo until 28 August 2025

Type: Publisher's version (PDF)
License: Publisher's copyright
Size: 2.61 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11579/203962