2d ago

Embeddings Show Language Shapes Color-Emotion Associations Across Four Languages

0
Original post

i compared embedding similarity between colors and emotions and normalized it to create this chart and then when i changed the language, the results changed (eg. 赤、愛) see: english, spanish, japanese, hindi

7:02 AM · May 14, 2026 View on X

i compared embedding similarity between colors and emotions and normalized it to create this chart

and then when i changed the language, the results changed (eg. 赤、愛)

see: english, spanish, japanese, hindi

2:02 PM · May 14, 2026 · 9.4K Views

interesting to analyze by color as well

you can see how colors correlate differently to emotions across languages

YoheiYohei@yoheinakajima

i compared embedding similarity between colors and emotions and normalized it to create this chart and then when i changed the language, the results changed (eg. 赤、愛) see: english, spanish, japanese, hindi

2:02 PM · May 14, 2026 · 9.4K Views
2:02 PM · May 14, 2026 · 1.2K Views

you can see more here (color stuff is at bottom): http://colorfeelings.replit.app

- changing embedding model may change results - the similarity range was pretty small so normalizing really stretched it out - some languages seem to generally have stronger correlation between all emotions and colors

this used Xenova/paraphrase-multilingual-MiniLM-L12-v2 embedding model from @huggingface which was decided on by the @replit agent that built this

YoheiYohei@yoheinakajima

interesting to analyze by color as well you can see how colors correlate differently to emotions across languages

2:02 PM · May 14, 2026 · 1.2K Views
2:02 PM · May 14, 2026 · 1.1K Views

Chinese and Japanese have stronger avg similarity between colors and emotions

Arabic and German had the lowest avg similarity between colors and emotions

(this kinda makes sense given chinese/japanese characters often have double meanings, etc?)

YoheiYohei@yoheinakajima

you can see more here (color stuff is at bottom): http://colorfeelings.replit.app - changing embedding model may change results - the similarity range was pretty small so normalizing really stretched it out - some languages seem to generally have stronger correlation between all emotions and colors this used Xenova/paraphrase-multilingual-MiniLM-L12-v2 embedding model from @huggingface which was decided on by the @replit agent that built this

2:02 PM · May 14, 2026 · 1.1K Views
5:15 PM · May 14, 2026 · 260 Views

there were definitely some unexpected results.

i would have assumed blue would be similar to trust, but just because it's commonly discussed in UX discussions, does not mean this appears in embedding similarity

YoheiYohei@yoheinakajima

I analyzed 12 colors and 12 emotions across 12 languages *quick correction on earlier note: i did the normalization after doing similarity scores for all languages

5:18 PM · May 14, 2026 · 492 Views
5:19 PM · May 14, 2026 · 591 Views

here i look at embedding similarity between numbers (spelled out) and emotions

One and Two are similar Love and Surprise Eight is similar to Anticipation Ten is similar to Joy, Love, Surprise, and Anticipation

YoheiYohei@yoheinakajima

there were definitely some unexpected results. i would have assumed blue would be similar to trust, but just because it's commonly discussed in UX discussions, does not mean this appears in embedding similarity

5:19 PM · May 14, 2026 · 591 Views
6:10 PM · May 14, 2026 · 1.2K Views

similarly you can look at how numbers are similar (in embedding space) to different emotions based on the language

YoheiYohei@yoheinakajima

here i look at embedding similarity between numbers (spelled out) and emotions One and Two are similar Love and Surprise Eight is similar to Anticipation Ten is similar to Joy, Love, Surprise, and Anticipation

6:10 PM · May 14, 2026 · 1.2K Views
6:14 PM · May 14, 2026 · 671 Views

germans have the strongest association between numbers and feelings overall

...and other findings

YoheiYohei@yoheinakajima

similarly you can look at how numbers are similar (in embedding space) to different emotions based on the language

6:14 PM · May 14, 2026 · 671 Views
7:17 PM · May 14, 2026 · 391 Views

now i compared similarity of 12 futuristic tech terms with 12 emotions across 12 languages and then normalized them

you can see charts like how does "AGI" feel across languages*

*this is in relation to the other 11 terms

YoheiYohei@yoheinakajima

germans have the strongest association between numbers and feelings overall ...and other findings

7:17 PM · May 14, 2026 · 391 Views
7:26 PM · May 14, 2026 · 3.5K Views

different tech terms and their closest emotion in english (vector similarity)

you'll notice cryptocurrency being close to trust, which could be because both trust and lack of trust are often discussed alongside the topic

surveillance and AGI have highest emotional charge

YoheiYohei@yoheinakajima

now i compared similarity of 12 futuristic tech terms with 12 emotions across 12 languages and then normalized them you can see charts like how does "AGI" feel across languages* *this is in relation to the other 11 terms

7:26 PM · May 14, 2026 · 3.5K Views
7:28 PM · May 14, 2026 · 685 Views

the same word in different languages can have varying connotations due to different associations (e.g. color/emotion)

YoheiYohei@yoheinakajima

i compared embedding similarity between colors and emotions and normalized it to create this chart and then when i changed the language, the results changed (eg. 赤、愛) see: english, spanish, japanese, hindi

2:02 PM · May 14, 2026 · 9.4K Views
3:38 PM · May 14, 2026 · 2K Views

in this embedding analysis, compared to other futuristic tech terms,

AGI maps more strongly to anticipation than fear, especially in english, japanese, chinese, and italian

it also shows stronger negative associations across several european languages: anger in spanish/italian, disgust in german, and shame/envy in french, german, and portuguese

YoheiYohei@yoheinakajima

now i compared similarity of 12 futuristic tech terms with 12 emotions across 12 languages and then normalized them you can see charts like how does "AGI" feel across languages* *this is in relation to the other 11 terms

7:26 PM · May 14, 2026 · 3.5K Views
8:41 PM · May 14, 2026 · 2.7K Views
Embeddings Show Language Shapes Color-Emotion Associations Across Four Languages · Digg