Multilingual Representation Workshop releases expanded Global PIQA dataset to evaluate cultural commonsense reasoning across 141 languages

Non-parallel splits feature over 50% culturally-specific examples.

It is often understated how hard it is to compare multilingual models, especially small ones. With hundreds of people working on it, you can now enjoy:

We are releasing an expanded version of Global PIQA! It now covers 141 language varieties and includes parallel and non-parallel splits. We are also releasing an updated preprint.
