We are releasing an expanded version of Global PIQA! It now covers 141 language varieties and includes parallel and non-parallel splits. We are also releasing an updated preprint.
Multilingual Representation Workshop releases expanded Global PIQA dataset to evaluate cultural commonsense reasoning across 141 languages
Non-parallel splits feature over 50% culturally-specific examples.
Most Activity
It is often understated how hard ot is to compare multilingual models, especially small ones. With hundreds of people working on it, you can now enjoy:
We are releasing an expanded version of Global PIQA! It now covers 141 language varieties and includes parallel and non-parallel splits. We are also releasing an updated preprint.