OpenAI dataset aims to improve multilingualism
OpenAI has released a multilingual dataset for evaluating the performance of AI models in 14 languages. As Michael Nuñez reports for VentureBeat, the Multilingual Massive Multitask Language Understanding (MMMLU) dataset includes languages such as Arabic, German, Swahili, and Yoruba. It was shared on the open data platform Hugging Face and builds on the popular MMLU …