GCC AI Research

20 million words and counting: UAE’s grand plan to power Arabic with AI - Gulf Business

WAM · Significant research

Summary

The UAE government is developing large language models (LLMs) specifically for the Arabic language, with a target training dataset of 20 million words. This initiative aims to overcome the underrepresentation of Arabic in existing AI models. The project seeks to enhance AI's ability to understand and generate nuanced Arabic content. Why it matters: A national Arabic LLM can enable culturally relevant AI applications across various sectors in the region, from education to government services.

Keywords

LLM · Arabic · UAE · NLP · AI

Related

AI in Arabic? How Gulf could soon lead Artificial Intelligence race - Khaleej Times

Khaleej Times

The Gulf region is making significant investments in artificial intelligence, particularly in Arabic NLP. Recent developments include large language models trained on Arabic data and initiatives to promote AI ethics and policy. Why it matters: These investments aim to position the Gulf as a leader in AI, especially in leveraging the Arabic language and culture.

UAE launches AI project to digitally preserve national history - Gulf News

The National

The UAE has launched a new AI-powered project dedicated to digitally preserving its national history and cultural heritage. This initiative aims to digitize, catalog, and make accessible a vast collection of historical documents, artifacts, and oral traditions. The project seeks to create a comprehensive digital archive to ensure the longevity and accessibility of the nation's cultural memory for future generations. Why it matters: This initiative demonstrates a significant application of AI by the UAE government for cultural preservation and national identity, setting a precedent for leveraging advanced technology in the digital humanities.