Areas of Work and Initiatives

Menu Display

Linguistic Computing:

This interdisciplinary field bridges technology and the Arabic language. It involves identifying, evaluating, and creating linguistic resources, as well as developing related data and computational applications designed for the automated processing of Arabic—both in terms of comprehension and generation. As a specialized discipline, it merges computer science with Arabic, aiming to support Arabic language fields through computational applications and enhance systems related to Arabic. The ultimate goal is for Arabic to be competitive with other languages in this domain.

Areas of Work:

The sector focuses on linguistic computing, actively contributing to the identification, evaluation, and creation of linguistic data sources. It emphasizes building human capacities, empowering specialists, and developing linguistic resources. It also works on integrating modern technologies with the Arabic language, creating applications, tools, and software that ensure the preservation of the Arabic language through the use of artificial intelligence technologies.

The sector has various programs, most notably:

  • Contemporary Arabic Language Corpus
  • Artificial Intelligence Center for Arabic Language Processing
  • King Salman Academy Hackathon in Arabic Language Processing

Objectives of the Linguistic Computing Sector:

  • Achieving global reference status in linguistic corpora and Arabic dictionaries.
  • Enhancing and developing artificial intelligence technologies for the automated processing of the Arabic language, both locally and globally.
  •  

Most important projects

Asset Publisher

(BALSAM) Index for Evaluating Arabic Large Language Models (LLM): Read more
(BALSAM) Index for Evaluating Arabic Large Language Models (LLM):

A collaborative initiative launched by the academy in partnership with leading academic and government institutions across the Middle East.

Read more
(FALAK) Linguistic Corpus Platform: Read more
(FALAK) Linguistic Corpus Platform:

The Computational Linguistics sector seeks to enhance linguistic resources that contribute to accelerating the pace of scientific research in the Arabic language

Read more
(SIWAR) Linguistic Lexicon Platform: Read more
(SIWAR) Linguistic Lexicon Platform:

Read more
Arabic AI (ARAI) Center: Read more
Arabic AI (ARAI) Center:

The center provides a comprehensive suiteof services designed to empower researchers and developers around the world harness artificial intelligence technologies for the automated processing of the Arabic language. It also supports the development of applications, tools, and software that ensure the preservation of the Arabic language.

Read more
Arabic Computing Challenge: Read more
Arabic Computing Challenge:

An annual global technical challenge aimed at individuals with technical and linguistic expertise from around the world to work on Arabic language processing.

Read more
Arabic Computing Observatory: Read more
Arabic Computing Observatory:

The Arabic Computing Observatory is an analytical and reference tool designed to document and track advancements in the field of Arabic computing.

Read more
Aswat Corpus: Read more
Aswat Corpus:

Aswat is a general spoken corpus that includes multiple linguistic levels—both Modern Standard Arabic and dialects—sourced from various regions within Saudi Arabia. The audio recordings were collected from diverse social groups across five regions of the Kingdom.

Read more
Language Games: Read more
Language Games:

Interactive linguistic and cultural games designed to enhance the vocabulary of native Arabic speakers and learners of Arabic as a foreign language.

Read more
Riyadh Dictionary of Contemporary Arabic: Read more
Riyadh Dictionary of Contemporary Arabic:

This dictionary includes the linguistic material needed by both Arabic speakers and non-native speakers.

Read more