Building Arabic AI from the ground up

Short Url
Updated 25 September 2025
Follow

Building Arabic AI from the ground up

  • From language depth to data security, regional AI must reflect local values, priorities

ALKHOBAR: When Saudi Arabia unveiled Allam, its homegrown Arabic large language model, it sent a clear signal: the Kingdom is no longer content to simply consume global AI technologies. 

It intends to build its own. For many, this was a moment of pride — a proof that the Arab world can produce tools designed to understand its own languages, cultures, and contexts.

But experts caution that Allam is only the first step in a much longer journey. Success will not be determined by the models alone, but by the invisible foundations that support them: data, infrastructure, governance, and trust.

“You can’t capture the intent, emotion, and cultural depth of Arabic through translation,” said David Barber, director of the UCL Centre for Artificial Intelligence and Distinguished Scientist at UiPath. “You need systems that think in Arabic from the ground up.”




David Barber, director, UCL Centre for Artificial Intelligence; distinguished scientist at UiPath. (Supplied)

Barber highlights a stark reality: only about 15 percent of Arabic text online is clean enough for training a large language model, compared with over 50 percent for English — a huge head start for models like GPT or Claude. Complicating matters further are Arabic’s complex grammar, diverse dialects, and the common mixing of English and Arabic in a single sentence.

“When you train on noisy or shallow data, the system learns shortcuts,” Barber explained. “It can mimic fluency, but it misses the depth, the idioms, the cultural nuances, the rhythm of thought that makes Arabic distinct.”

For Barber, this underscores the importance of Saudi Arabia’s push for locally sourced, high-quality datasets. Without them, any Arabic LLM risks becoming a shallow copy of English-language AI: competent at generic tasks but unable to capture the soul of the language it claims to represent.

Even the best data is ineffective if it cannot be properly organized, secured, and delivered to the model. Seema Alidily, regional director at Denodo, said Gulf enterprises still face major challenges here.

“Without localized infrastructure, AI systems risk misunderstanding user intent or producing irrelevant outputs,” she said. “Data virtualization is one of the few ways to unify governance and access across cloud and on-site systems without moving sensitive information.”




Seema Alidily, regional director, Denodo. (Supplied)

Practically, this means investing in platforms that can pull data from dozens of scattered sources — from ERP systems to IoT sensors— and present it in a unified view for AI to use. In Saudi Arabia, where Vision 2030 projects depend on massive, real-time datasets, this approach is critical, especially given strict regulations on handling citizen data.

Alidily warned that merely replicating Western infrastructure may not suffice. “In the Gulf, centralized visibility and compliance must come first,” she noted. “It is not just a technical issue, it is about aligning with the legal, cultural, and regulatory expectations of the region.”

For Bader AlBahaian, country manager for Saudi Arabia at VAST Data, the stakes go beyond efficiency — they touch on independence and security.

“If we depend exclusively on external platforms, we risk importing their policies and their priorities, often at the expense of regional needs,” he said.




Bader AlBahaian, country manager, Saudi Arabia, VAST Data. (Supplied)

AlBahaian advocates for “sovereign-by-design” systems: storage and compute architectures that keep sensitive data within national borders, encryption and access controls that satisfy local regulators, and AI models trained under rules set by the Kingdom rather than a foreign vendor.

“It is not just about where the data sits,” he added. “It is about who gets to define how it is used, who takes responsibility when something goes wrong, and who has the power to switch the system off if necessary.”

This question of sovereignty is becoming urgent as AI begins to shape decisions in finance, healthcare, education, and public policy. A misaligned model trained on foreign data could issue recommendations that contradict local priorities — or worse, expose the region to economic or political risks.

But building perfect infrastructure is only half the challenge. Success ultimately depends on how AI is deployed.

“Digital labor will allow businesses to have much deeper relationships with their customers,” said Ibrahim Alseghayr, managing director of Salesforce Saudi Arabia. “And by taking on so much of the routine work, AI frees humans to focus on collaboration, creativity, and critical thinking.”




Ibrahim Alseghayr, managing director of Salesforce Saudi Arabia. (Supplied)

Alseghayr points to Agentic AI — systems that can act on a company’s behalf — as already transforming service centers, financial operations, and citizen engagement platforms. In Saudi Arabia, he sees huge potential for digital labor in scaling mega-projects like Neom, automating logistics networks, and delivering smarter healthcare services.

He cautioned that this transformation must be carefully managed. “We need strong governance, testing environments, and continuous oversight,” he said. “Otherwise, we risk building tools we do not fully understand, and that could erode trust instead of building it.”

Across all four experts, one theme is clear: global rules and imported frameworks will not suffice. The Arab world must craft its own AI governance models, rooted in its cultural and legal realities.

For Barber, Allam is a test case. “This is the Kingdom’s chance to prove that it can build systems that are not only technically powerful but also aligned with its values,” he added.

DID YOU KNOW?

• Arabic’s complex grammar, dialect diversity, and frequent English–Arabic mixing make it one of the hardest languages for AI to master. 

• Saudi Arabia’s Allam is the first homegrown Arabic large language model, designed to think in Arabic rather than translate from English. 

• Vision 2030 projects depend on real-time data, but regulations require strict handling of citizen information.

“Agentic AI can create personalized treatment plans, autonomously monitor patients, and detect early signs of health deterioration before a doctor ever enters the room,” he said.Alidily agrees, emphasizing that governance frameworks must reflect the Gulf’s unique data protection requirements, with regulators working closely with technology providers to define shared standards.

AlBahaian is even more direct. “Trust is earned through systems, not slogans. People need to know where their data is, who is using it, and for what purpose. That is the only way to build confidence at scale.”

The message is clear: Arabic AI’s future will not be decided by model size alone. It will depend on investments in infrastructure, sovereignty, and governance.

Saudi Arabia has taken the first step with Allam. What comes next — the data pipelines, virtualized infrastructure, sovereign controls, and digital labor deployments — will determine whether the Kingdom becomes a true AI creator or remains a buyer of foreign-built intelligence.

 


Operational challenges bring Riyadh Airport to a near standstill

Updated 49 min 30 sec ago
Follow

Operational challenges bring Riyadh Airport to a near standstill

  • Airlines issue statements, while sources tell Arab News rain is to blame

RIYADH: Thousands of passengers travelling to and from King Khalid International Airport in Riyadh were left stranded as major airlines struggled to offer alternative flights following a slew of cancellations and delays.

Saudia and flyadeal were among the aviation firms who faced difficulties, with the two airlines putting out statements blaming temporary operational challenges for the issues.

A statement from the airport on its official X account urged travelers to contact airlines directly before heading to the aviation hub to verify the updated status and timing of their flights.

The statement said: “King Khalid International Airport would like to inform you that, due to the concurrence of a number of operational factors over the past two days —including several flights diverting from other airports to King Khalid International Airport, in addition to scheduled maintenance works within the fuel supply system — this has resulted in an impact on the schedules of some flights, including the delay or cancellation of a number of flights operated by certain airlines.”

The airport added that operational teams are working “around the clock in close coordination with our airline partners and relevant stakeholders to address developments and restore operational regularity as soon as possible”, while taking all necessary measures to minimize any impact on the passenger experience.

Airport sources told Arab News that the issue has to do with the heavy rain Riyadh experienced earlier on Friday. Water apparently got into the fuel tankers supposed to refuel jets before they fly, and then several airlines struggled to re-schedule passengers. 

It its own statement on X, Saudia said: “Affected guests are being contacted through various communication chanels, with all ticket changes processed at no additional cost.”

Arab News reached out to Saudia for further information.

Also in a post on X, flyadeal said any of it passengers impacted by the disruption “will be notified directly by emails and SMS with rebooking and support options.”