Meta releases AI model for translating speech between dozens of languages

Meta is making the model available to the public for non-commercial use. (AFP/File)
Updated 22 August 2023

  • SeamlessM4T model can perform speech-to-text, speech-to-speech, text-to-speech, and text-to-text translations for up to 100 languages including Modern Standard Arabic

LONDON: Facebook parent company Meta Platforms on Tuesday released an AI model capable of translating and transcribing speech in dozens of languages, a potential building-block for tools enabling real-time communication across language divides.

The company said in a blog post that its SeamlessM4T model could support translations between text and speech in nearly 100 languages, as well as full speech-to-speech translation for 35 languages, including Modern Standard Arabic, Western Persian and Urdu.

Meta built on past innovations, combining technology that was previously available only in separate models, such as No Language Left Behind (NLLB) and Universal Speech Translator, to create this unified multilingual model.

CEO Mark Zuckerberg has said he envisions such tools facilitating interactions between users from around the globe in the metaverse, the set of interconnected virtual worlds on which he is betting the company’s future.

Meta is making the model available to the public for non-commercial use, the blog post said.

The world’s biggest social media company has released a flurry of mostly free AI models this year, including a large language model called Llama that poses a serious challenge to proprietary models sold by Microsoft-backed OpenAI and Alphabet’s Google.

Zuckerberg says an open AI ecosystem works to Meta’s advantage, as the company has more to gain by effectively crowd-sourcing the creation of consumer-facing tools for its social platforms than by charging for access to the models.

Nonetheless, Meta faces the same legal questions as the rest of the industry over the training data ingested to create its models.

In July, comedian Sarah Silverman and two other authors filed copyright infringement lawsuits against both Meta and OpenAI, accusing the companies of using their books as training data without permission.

For the SeamlessM4T model, Meta researchers said in a research paper that they gathered audio training data from 4 million hours of “raw audio originating from a publicly available repository of crawled web data,” without specifying which repository.

A Meta spokesperson did not respond to questions on the provenance of the audio data. Text data came from datasets created last year that pulled content from Wikipedia and associated websites, the research paper said.

Meta said it has conducted extensive research on mitigating toxicity and bias in its generative AI models, resulting in a model that is more aware of and responsive to potential issues.

Earlier this year, Meta joined forces with Alphabet, Microsoft, and OpenAI to announce a joint framework on the responsible use of AI in order to mitigate the risks associated with generative AI tools.

With Reuters


Israeli court overturns conviction of officer who assaulted Palestinian journalist, citing ‘Oct. 7 PTSD’

Updated 25 February 2026

  • Judge sentenced Yitzhak Sofer to 300 hours of community service, saying officer “devoted his life to Israel’s security” and conviction was “disproportionate to severity of his actions”
  • Footage shows Sofer throwing photojournalist Mustafa Alkharouf to the ground, and repeatedly beating and kicking him while he covered Palestinian gatherings near Al-Aqsa Mosque

LONDON: An Israeli court overturned the conviction of a border police officer who assaulted a Palestinian journalist, ruling his actions were influenced by post-traumatic stress disorder from serving during the Oct. 7, 2023 attacks.

On Tuesday, the Jerusalem Magistrate’s Court sentenced officer Yitzhak Sofer to 300 hours of community service for assaulting Anadolu Agency photojournalist Mustafa Alkharouf in occupied East Jerusalem in December 2023.

Footage shows Sofer and other officers drawing weapons, throwing Alkharouf to the ground, and repeatedly beating and kicking him while he covered Palestinian gatherings near Al-Aqsa Mosque amid heavy restrictions.

Alkharouf was hospitalized with facial and body injuries. His cameraman, Faiz Abu Ramila, was also attacked.

Sofer had been convicted in September 2024 of assault causing bodily harm (acquitted of threats) and initially faced six months’ community service, as recommended by Mahash, the Justice Ministry’s police misconduct unit.

Judge Amir Shaked accepted the defense request to cancel the conviction, replacing it with community service.

He cited Sofer’s PTSD from responding to the Oct. 7 Hamas-led attack, noting the officer had “no prior criminal record” and had “devoted his life to Israel’s security.”

“The court cannot ignore this when considering whether the defendant’s conviction should stand,” he said, adding that while the incident is “serious and does cross the criminal threshold,” the conviction in place could cause Sofer harm “disproportionate to the severity of his actions.”

The ruling comes amid surging attacks on journalists in the West Bank, East Jerusalem and Gaza since Israel’s war on Gaza began.

The Committee to Protect Journalists reported that Israel was responsible for the deaths of two-thirds of the 129 media workers killed worldwide in 2025, the deadliest year on record, citing a “persistent culture of impunity” and a lack of transparent probes.

Reporters Without Borders called the Israeli army the “worst enemy of journalists” in its 2025 report, with nearly half of reporter deaths worldwide occurring in Gaza.

Foreign journalists face raids, arrests and intimidation. In late January 2026, Israel’s Supreme Court granted a delay in ruling on a ban on foreign media access to Gaza.