Meta releases AI model for translating speech between dozens of languages

News, 23 Aug 2023

By Katie Paul

NEW YORK (Reuters) - Facebook parent company Meta Platforms on Tuesday released an AI model capable of translating and transcribing speech in dozens of languages, a potential building block for tools enabling real-time communication across language divides.

The company said in a blog post that its SeamlessM4T model could support translations between text and speech in nearly 100 languages, as well as full speech-to-speech translation for 35 languages, combining technology that was previously available only in separate models.

CEO Mark Zuckerberg has said he envisions such tools facilitating interactions between users from around the globe in the metaverse, the set of interconnected virtual worlds on which he is betting the company’s future.

Meta is making the model available to the public for non-commercial use, the blog post said.

The world’s biggest social media company has released a flurry of mostly free AI models this year, including a large language model called Llama that poses a serious challenge to proprietary models sold by Microsoft-backed OpenAI and Alphabet’s Google.

Zuckerberg says an open AI ecosystem works to Meta’s advantage, as the company has more to gain by effectively crowd-sourcing the creation of consumer-facing tools for its social platforms than by charging for access to the models.

Nonetheless, Meta faces the same legal questions as the rest of the industry around the training data ingested to create its models.

In July, comedian Sarah Silverman and two other authors filed copyright infringement lawsuits against both Meta and OpenAI, accusing the companies of using their books as training data without permission.

For the SeamlessM4T model, Meta researchers said in a research paper that they gathered audio training data from 4 million hours of "raw audio originating from a publicly available repository of crawled web data," without specifying which repository.

A Meta spokesperson did not respond to questions on the provenance of the audio data.

Text data came from datasets created last year that pulled content from Wikipedia and associated websites, the research paper said.

(Reporting by Katie Paul, Editing by Rosalba O’Brien)

Business Reporter Team
