Meta heats up Big Tech's AI arms race with new language model
The public battle to dominate the AI technology space kicked off late last year with the launch of Microsoft-backed OpenAI's ChatGPT and prompted tech heavyweights from Alphabet Inc to China's Baidu Inc to trumpet their own offerings.
Meta's LLaMA, short for Large Language Model Meta AI, will be available under a non-commercial license to researchers and entities affiliated with government, civil society, and academia, the company said in a blog post.
Large language models mine vast amounts of text in order to summarize information and generate content. They can answer questions, for instance, with sentences that can read as though written by humans.
The model, which Meta said requires "far less" computing power than previous offerings, is trained on 20 languages with a focus on those with Latin and Cyrillic alphabets.
AI has emerged as a bright spot for investments in the tech industry, whose slowing growth has prompted widespread layoffs and a cutback on experimental bets.
Meta said LLaMA could outperform competitors that use more parameters, the variables that the algorithm takes into account.
Specifically, it said a version of LLaMA with 13 billion parameters can outperform GPT-3, a recent predecessor to the model on which ChatGPT is built.
It described its 65-billion-parameter LLaMA model as "competitive" with Google's Chinchilla-70B and PaLM-540B, which are even larger than the model that Google used to show off its Bard chat-powered search.
In May last year, Meta released the large language model OPT-175B, also aimed at researchers, which formed the basis of a new iteration of its chatbot BlenderBot.
It later introduced a model called Galactica, which could write scientific articles and solve math problems, but quickly pulled down the demo after it generated authoritative-sounding false responses.