Non-Profit Organization Ai2 Releases New LLM Competitive With Meta’s Llama
The Allen Institute for Artificial Intelligence (Ai2), a nonprofit research organization, has launched OLMo 2, the second family of its open language models, with capabilities comparable to leading models on the market such as Meta’s Llama 3.1.
In a Rush? Here are the Quick Facts!
- Ai2 launched OLMo 2, an advanced open-source language model, yesterday
- The organization describes it as “the best fully open language model to date”
- OLMo 2 competes with other open-source models like Meta’s Llama 3.1
Ai2, founded by Microsoft co-founder Paul Allen in 2014, described the model as “the best fully open language model to date.”
“We introduce OLMo 2, a new family of 7B and 13B models trained on up to 5T tokens,” wrote the organization in an announcement on its website. “These models are on par with or better than equivalently sized fully open models, and competitive with open-weight models such as Llama 3.1 on English academic benchmarks.”
OLMo 2 builds on the models Ai2 released earlier this year; the organization announced its first model, OLMo, in February. The upgrade focuses on critical aspects such as training stability, pretraining, state-of-the-art post-training, and performance measured through an evaluation framework.
The new model is currently available only in English, and a public online demo lets anyone test OLMo 2.
According to TechCrunch, OLMo 2 meets the criteria to be considered an open-source AI as its data and tools are publicly available and ready to be tested.
Ai2 shared data showing that the new model can outperform other popular models of similar size.
“We find that OLMo 2 7B and 13B are the best fully-open models to-date, often outperforming open-weight models of equivalent size,” states the document shared by the organization. “Not only do we observe a dramatic improvement in performance across all tasks compared to our earlier OLMo 0424 model but, notably, OLMo 2 7B outperforms LLama-3.1 8B and OLMo 2 13B outperforms Qwen 2.5 7B despite its lower total training FLOPs.”
Alibaba released the Qwen 2.5 models, which Ai2 used for comparison, in September.