Meta quietly releases Llama 2 Long AI model

[ad_1]

VentureBeat presents: AI Unleashed – An exclusive executive event for enterprise data leaders. Network and learn with industry peers. Learn More

Meta Platforms showed off a bevy of new AI features for its consumer-facing services Facebook, Instagram and WhatsApp at its annual Meta Connect conference in Menlo Park, California, this week.

But the biggest news from Mark Zuckerberg’s company may have actually come in the form of a computer science paper published without fanfare by Meta researchers on the open access and non-peer reviewed website arXiv.org.

The paper introduces Llama 2 Long, a new AI model based on Meta’s open source Llama 2 released in the summer, but that has undergone “continual pretraining from Llama 2 with longer training sequences and on a dataset where long texts are upsampled,” according to the researcher-authors of the paper.

As a result of this, Meta’s newly elongated AI model outperforms some of the leading competition in generating responses to long (higher character count) user prompts, including OpenAI’s GPT-3.5 Turbo with 16,000-character context window, as well as Claude 2 with its 100,000-character context window.

Event

AI Unleashed

An exclusive invite-only evening of insights and networking, designed for senior enterprise executives overseeing data stacks and strategies.

Learn More

[ad_2]

Source link

Adobe Previews New GenAI Tools for Video Workflows

Exciting Updates From Stanford HAI’s Seventh Annual AI Index Report

8 Reasons to Make the Switch

Meta quietly releases Llama 2 Long AI model

Event

How LLama 2 Long came to be

Related Posts

Adobe Previews New GenAI Tools for Video Workflows

Exciting Updates From Stanford HAI’s Seventh Annual AI Index Report

8 Reasons to Make the Switch