July 23rd, 2024

Meta releases an open-weights GPT-4-level AI model, Llama 3.1 405B

Meta has launched Llama 3.1 405B, a free AI language model with 405 billion parameters, challenging closed AI models. Users can download it for personal use, promoting open-source AI principles. Mark Zuckerberg endorses this move.

Read original articleLink Icon
Meta releases an open-weights GPT-4-level AI model, Llama 3.1 405B

A new AI language model called Llama 3.1 405B has been released by Meta, offering a GPT-4-class large language model for free download and use on personal hardware. This model, with 405 billion parameters, is positioned to rival other top AI models in various capabilities like general knowledge and multilingual translation. Meta claims that Llama 3.1 405B performs closely to industry-leading models like GPT-4 Turbo and Claude 3.5 Sonnet based on benchmarks. The release challenges the closed AI model approach by allowing anyone to download and run the model, emphasizing the benefits of open-source AI for user control and data security. Mark Zuckerberg supports this move, advocating for open AI releases to avoid vendor lock-in and high costs. However, there is debate over the use of the term "open source" in this context, with concerns raised about its misuse and potential impact on industry terminology. The Llama 3.1 models are available for download through Meta's website and Hugging Face, subject to a license and acceptable use policy.

Related

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU

The article discusses the release of open-source Llama3 70B model, highlighting its performance compared to GPT-4 and Claude3 Opus. It emphasizes training enhancements, data quality, and the competition between open and closed-source models.

Llama 3.1 Official Launch

Llama 3.1 Official Launch

Llama introduces Llama 3.1, an open-source AI model available in 8B, 70B, and 405B versions. The 405B model is highlighted for its versatility in supporting various use cases, including multi-lingual agents and analyzing large documents. Users can leverage coding assistants, real-time or batch inference, and fine-tuning capabilities. Llama emphasizes open-source AI and offers subscribers updates via a newsletter.

Llama 3.1: Our most capable models to date

Llama 3.1: Our most capable models to date

Meta has launched Llama 3.1 405B, an advanced open-source AI model supporting diverse languages and extended context length. It introduces new features like Llama Guard 3 and aims to enhance AI applications with improved models and partnerships.

Meta Llama 3.1 405B

Meta Llama 3.1 405B

The Meta AI team unveils Llama 3.1, a 405B model optimized for dialogue applications. It competes well with GPT-4o and Claude 3.5 Sonnet, offering versatility and strong performance in evaluations.

Why Llama 3.1 is Important

Why Llama 3.1 is Important

Meta's new Llama 3.1 405B model prioritizes data sovereignty, open-source accessibility, cost savings, independence, and advanced customization. It aims to boost AI innovation by empowering companies with control and flexibility.

Link Icon 3 comments
By @jazzyjackson - 3 months
Pleasantly surprised to find it available as an Azure service already so there is some hope of using it on private data without buying any hardware. Does anyone have experience with their "AI Studio Hub" ?

https://learn.microsoft.com/en-us/azure/ai-studio/how-to/dep...

By @davoneus - 3 months
"It's potentially the first time anyone can download a GPT-4-class large language model (LLM) for free and run it on their own hardware. You'll still need some beefy hardware: Meta says it can run on a "single server node," which isn't desktop PC-grade equipment. But it's a provocative shot across the bow of "closed" AI model vendors such as OpenAI and Anthropic."

I love that Ars-Technica calls out BS on the title of the article: "Open source AI is the path forward," says Mark Zuckerberg, misusing the term.

The author Edwards states correctly "...undermining your competitors using a model subsidized by a social media war chest is also an efficient way to play spoiler in a market where you might not always win with the most cutting-edge tech."