July 23rd, 2024

Llama 3.1: Our most capable models to date

Meta has launched Llama 3.1 405B, an advanced open-source AI model supporting diverse languages and extended context length. It introduces new features like Llama Guard 3 and aims to enhance AI applications with improved models and partnerships.

Read original articleLink Icon
Llama 3.1: Our most capable models to date

Meta has introduced Llama 3.1, a groundbreaking open-source AI model with enhanced capabilities. The latest model, Llama 3.1 405B, offers unmatched flexibility and state-of-the-art features, enabling the community to explore new workflows like synthetic data generation and model distillation. With an expanded context length of 128K and support for eight languages, Llama 3.1 405B is set to revolutionize AI applications. Meta aims to empower developers by providing tools to create custom agents and behaviors, along with new security measures like Llama Guard 3 and Prompt Guard. The release includes upgraded versions of the 8B and 70B models, supporting advanced use cases such as text summarization and coding assistance. The model evaluations demonstrate competitive performance across various tasks, positioning Llama 3.1 as a leading open-source foundation model. The ecosystem is supported by key partners like AWS, NVIDIA, and Google Cloud, offering services for immediate development and deployment. The release of Llama 3.1 signifies a significant step towards open-access AI innovation, fostering collaboration and exploration within the developer community.

Related

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU

Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU

The article discusses the release of open-source Llama3 70B model, highlighting its performance compared to GPT-4 and Claude3 Opus. It emphasizes training enhancements, data quality, and the competition between open and closed-source models.

WhatsApp Android beta reveals Llama 3 405B option

WhatsApp Android beta reveals Llama 3 405B option

WhatsApp is updating to version 2.24.14.7 on Google Play Beta, introducing the Meta AI Llama model for enhanced user interactions. Users can choose between different Llama models for tailored AI experiences.

Show HN: Perplexity (llama3 70B) Inline Bot on Telegram

Show HN: Perplexity (llama3 70B) Inline Bot on Telegram

The Llama 3 AI bot on Telegram provides internet access for knowledge sharing. Users can ask questions, summarize content, get programming help, and choose between monthly or yearly plans for a fee. Refunds within 30 days are possible.

Gemma 2 on AWS Lambda with Llamafile

Gemma 2 on AWS Lambda with Llamafile

Google released Gemma 2 9B, a compact language model rivaling GPT-3.5. Mozilla's llamafile simplifies deploying models like LLaVA 1.5 and Mistral 7B Instruct, enhancing accessibility to powerful AI models across various systems.

Llama 3.1 Official Launch

Llama 3.1 Official Launch

Llama introduces Llama 3.1, an open-source AI model available in 8B, 70B, and 405B versions. The 405B model is highlighted for its versatility in supporting various use cases, including multi-lingual agents and analyzing large documents. Users can leverage coding assistants, real-time or batch inference, and fine-tuning capabilities. Llama emphasizes open-source AI and offers subscribers updates via a newsletter.

Link Icon 4 comments
By @sagz - 3 months
405B is already being served on WhatsApp!

https://ibb.co/kQ2tKX5

By @msoad - 3 months
MMLU PRO is the benchmark I trust the most. I noticed they are using 5 shots and CoT. Is that true for GPT4 and Sonnet as well?