September 25th, 2024

Llama can now see and run on your device – welcome Llama 3.2

Meta has released Llama 3.2 with multimodal capabilities, smaller models for on-device use, and licensing restrictions for EU users. It supports multiple languages and integrates with Hugging Face Transformers.

Read original article

Llama can now see and run on your device – welcome Llama 3.2

Llama 3.2 has been released by Meta in collaboration with Hugging Face, introducing multimodal capabilities and smaller models that can run on devices. The update includes ten open-weight models, comprising five multimodal and five text-only variants. The Llama 3.2 Vision model is available in two sizes: 11B for efficient consumer GPU deployment and 90B for large-scale applications. It features advanced visual understanding and reasoning capabilities, allowing it to handle tasks like document question answering and image-text retrieval. The model supports multiple languages and can process both text and images. Additionally, Llama 3.2 includes smaller text models (1B and 3B) designed for on-device use, excelling in tasks such as summarization and multilingual knowledge retrieval. A new version of Llama Guard, which can classify inputs and outputs, has also been introduced. However, there are licensing changes that restrict the use of multimodal models for individuals and companies based in the European Union. The models have been trained on a vast dataset and are expected to perform comparably to their predecessors in text capabilities. The release also emphasizes integration with Hugging Face Transformers and various deployment options, making it easier for developers to utilize these models in applications.

- Llama 3.2 introduces multimodal models with advanced visual reasoning capabilities.

- The update includes smaller text models (1B and 3B) for on-device applications.

- Licensing changes restrict EU users from accessing multimodal models.

- The models support multiple languages and can handle both text and image inputs.

- Integration with Hugging Face Transformers facilitates easier deployment and usage.

Llama 3.1 Official Launch

Llama introduces Llama 3.1, an open-source AI model available in 8B, 70B, and 405B versions. The 405B model is highlighted for its versatility in supporting various use cases, including multi-lingual agents and analyzing large documents. Users can leverage coding assistants, real-time or batch inference, and fine-tuning capabilities. Llama emphasizes open-source AI and offers subscribers updates via a newsletter.

Llama 3.1: Our most capable models to date

Meta has launched Llama 3.1 405B, an advanced open-source AI model supporting diverse languages and extended context length. It introduces new features like Llama Guard 3 and aims to enhance AI applications with improved models and partnerships.

Llama 3.2 released: Multimodal, 1B to 90B sizes

Llama 3.2 has been released as an open-source AI model in various sizes for text and image processing, enhancing application development and gaining significant traction with over 350 million downloads.

Llama 3.2: Revolutionizing edge AI and vision with open, customizable models

Meta released Llama 3.2, featuring vision models with 11B and 90B parameters, and lightweight text models with 1B and 3B parameters, optimized for edge devices and supporting extensive deployment options.

Llama 3.2

Meta has launched the Llama 3.2 collection, featuring models for text and image-text generation, including various parameter sizes and Llama Guard 3 models for safety, hosted on Hugging Face.

1 comments

By @ChrisArchitect - 8 months

Llama 3.2 released: Multimodal, 1B to 90B sizes

https://news.ycombinator.com/item?id=41649748

Llama 3.1 Official Launch

Llama 3.1: Our most capable models to date

Llama 3.2 released: Multimodal, 1B to 90B sizes

Llama 3.2: Revolutionizing edge AI and vision with open, customizable models

Llama 3.2

Meta has launched the Llama 3.2 collection, featuring models for text and image-text generation, including various parameter sizes and Llama Guard 3 models for safety, hosted on Hugging Face.

Llama can now see and run on your device – welcome Llama 3.2

Related

Llama 3.1 Official Launch

Llama 3.1: Our most capable models to date

Llama 3.2 released: Multimodal, 1B to 90B sizes

Llama 3.2: Revolutionizing edge AI and vision with open, customizable models

Llama 3.2

Related

Llama 3.1 Official Launch

Llama 3.1: Our most capable models to date

Llama 3.2 released: Multimodal, 1B to 90B sizes

Llama 3.2: Revolutionizing edge AI and vision with open, customizable models

Llama 3.2