November 10th, 2024

AMD Announces OLMo, Its First Open LLM

AMD has launched OLMo, its first fully open large language model, designed for data centers and smaller organizations. It runs on AMD GPUs, offers strong performance, and is free for developers.

AMD has announced OLMo, its first fully open large language model (LLM), designed to enhance AI capabilities for data centers and smaller organizations. The model can be downloaded and run on AMD's Instinct MI250 GPUs and on Ryzen AI PCs equipped with neural processing units (NPUs). OLMo offers reasoning and chat capabilities similar to those of other LLMs, such as OpenAI's GPT-4o. It was pre-trained on a cluster of Instinct GPUs using a dataset of 1.3 trillion tokens, and training proceeded in three stages: pre-training, supervised fine-tuning, and alignment to human preferences. The model has shown strong performance in various benchmarks, particularly in science, coding, and mathematics. It also performed comparably to other open-source LLMs on responsible AI benchmarks, which assess the model's interaction qualities and language toxicity. AMD is making OLMo available to developers for free, allowing them to customize the model for their specific needs.

- AMD has launched OLMo, its first fully open large language model.

- OLMo can run on AMD Instinct MI250 GPUs and Ryzen AI PCs.

- The model was trained in three stages to enhance its capabilities.

- OLMo performed well in benchmarks, particularly in science and coding.

- The model is available for free, enabling customization for developers.
