July 19th, 2024

Mathstral: 7B LLM designed for math reasoning and scientific discovery

MathΣtral, a new 7B model by Mistral AI, focuses on math reasoning and scientific discovery, inspired by Archimedes and Newton. It excels in STEM with high reasoning abilities, scoring 56.6% on MATH and 63.47% on MMLU. The model's release under Apache 2.0 license supports academic projects, showcasing performance/speed tradeoffs in specialized models. Further enhancements can be achieved through increased inference-time computation. Professor Paul Bourdon's curation of GRE Math Subject Test problems contributed to the model's evaluation. Instructions for model use and fine-tuning are available in the documentation hosted on HuggingFace.

Read original articleLink Icon
Mathstral: 7B LLM designed for math reasoning and scientific discovery

MathΣtral, a new model released by Mistral AI, is designed for math reasoning and scientific discovery, paying tribute to Archimedes. This 7B model with a 32k context window is published under the Apache 2.0 license. The release aims to support academic projects and complex mathematical problems, similar to Isaac Newton's contributions. MathΣtral, standing on Mistral 7B's foundation, excels in STEM subjects and achieves high reasoning capacities across industry benchmarks. Notably, it scores 56.6% on MATH and 63.47% on MMLU. The model showcases the performance/speed tradeoffs in specialized models, emphasizing Mistral AI's development philosophy. MathΣtral's performance can be further enhanced with increased inference-time computation. Users are encouraged to refer to the documentation for instructions on utilizing or fine-tuning the model, with weights hosted on HuggingFace. The model's evaluation benefited from Professor Paul Bourdon's curation of GRE Math Subject Test problems.

Link Icon 1 comments