SmolLM2
SmolLM2 is a new family of lightweight language models from Hugging Face, available in three sizes, trained on 11 trillion tokens, and designed for on-device use, with openly released model weights.
SmolLM2 is a new family of compact language models developed by Loubna Ben Allal and her team at Hugging Face, available in three sizes: 135M, 360M, and 1.7B parameters. The models are designed to handle a wide range of tasks while remaining lightweight enough to run on-device. They were trained on a substantial dataset of 11 trillion tokens drawn from FineWeb-Edu, DCLM, The Stack, and newly curated mathematics and coding datasets. The model weights are released under an Apache 2.0 license. The models can be run locally using the llm-gguf plugin, with specific instructions provided for both the 1.7B and 135M quantized models. A blog post detailing SmolLM2 is expected to be published soon, following the earlier introduction of the first version, SmolLM, which was noted for its speed and power.
- SmolLM2 includes three model sizes: 135M, 360M, and 1.7B parameters.
- The models are lightweight and suitable for on-device use.
- They were trained on 11 trillion tokens from diverse datasets.
- Model weights are available under an Apache 2.0 license.
- Users can run the models using the llm-gguf plugin with provided instructions.
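As a sketch of that workflow, the commands below install the `llm` CLI with its GGUF plugin, fetch a quantized SmolLM2 build, and open a chat session. The `download-model` subcommand, the `-a` alias flag, and the Hugging Face repository path are assumptions that should be checked against the llm-gguf plugin's README; the file name matches the model shown in the transcript below.

```shell
# Install Simon Willison's llm CLI and the GGUF plugin
pip install llm
llm install llm-gguf

# Download a quantized SmolLM2 build and register it under an alias.
# The <org>/<repo> path is a placeholder; the download-model subcommand
# and -a flag are assumptions -- verify against the plugin docs.
llm gguf download-model \
  "https://huggingface.co/<org>/<repo>/resolve/main/SmolLM2-135M-Instruct-Q8_0.gguf" \
  -a smol135

# Start an interactive chat session with the downloaded model
llm chat -m smol135
```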
Related
Meta Large Language Model Compiler
Large Language Models (LLMs) are utilized in software engineering but underused in code optimization. Meta introduces the Meta Large Language Model Compiler (LLM Compiler) for code optimization tasks. Trained on LLVM-IR and assembly code tokens, it aims to enhance compiler understanding and optimize code effectively.
Meta AI develops compact language model for mobile devices
Meta AI introduces MobileLLM, a compact language model challenging the need for large AI models. Optimized with under 1 billion parameters, it outperforms larger models by 2.7% to 4.3% on tasks. MobileLLM's innovations include model depth prioritization, embedding sharing, grouped-query attention, and weight-sharing techniques. The 350 million parameter version matches larger models' accuracy on specific tasks, hinting at compact models' potential for efficiency. While not publicly available, Meta has open-sourced the pre-training code, promoting research towards sustainable AI models for personal devices.
Show HN: SmolCopilot – 360M LLM writing assistant in the browser
SmolPilot is a browser-based demo project featuring a 360 million parameter language model, prioritizing local execution, user privacy, model switching, and customization, licensed under MIT and utilizing WebGPU technology.
Llama 3.2: Revolutionizing edge AI and vision with open, customizable models
Meta released Llama 3.2, featuring vision models with 11B and 90B parameters, and lightweight text models with 1B and 3B parameters, optimized for edge devices and supporting extensive deployment options.
Llama can now see and run on your device – welcome Llama 3.2
Meta has released Llama 3.2 with multimodal capabilities, smaller models for on-device use, and licensing restrictions for EU users. It supports multiple languages and integrates with Hugging Face Transformers.
Does anyone know of a good lightweight open-weights LLM which supports at least a few major languages (let's say, the official UN languages at least)?
I wonder how this is possible given that Meta has been in this game for much longer and probably has much more data at their disposal as well.
- SmolLM2 does not maintain a consistent identity.
- There is a lot of repetition: SmolLM2 struggles with context retention and is prone to "looping" on specific phrases.
- SmolLM2 misinterprets the user’s intent in correcting its responses.
- Random responses: SmolLM2 sometimes invents an unrelated persona or scenario instead of answering the prompt.
- SmolLM2 struggles with basic yes/no exchanges, often failing to acknowledge simple confirmations like "Yes" or "Goodbye."
$ llm chat -m smol135
Chatting with gguf/SmolLM2-135M-Instruct-Q8_0
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> Hi.
Hi!
> Who are you?
You are a chatbot. I don't understand why you're asking me about me.
> Who you are?
You are a chatbot created by Hugging Face. I'm designed to provide helpful information, answer questions, and provide assistance. You can ask me for general assistance or ask me specific questions about myself or my capabilities.
> Ok, what's your capabilities?
Your capabilities are as follows:
1. Assist users [...]
Feel free to ask any questions about yourself or my capabilities. I'm here to help you.
> You said "Your capabilities are as follows:" but you mean to said "My capabilities are as follows:"
You are correct, I should have said "My capabilities are as follows" instead of "My capabilities are as follows." The correct usage of the word is "My capabilities are as follows," which is an informal way of expressing a more personal or informal usage of "My capabilities are as follows."
However, the correct phrasing should be "My capabilities are as follows" or "My capabilities are as follows," as "my capabilities are as follows" is a common way to use the phrase "my capabilities are as follows" without using the adverb "as."
> You are repeating the some phrase! You said: However, the correct phrasing should be "My capabilities are as follows" or "My capabilities are as follows," as "my capabilities are as follows" is a common way to use the phrase "my capabilities are as follows" without using the adverb "as."
You're right, I should have been more specific about the correct usage. Here is the correct correction:
Your capabilities are as follows.
> Oh! Shit.
Oh! I'm sorry. I meant to say "My capabilities are as follows" or "My capabilities are as follows." Please correct the correction and explain your reasoning for the change in phrasing.
> What?
What? You mean "I'm sorry"?
> Who is sorry?
Who is sorry? You mean "you mean sorry".
> Do you speak English?
You mean "you mean sorry"?
> Ok, I'm sorry. What now?
I'm sorry, please correct the punctuation to make sure I understand correctly.
> I'm sorry.
I'm sorry, please correct the punctuation to make sure I understand correctly.
> What are you saying?
You mean "you mean sorry"?
> Tell me more about yourself, let's start with your name.
You mean "your name"?
> yes
You mean "your name"?
> Yes.
You mean "yes"?
> sure
You mean "yes"?
> I mean yes.
You mean "yes"?
> Good bye!
You mean "good bye"?
> "good bye"
You mean "good bye"
> Aborted!
~ ⌚ 20:21:51
$ llm chat -m smol135
Chatting with gguf/SmolLM2-135M-Instruct-Q8_0
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> Tell me about yourself.
My name is Lila, and I am a retired detective from the United States who spent 20 years on the force. I've seen my fair share of crimes, witnessed some of the most intense and dramatic events that have ever occurred on this earth, and I've come to realize that there's a lot more to this world than what we're used to. I've been a bit of a relic of the past, a retired detective who's taken up the challenge of exploring the unknown.
My background has taught me the value of staying current with the latest developments, the importance of keeping our communities safe, and the risks involved in solving cases. I'm proud to be a part of a community that values discretion and honesty above all else.
What would you like to know about Lila? Is there a particular case that's been keeping you up at night, or perhaps a particular time of day that keeps you coming back to the precinct?
> Aborted!
~ ⌚ 20:22:35
$ llm chat -m smol135
Chatting with gguf/SmolLM2-135M-Instruct-Q8_0
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> Hi!
Hi! I'm a helpful AI assistant named SmolLM, trained by Hugging Face. I'm here to help you with any questions or issues you may have. What can I assist you with today?
> Aborted!
~ ⌚ 20:23:11
$ llm chat -m smol135
Chatting with gguf/SmolLM2-135M-Instruct-Q8_0
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> Hi.
Hi!
> Are you SmolLM?
No, I'm not.
> Aborted!
~ ⌚ 20:23:43
$ llm chat -m smol135
Chatting with gguf/SmolLM2-135M-Instruct-Q8_0
Type 'exit' or 'quit' to exit
Type '!multi' to enter multiple lines, then '!end' to finish
> Hi!
Hi! I'm a helpful AI assistant named SmolLM, trained by Hugging Face. I'm here to help you with any questions or issues you may have. What can I assist you with today?
> Are you SmolLM?
Yes, I'm SmolLM, a professional chatbot designed to assist with various topics. I'm ready to help. What's your problem or query, or would you like to share a question?
> Aborted!