ai/smollm2

Verified Publisher

By Docker

Updated 6 months ago

Tiny LLM built for speed, edge devices, and local development

Model
32

100K+

ai/smollm2 repository overview

SmolLM2

logo

SmolLM2-360M is a compact language model with 360 million parameters, designed to run efficiently on-device while performing a wide range of language tasks. Trained on 4 trillion tokens from a diverse mix of datasets—including FineWeb-Edu, DCLM, The Stack, and newly curated filtered sources—it delivers strong performance in instruction following, knowledge, and reasoning. The instruct version was developed through supervised fine-tuning (SFT) on a blend of public and proprietary datasets, followed by Direct Preference Optimization (DPO) using UltraFeedback.

Intended uses

SmolLM2 is designed for:

  • Chat assistants
  • Text-extraction
  • Rewriting and summarization

Characteristics

AttributeDetails
ProviderHugging Face
ArchitectureLlama2
Cutoff dateJune 2024
LanguagesEnglish
Tool calling
Input modalitiesText
Output modalitiesText
LicenseApache 2.0

Available model variants

Model variantParametersQuantizationContext windowVRAM¹Size
ai/smollm2:latest

ai/smollm2:360M-Q4_K_M
360MIQ2_XXS/Q4_K_M8K tokens0.63 GiB256.35 MB
ai/smollm2:135M-Q4_0135MQ4_08K tokens0.35 GiB85.77 MB
ai/smollm2:135M-Q4_K_M135MIQ2_XXS/Q4_K_M8K tokens0.36 GiB98.87 MB
ai/smollm2:135M-F16135MF168K tokens0.51 GiB256.63 MB
ai/smollm2:135M-Q2_K135MQ2_K8K tokens0.34 GiB82.41 MB
ai/smollm2:360M-Q4_0360MQ4_08K tokens0.59 GiB216.80 MB
ai/smollm2:360M-Q4_K_M360MIQ2_XXS/Q4_K_M8K tokens0.63 GiB256.35 MB
ai/smollm2:360M-F16360MF168K tokens1.06 GiB690.24 MB

¹: VRAM estimated based on model characteristics.

latest360M-Q4_K_M

Use this AI model with Docker Model Runner

First, pull the model:

docker model pull ai/smollm2

Then run the model:

docker model run ai/smollm2

For more information on Docker Model Runner, explore the documentation.

Benchmark performance

CategoryBenchmarkScore
ReasoningHellaSwag54.5
ScienceOpenBookQA37.4
ARC53.0
ReasoningPIQA71.7
CommonsenseQA38.0
Winogrande52.5
Popular Aggregated BenchmarkMMLU (cloze)35.8
TriviaQA (held-out)16.9
MathGSM8K (5-shot)3.2

Tag summary

Content type

Model

Digest

sha256:d2df8c834

Size

100.6 MB

Last updated

6 months ago

docker model pull ai/smollm2:135M-Q4_K_M

This week's pulls

Pulls:

9,844

Last week