🛠️ RunLocalAI — local AI leaderboard & catalog
Reproducible benchmark scores and the full open-weight model catalog for running AI on your own hardware. Every benchmark is measured first-party with a public run log and a one-line reproduction command — no vibes, no leaderboard laundering.
Source of truth: runlocalai.co · Data license: CC-BY-4.0 · Click any model name for the full operator-grade page.
Ranked, reproducible quality scores on real consumer GPUs. Pick a benchmark to see the head-to-head ranking.
🥇 | TurkishMMLU (Generative) | 11.8 | Q4_K_M | ollama-0.24.0 | 81.1 | ⭐ First-party | 2026-05-28 |
What these scores mean
- HumanEval+ (EvalPlus) — pass@1 /100 · coding · Liu et al., 2023 (NeurIPS). Extension of Chen et al. (2021) HumanEval.
- TurkishMMLU (Generative) — accuracy /100 · turkish-language, knowledge, multilingual · Yuksel et al., 2024
- MBPP+ (EvalPlus) — pass@1 /100 · coding · Liu et al., 2023 (NeurIPS). Extension of Austin et al. (2021) MBPP.
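For readers unfamiliar with pass@1, the sketch below shows the unbiased pass@k estimator introduced with HumanEval (Chen et al., 2021), rescaled to the /100 range used above. It is an illustration only: this page does not state how many samples per problem the runs draw, and the function names `pass_at_k` and `benchmark_score` are hypothetical, not part of any RunLocalAI or EvalPlus API.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k for one problem: n samples generated, c of them pass all tests."""
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # If fewer than k samples fail, every size-k draw contains a passing sample.
    if n - c < k:
        return 1.0
    # 1 - P(all k drawn samples fail)
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_score(results: list[tuple[int, int]], k: int = 1) -> float:
    """Mean pass@k over (n, c) pairs, one per problem, on a 0-100 scale."""
    if not results:
        raise ValueError("results must not be empty")
    return 100.0 * sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Assumed example: one greedy sample per problem, 2 of 4 problems solved.
print(benchmark_score([(1, 1), (1, 0), (1, 1), (1, 0)]))
```

With one sample per problem, pass@1 reduces to the fraction of problems solved, which is why a single greedy run is enough to report it.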
Every run is measured first-party on real consumer hardware and carries a public run log and a one-line reproduction command. Methodology: runlocalai.co/benchmarks/methodology.
Every open-weight model worth running locally — LLMs, embeddings, rerankers, ASR, TTS, diffusion, vision encoders — with license tone and VRAM math.
📐 Embedding | 1600B | 1024K | flux-1-dev-non-commercia | ⚠️ Restricted | ColPali team (Illuin Technology) | 8.7 | 99 |
📐 Embedding | 22M | 256 | apache-2.0 | ✅ Yes | sentence-transformers | 0 | 99 | ||
🎨 Image-gen | 12B | — | flux-1-dev-non-commercia | ⚠️ Restricted | Black Forest Labs | 0 | 98 | ||
💬 Text | 1600B | 1024K | MIT | ✅ Yes | DeepSeek | 0 | 98 | ||
💬 Text | 397B | 256K | Qwen License (commercial | ✅ Yes | Alibaba | 0 | 97 | ||
💬 Text | 235B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 0 | 96 | ||
📐 Embedding | 137M | 8K | apache-2.0 | ✅ Yes | Nomic AI | 0 | 95 | ||
💬 Text | 8B | 128K | Llama 3.1 Community Lice | ✅ Yes | Meta | 8.7 | 95 | ||
💬 Text | 671B | 128K | MIT | ✅ Yes | DeepSeek | 9 | 95 | ||
💬 Text | 284B | 1024K | MIT | ✅ Yes | DeepSeek | 0 | 95 | ||
💬 Text | 109B | 9765K | Llama 4 Community Licens | ✅ Yes | Meta | 8.4 | 95 | ||
🎙️ Audio | 82M | — | apache-2.0 | ✅ Yes | Hexgrad | 0 | 95 | ||
📐 Embedding | 335M | 512 | mit | ✅ Yes | BAAI | 0 | 95 | ||
💬 Text | 600M | 40K | apache-2.0 | ✅ Yes | Alibaba | 0 | 95 | ||
💬 Text | 30B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 0 | 94 | ||
💬 Text | 32B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 9.2 | 93 | ||
💬 Text | 70B | 128K | Llama 3.3 Community Lice | ✅ Yes | Meta | 9.1 | 93 | ||
🔍 Rerank | 570M | 8K | MIT | ✅ Yes | BAAI | 0 | 92 | ||
💬 Text | 32B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 8.9 | 92 | ||
💬 Text | 31B | 128K | Gemma Terms of Use | ✅ Yes | Google | 0 | 92 | ||
📐 Embedding | 109M | 384 | apache-2.0 | ✅ Yes | sentence-transformers | 0 | 92 | ||
🎙️ Audio | 460M | — | Coqui Public Model Licen | ⚠️ Restricted | Coqui | 0 | 92 | ||
🎙️ Audio | 74M | 30 | apache-2.0 | ✅ Yes | OpenAI | 0 | 91 | ||
💬 Text | 8B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 8.5 | 91 | ||
🎙️ Audio | 244M | 30 | apache-2.0 | ✅ Yes | OpenAI | 0 | 91 | ||
🎙️ Audio | 39M | 30 | apache-2.0 | ✅ Yes | OpenAI | 0 | 90 | ||
💬 Text | 70B | 128K | MIT | ✅ Yes | DeepSeek | 9 | 90 | ||
📐 Embedding | 118M | 128 | apache-2.0 | ✅ Yes | sentence-transformers | 0 | 90 | ||
💬 Text | 675B | 256K | Mistral Research License | ⚠️ Restricted | Mistral AI | 0 | 90 | ||
💬 Text | 32B | 128K | MIT | ✅ Yes | DeepSeek | 8.8 | 89 | ||
💬 Text | 200B | 195K | GLM License | ✅ Yes | Zhipu AI (Z.AI) | 0 | 89 | ||
💬 Text | 1.7B | 40K | apache-2.0 | ✅ Yes | Alibaba | 0 | 88 | ||
💬 Text | 3B | 128K | Llama 3.2 Community Lice | ✅ Yes | Meta | 7.4 | 88 | ||
📐 Embedding | 335M | 512 | apache-2.0 | ✅ Yes | Mixedbread AI | 0 | 88 | ||
📐 Embedding | 572M | 8K | cc-by-nc-4.0 | ⚠️ Restricted | Jina AI | 0 | 88 | ||
📐 Embedding | 560M | 514 | mit | ✅ Yes | Microsoft (intfloat) | 0 | 88 | ||
💬 Text | 26B | 128K | Gemma Terms of Use | ✅ Yes | Google | 0 | 88 | ||
💬 Text | 671B | 64K | DeepSeek License | ✅ Yes | DeepSeek | 9 | 88 | ||
💬 Text | 14B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 8.8 | 88 | ||
🎨 Image-gen | 12B | — | apache-2.0 | ✅ Yes | Black Forest Labs | 0 | 88 | ||
👁️ Vision | 2B | 32K | apache-2.0 | ✅ Yes | Alibaba | 0 | 87 | ||
💬 Text | 24B | 32K | Apache 2.0 | ✅ Yes | Mistral AI | 8.4 | 87 | ||
💬 Text | 270M | 32K | gemma | ✅ Yes | Google | 0 | 87 | ||
💬 Text | 30B | 976K | NVIDIA Open Model Licens | ✅ Yes | NVIDIA | 0 | 87 | ||
💬 Text | 7B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 8.6 | 87 | ||
💬 Text | 14B | 16K | MIT | ✅ Yes | Microsoft | 8.6 | 86 | ||
💬 Text | 7B | 128K | MIT | ✅ Yes | DeepSeek | 0 | 86 | ||
💬 Text | 14B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 8.5 | 86 | ||
💬 Text | 27B | 128K | Gemma Terms of Use | ✅ Yes | Google | 8.2 | 85 | ||
💬 Text | 8B | 128K | Llama 3.1 Community Lice | ✅ Yes | NousResearch | 7.7 | 85 | ||
💬 Text | 70B | 128K | Llama 3.1 Community Lice | ✅ Yes | Meta | 8 | 85 | ||
💬 Text | 35B | 256K | Apache-2.0 | ✅ Yes | Alibaba / Qwen team | 8 | 85 | ||
💬 Text | 32B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 8.8 | 84 | ||
💬 Text | 14B | 32K | MIT | ✅ Yes | Microsoft | 8.5 | 84 | ||
💬 Text | 1000B | 1953K | Kimi Open Weights Licens | ✅ Yes | Moonshot AI | 0 | 84 | ||
💬 Text | 12B | 128K | Apache 2.0 | ✅ Yes | Mistral AI / NVIDIA | 7.8 | 84 | ||
💬 Text | 14B | 128K | MIT | ✅ Yes | DeepSeek | 0 | 83 | ||
👁️ Vision | 428M | — | apache-2.0 | ✅ Yes | Google | 0 | 82 | ||
💬 Text | 70B | 128K | Llama 3.1 Community Lice | ✅ Yes | NVIDIA | 0 | 82 | ||
💬 Text | 32B | 32K | Apache 2.0 | ✅ Yes | Alibaba | 8.7 | 82 | ||
💬 Text | 11B | 128K | Llama 3.2 Community Lice | ✅ Yes | Meta | 0 | 81 | ||
💬 Text | 4B | 128K | Gemma Terms of Use | ✅ Yes | Google | 0 | 81 | ||
💬 Text | 120B | 976K | NVIDIA Open Model Licens | ✅ Yes | NVIDIA | 0 | 80 | ||
🎙️ Audio | 756M | 30 | mit | ✅ Yes | Hugging Face / Distil-Whisper | 0 | 80 | ||
💬 Text | 12B | 128K | Gemma Terms of Use | ✅ Yes | Google | 7.9 | 80 | ||
💬 Text | 4B | 128K | Apache 2.0 | ✅ Yes | Alibaba | 0 | 80 | ||
📐 Embedding | 568M | 8K | apache-2.0 | ✅ Yes | Snowflake | 0 | 80 | ||
🎨 Image-gen | 2.6B | — | stabilityai-non-commerci | ⚠️ Restricted | Stability AI | 0 | 80 | ||
💬 Text | 1.1B | 2K | apache-2.0 | ✅ Yes | TinyLlama | 0 | 80 | ||
🔍 Rerank | 278M | 1K | cc-by-nc-4.0 | ⚠️ Restricted | Jina AI | 0 | 80 | ||
💬 Text | 72B | 128K | Qwen License | ✅ Yes | Alibaba | 9 | 80 | ||
💬 Text | 135M | 8K | apache-2.0 | ✅ Yes | Hugging Face | 0 | 80 | ||
💬 Text | 32B | 32K | Apache 2.0 | ✅ Yes | AI2 (Allen AI) | 0 | 79 | ||
💬 Text | 3.8B | 128K | MIT | ✅ Yes | Microsoft | 7.2 | 79 | ||
💬 Text | 27B | 128K | Apache-2.0 | ✅ Yes | Alibaba / Qwen team | 8 | 78 | ||
💬 Text | 16B | 128K | DeepSeek License | ✅ Yes | DeepSeek | 8 | 78 | ||
💬 Text | 400B | 976K | Llama 4 Community Licens | ✅ Yes | Meta | 8.7 | 78 |
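The "VRAM math" mentioned in the catalog intro is not spelled out on this page. As a rough back-of-the-envelope sketch, weight memory can be approximated as parameter count times bits per weight, plus a margin for runtime buffers. Everything here is an assumption for illustration: the function name `estimate_vram_gb`, the 1.2 overhead factor, and the figure of roughly 4.85 bits per weight for a Q4_K_M quantization. KV-cache growth with context length is not modeled.

```python
def estimate_vram_gb(
    params_billion: float,
    bits_per_weight: float,
    overhead: float = 1.2,  # assumed margin for activations and runtime buffers
) -> float:
    """Rough VRAM estimate in GB (10^9 bytes) for holding a model's weights."""
    if params_billion <= 0 or bits_per_weight <= 0 or overhead < 1.0:
        raise ValueError("params and bits must be positive; overhead must be >= 1")
    weight_bytes = params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * overhead / 1e9


# Assumed example: an 8B model at about 4.85 bits per weight (approximate Q4_K_M).
print(round(estimate_vram_gb(8, 4.85), 2))   # about 5.8 GB
# The same model at 16-bit precision, weights only (no overhead margin).
print(estimate_vram_gb(8, 16, overhead=1.0))
```

Treat the result as a lower bound for planning: long contexts, batch size, and the inference runtime can each add several GB on top of the weights.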
Catalog hubs: Small LMs · Embeddings · Audio · Image · Coding · Turkish · Benchmarks
Machine-readable: models · quality-benchmarks · OpenAPI
Data licensed CC-BY-4.0 — attribute to runlocalai.co with a link.