Open models by workload and deployment path

DeepSeek-R1

Open reasoning model family for developers testing long-form reasoning, coding, and local AI workflows.

Apache-2.0 27K stars

Qwen3.5

Alibaba's flagship open model: 397B-A17B MoE architecture (397B total, 17B active parameters), 8.6× decoding speedup over Qwen3, multimodal, 256K context.

MIT 12K stars

Phi-4

Microsoft's compact 14B dense reasoning model: MIT-licensed, 16K context, and the top MMLU score in its size class.

Apache-2.0 10K stars

Mistral Large 3

Europe's most powerful open model, 675B MoE (41B active), agentic-tuned, strong multilingual performance.

Llama 4 Community License 7.5K stars

Llama 4

Meta's flagship open MoE model family with Scout (109B, 10M context) and Maverick (400B, rivaling GPT-5 on coding).

Apache-2.0 2.7K stars

Rapid-MLX

Apple Silicon local AI engine with OpenAI-compatible API, tool calling, prompt cache, and MLX acceleration.

Open source, Local first

Gemma 4 12B

Google DeepMind's 12B open multimodal model for local agentic workflows on laptops.

DeepSeek V4

Open DeepSeek V4 model family for million-token context, coding, reasoning, and agent workflows.

GLM-5

Open model line from Z.ai focused on agentic engineering and longer coding workflows.

MIT model / Apache-2.0 code

GLM-OCR

Open OCR model and pipeline for turning complex document images into usable text.

Qwen3-VL

Open vision-language model family for images, screens, documents, and multimodal workflows.

Qwen3.6

Qwen's open model line focused on stronger coding, agentic tasks, and real-world stability.

Modified MIT

Kimi K2.5

Moonshot AI's open-weight multimodal model for agentic and tool-using workflows.

Multimodal, OCR, and document models

Models for image understanding, documents, screenshots, OCR, and visual agent workflows.

Gemma 4

Google DeepMind's open model family for local, multimodal, and agentic AI workflows.

Gemma 4 12B

Google DeepMind's 12B open multimodal model for local agentic workflows on laptops.

MIT model / Apache-2.0 code

GLM-OCR

Open OCR model and pipeline for turning complex document images into usable text.

Qwen3-VL

Open vision-language model family for images, screens, documents, and multimodal workflows.

Local and self-hosted models

Open-weight candidates for teams that care about privacy, reproducibility, and deployment control.

MIT 92K stars

DeepSeek-R1

Open reasoning model family for developers testing long-form reasoning, coding, and local AI workflows.

MIT 77.4K stars

gpt4all

Run large language models locally on consumer hardware with a desktop application and Python library.

Apache-2.0 27K stars

Qwen3.5

Alibaba's flagship open model: 397B-A17B MoE architecture (397B total, 17B active parameters), 8.6× decoding speedup over Qwen3, multimodal, 256K context.

MIT 20.4K stars

FinGPT

Open-source financial large language model project for finance sentiment, analysis, and domain adaptation.

MIT 12K stars

Phi-4

Microsoft's compact 14B dense reasoning model: MIT-licensed, 16K context, and the top MMLU score in its size class.

Apache-2.0 10K stars

Mistral Large 3

Europe's most powerful open model, 675B MoE (41B active), agentic-tuned, strong multilingual performance.

Llama 4 Community License 7.5K stars

Llama 4

Meta's flagship open MoE model family with Scout (109B, 10M context) and Maverick (400B, rivaling GPT-5 on coding).

Apache-2.0 6.5K stars

OLMo 2

Fully open language model family from AI2 for transparent research, training, and evaluation.

Apache-2.0 5.5K stars

LiteRT-LM

Google's open-source inference framework for deploying large language models on edge devices.

Open source, Local first

Apache-2.0 2.7K stars

Rapid-MLX

Apple Silicon local AI engine with OpenAI-compatible API, tool calling, prompt cache, and MLX acceleration.

Open source, Local first

Gemma 4

Google DeepMind's open model family for local, multimodal, and agentic AI workflows.

Gemma 4 12B

Google DeepMind's 12B open multimodal model for local agentic workflows on laptops.

DeepSeek V4

Open DeepSeek V4 model family for million-token context, coding, reasoning, and agent workflows.

GLM-5

Open model line from Z.ai focused on agentic engineering and longer coding workflows.

MIT model / Apache-2.0 code

GLM-OCR

Open OCR model and pipeline for turning complex document images into usable text.

Mistral Small 3.2

Apache-licensed small open model for practical instruction following, local inference, and agent experiments.

Qwen3-VL

Open vision-language model family for images, screens, documents, and multimodal workflows.

Qwen3.6

Qwen's open model line focused on stronger coding, agentic tasks, and real-world stability.

Modified MIT

Kimi K2.5

Moonshot AI's open-weight multimodal model for agentic and tool-using workflows.

Compact and general-purpose candidates

Useful baselines for cheaper inference, edge experiments, and broad assistant workloads.

Apache-2.0 10K stars

Mistral Large 3

Europe's most powerful open model, 675B MoE (41B active), agentic-tuned, strong multilingual performance.

Apache-2.0 6.5K stars

OLMo 2

Fully open language model family from AI2 for transparent research, training, and evaluation.

Gemma 4

Google DeepMind's open model family for local, multimodal, and agentic AI workflows.

Gemma 4 12B

Google DeepMind's 12B open multimodal model for local agentic workflows on laptops.

Mistral Small 3.2

Apache-licensed small open model for practical instruction following, local inference, and agent experiments.