AI Models

Open-source LLMs, embedding models, and multimodal AI systems

13 projects

ollama

Featured

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

168.2k15.5kGoMIT
deepseekgemmagemma3

transformers

Featured

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

159.1k32.8kPythonApache-2.0
audiodeep-learningdeepseek

llama.cpp

Featured

LLM inference in C/C++

102.6k16.6kC++MIT
ggml

whisper

Featured

Robust Speech Recognition via Large-Scale Weak Supervision

97.4k12.0kPythonMIT

gpt4all

Featured

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

77.3k8.3kC++MIT
ai-chatllm-inference

vllm

Featured

A high-throughput and memory-efficient inference and serving engine for LLMs

75.8k15.3kPythonApache-2.0
amdblackwellcuda

llama

Featured

Inference code for Llama models

59.3k9.8kPythonNOASSERTION

FastChat

Featured

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

39.4k4.8kPythonApache-2.0

mlc-llm

Featured

Universal LLM Deployment Engine with ML Compilation

22.4k2.0kPythonApache-2.0
language-modelllmmachine-learning-compilation

StableLM

Featured

StableLM: Stability AI Language Models

15.7k1.0kJupyter NotebookApache-2.0

ChatGLM3

Featured

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

13.7k1.6kPythonApache-2.0

mistral-inference

Featured

Official inference library for Mistral models

10.8k1.0kJupyter NotebookApache-2.0
llmllm-inferencemistralai

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

7.4k1.1kPythonApache-2.0
deepspeed-librarygpt-3language-model