13 projects
ollama
FeaturedGet up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
transformers
Featured🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
llama.cpp
FeaturedLLM inference in C/C++
whisper
FeaturedRobust Speech Recognition via Large-Scale Weak Supervision
gpt4all
FeaturedGPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
vllm
FeaturedA high-throughput and memory-efficient inference and serving engine for LLMs
llama
FeaturedInference code for Llama models
FastChat
FeaturedAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
mlc-llm
FeaturedUniversal LLM Deployment Engine with ML Compilation
StableLM
FeaturedStableLM: Stability AI Language Models
ChatGLM3
FeaturedChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
mistral-inference
FeaturedOfficial inference library for Mistral models
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries