Projects

Found results for

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

37k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

35k ⭐

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

34k ⭐

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

32k ⭐

llama3

The official Meta Llama 3 GitHub site

Python

27k ⭐

mlx

MLX: An array framework for Apple silicon

C++

17k ⭐

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python

14k ⭐

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python

13k ⭐

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python

12k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

10k ⭐

mistral-inference

Official inference library for Mistral models

Jupyter Notebook

9807 ⭐

Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook

7749 ⭐

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Python

7007 ⭐

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python

6914 ⭐

sglang

SGLang is a fast serving framework for large language models and vision language models.

Python

6584 ⭐