Projects

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

35k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

33k ⭐

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

32k ⭐

llama3

The official Meta Llama 3 GitHub site

Python

23k ⭐

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

23k ⭐

mlx

MLX: An array framework for Apple silicon

C++

15k ⭐

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Python

13k ⭐

Qwen

The official repo of Qwen (通义千问), the chat and pretrained large language models proposed by Alibaba Cloud.

Python

12k ⭐

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python

11k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

9483 ⭐

mistral-inference

Official inference library for Mistral models

Jupyter Notebook

9309 ⭐

Yi

A series of large language models trained from scratch by developers @01-ai

Python

7505 ⭐

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python

6313 ⭐

InternLM

Official release of InternLM2.5 7B base and chat models. 1M context support

Python

5869 ⭐

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Python

2892 ⭐