Projects

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

34k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

33k ⭐

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

31k ⭐

llama3

The official Meta Llama 3 GitHub site

Python

20k ⭐

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

19k ⭐

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Python

12k ⭐

Qwen

The official repo of the Qwen (通义千问) chat and pretrained large language models proposed by Alibaba Cloud.

Python

11k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

8823 ⭐

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Jupyter Notebook

8763 ⭐

Yi

A series of large language models trained from scratch by developers @01-ai

Python

7248 ⭐

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python

5729 ⭐

InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Python

5276 ⭐

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python

2464 ⭐

TigerBot

TigerBot: A multi-language multi-task LLM

Python

2212 ⭐

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Makefile

1233 ⭐