Projects

Found results for

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

36k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

35k ⭐

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

33k ⭐

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

28k ⭐

llama3

The official Meta Llama 3 GitHub site

Python

26k ⭐

mlx

MLX: An array framework for Apple silicon

C++

16k ⭐

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python

13k ⭐

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python

13k ⭐

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python

11k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

10k ⭐

mistral-inference

Official inference library for Mistral models

Jupyter Notebook

9628 ⭐

Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook

7629 ⭐

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python

6669 ⭐

InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Python

6315 ⭐

sglang

SGLang is a fast serving framework for large language models and vision language models.

Python

5601 ⭐