Projects

Found results for

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

38k ⭐

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

37k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

36k ⭐

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

35k ⭐

llama3

The official Meta Llama 3 GitHub site

Python

28k ⭐

openai-python

The official Python library for the OpenAI API

Python

24k ⭐

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python

19k ⭐

mlx

MLX: An array framework for Apple silicon

C++

19k ⭐

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python

16k ⭐

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python

13k ⭐

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python

13k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

11k ⭐

sglang

SGLang is a fast serving framework for large language models and vision language models.

Python

10k ⭐

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python

7187 ⭐