Projects

sglang

SGLang is a fast serving framework for large language models and vision language models.

Python

📍 🏆

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

46k ⭐

cline

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript

42k ⭐

ColossalAI

Making large AI models cheaper, faster and more accessible

Python

40k ⭐

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

38k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

38k ⭐

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

36k ⭐

mlx

MLX: An array framework for Apple silicon

C++

20k ⭐

flash-attention

Fast and memory-efficient exact attention

Python

17k ⭐

ai

The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free, open-source library for building AI-powered applications and agents.

TypeScript

13k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

12k ⭐

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++

11k ⭐