Projects

Found results for

🏆 sglang

SGLang is a fast serving framework for large language models and vision language models.

Python

13k ⭐

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

45k ⭐

cline

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript

40k ⭐

ColossalAI

Making large AI models cheaper, faster and more accessible

Python

40k ⭐

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python

38k ⭐

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python

37k ⭐

ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

36k ⭐

mlx

MLX: An array framework for Apple silicon

C++

20k ⭐

flash-attention

Fast and memory-efficient exact attention

Python

16k ⭐

ai

The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered applications and agents

TypeScript

13k ⭐

Megatron-LM

Ongoing research training transformer models at scale

Python

12k ⭐

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++

11k ⭐

axolotl

Go ahead and axolotl questions

Python

9138 ⭐

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python

7693 ⭐

adk-python

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python

7025 ⭐