
Model Serving & Inference Platforms

Explore curated open-source tools in the Model Serving & Inference Platforms category. Compare technologies, see alternatives, and find the right solution for your workflow. The category currently lists 10+ projects.

TensorRT LLM
Accelerated LLM inference with NVIDIA TensorRT optimizations
- Stars: 12,694
- License: —
- Last commit: 9 hours ago
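To give a concrete sense of what "accelerated LLM inference" looks like in practice, here is a minimal sketch of offline generation with TensorRT-LLM's high-level LLM API. It assumes a recent tensorrt_llm release on a CUDA-capable GPU, and the checkpoint name is only an illustrative example, not something this listing specifies.

```python
# Minimal sketch: offline inference via TensorRT-LLM's high-level LLM API.
# Assumes a recent tensorrt_llm release; the checkpoint name is illustrative.
from tensorrt_llm import LLM, SamplingParams

prompts = ["The capital of France is"]
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=32)

# Engine compilation/optimization happens under the hood when the LLM
# object is constructed from a Hugging Face checkpoint.
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

for output in llm.generate(prompts, params):
    print(output.outputs[0].text)
```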

Other projects in the category include:
- Scale Python and AI workloads from laptop to cluster effortlessly
- Fast, lightweight Python framework for scalable LLM inference
- High-performance serving framework for LLMs and vision-language models
- Unified AI model serving across clouds, edge, and GPUs
- Unified AI inference platform for generative and predictive workloads on Kubernetes
- Deploy modular, data-centric AI applications at scale on Kubernetes
- Unified Python framework for building high-performance AI inference APIs
- High-throughput LLM serving with intra-device parallelism and asynchronous CPU scheduling
- Unified ML library for scalable training, serving, and federated learning