numaserve

module
v0.0.0-...-3c84b19 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Oct 30, 2023 License: Apache-2.0

README

numaserve

numaserve is a standard, cloud agnostic Model Inference Platform on Kubernetes, built for highly scalable use cases. Providing advanced Model deployments like canary rollout, Progressive rollout to confidenty deploying model in to production. Create new canary inference pipeline if any atomic unit changes which will help to assess the new change with end to end pipeline. Easy to create high scalable production inference serving with pre/post processing in few lines Support DAG based inference High resilient system Supports CPU/GPU resources Supporting all model runtime Outbox observability and monitoring

Directories

Path Synopsis
api
v1alpha1
Package v1alpha1 contains API Schema definitions for the mlserve.numaproj.io v1alpha1 API group +kubebuilder:object:generate=true +groupName=mlserve.numaproj.io
Package v1alpha1 contains API Schema definitions for the mlserve.numaproj.io v1alpha1 API group +kubebuilder:object:generate=true +groupName=mlserve.numaproj.io
cmd
pkg

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL