Kubernetes operator for llama.cpp-native LLM inference with GPU scheduling, Apple Silicon Metal support, and OpenAI-compatible API. (Source Code) `Apache-2.0` `Go/Docker/K8S`
LLMKube is a ai code assistant tool that kubernetes operator for llama.cpp-native llm inference with gpu scheduling, apple silicon metal support, and openai-compatible api. (source code) `apache-2.0` `go/docker/k8s`. Available as free plan, it is listed in the MarkBook AI tools directory alongside ai code assistant from top providers.
Looking for LLMKube alternatives or similar ai code assistant? MarkBook helps you compare features, pricing, and reviews across thousands of AI tools. Find the best ai code assistant for your specific needs.