Model swapping for llama.cpp (or any local OpenAPI compatible server)
-
Updated
Jun 19, 2025 - Go
10000
Model swapping for llama.cpp (or any local OpenAPI compatible server)
Go library for embedded vector search and semantic embeddings using llama.cpp
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
Eternal is an experimental platform for machine learning models and workflows.
Inference Hub for AI at Scale
Go package and example utilities for using Ollama / LLMs
Add a description, image, and links to the llamacpp topic page so that developers can more easily learn about it.
To associate your repository with the llamacpp topic, visit your repo's landing page and select "manage topics."