- France
-
03:46
(UTC +02:00)
Stars
esolithe / esobold
Forked from LostRuins/koboldcppEsobold - A fork of KoboldCPP with agent schenanigans and server side saving!
Universal MCT wrapper script for all Windows 10/11 versions from 1507 to 24H2!
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
exo-explore / llama98.c
Forked from karpathy/llama2.cInference Llama models in one file of pure C for Windows 98 running on 25-year-old hardware
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Stable Diffusion and Flux in pure C/C++
HimariO / llama.cpp.qwen2.5vl
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
Large-scale LLM inference engine
Adaption of KoboldCPP with the goal to add missing core features
llama.cpp fork with additional SOTA quants and improved performance
Yoshqu / koboldcpp-experimental
Forked from LostRuins/koboldcppA simple one-file way to run various GGML and GGUF models with KoboldAI's UI with experimental extensions.
LostRuins / koboldcpp
Forked from ggml-org/llama.cppRun GGUF models easily with a KoboldAI UI. One File. Zero Install.
xhedit / llama.cpp
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
YavorGIvanov / llama.cpp
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
An unsupervised model merging algorithm for Transformers-based language models.
sharpHL / llama.cpp
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
JohannesGaessler / llama.cpp
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++