forked from ggml-org/llama.cpp
feat: Model override and server side saves #1373
Open: esolithe wants to merge 256 commits into LostRuins:concedo_experimental from esolithe:remoteManagement
…et it as the default in the example control page)
…ndependent of all the changes your config might bring
…ample of timing the reload with a new "live" endpoint
fix: Updating custom model support to fix issues when argument is not…
…es after Cedo's work
…n lite to import into TextDB as a document (only basic text files for now)
…dio based on whisper)
…e by sevenof9, adapted for KCPP by Esolithe)
…ontext used) by not including the agent JSONs themselves. Ensured that an agent must take an action on the first turn, rather than just stopping thinking right away. Added the option to prevent agent commands being used.
b9b6952 to 13cee48
…ent in text DB button. Fixed issue with slides not opening with correct value on text DB page. Increased max number of search results to 10.
…sistency with the agent
What's included
Note
This is in draft while I add documentation to methods, but the functionality has been dev tested. The latest server-side changes are with the Kobold folks to try out.
I tried to structure it like the other config features that have been added, reusing the launcher page, popup, etc. to minimise the structural differences, so hopefully it is a pretty smooth UX!
I have included the Kobold Lite changes in this PR to allow for direct testing. They are structured more like a usermod at the moment to allow easier portability between Lite versions (I wasn't sure if this would be integrated at all, so I wrote it like that). The Lite changes also require forge.js, which I embedded to allow client-side save encryption.
These will probably need to be introduced as a separate PR; this PR is mostly about implementing the API changes so that backend support is present 👍
Usage
Can be set through the launcher, but if you prefer the arguments:
Binaries for testing (releases also include some screenshots)
https://github.com/esolithe/koboldcpp/releases/