Using KoboldCpp
For further details, refer to the full KoboldCpp documentation.
As an example, we will use the OpenChat 3.5 model, which is the model used on the demo instance. There are many other models to choose from.
Navigate to the model's download page and download one of the model files, such as openchat_3.5.Q5_K_M.gguf. Place this file inside the ./models directory.
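To confirm that the download landed in the right place and is a real model file rather than a saved error page, a small check like the following may help. It is a minimal sketch, not part of KoboldCpp: the function name find_gguf_models is hypothetical, and the only fact it relies on is that GGUF files begin with the four ASCII bytes "GGUF".

```python
from pathlib import Path

GGUF_MAGIC = b"GGUF"  # the first four bytes of a GGUF file


def find_gguf_models(models_dir="./models"):
    """Return the names of files in models_dir that start with the GGUF magic bytes.

    Hypothetical helper for illustration; it only inspects the file header
    and does not validate the rest of the model.
    """
    directory = Path(models_dir)
    if not directory.is_dir():
        return []

    found = []
    for path in sorted(directory.iterdir()):
        if not path.is_file():
            continue
        with path.open("rb") as handle:
            if handle.read(4) == GGUF_MAGIC:
                found.append(path.name)
    return found


if __name__ == "__main__":
    models = find_gguf_models()
    if models:
        print("Found GGUF models:", ", ".join(models))
    else:
        print("No GGUF models found in ./models")
```

Run it from the directory that contains ./models. An empty result usually means the file was saved elsewhere or the download was incomplete.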
First, select KoboldCpp as the backend in the client:
Then configure KoboldCpp:
Under "Use KoboldCpp", ensure that "Use Extra" is enabled. This lets the client use KoboldCpp's extra features, such as streaming.
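If the client cannot connect, it can be useful to check that the KoboldCpp server itself answers requests. The sketch below sends a single non-streaming generation request using only the Python standard library. Several things in it are assumptions rather than details from this guide: that KoboldCpp is listening at http://localhost:5001, that it exposes the KoboldAI-compatible /api/v1/generate endpoint, and that the response has the shape {"results": [{"text": ...}]}. Adjust these to match your setup.

```python
import json
import urllib.request


def build_generate_request(prompt, base_url="http://localhost:5001", max_length=64):
    """Build a POST request for the (assumed) /api/v1/generate endpoint."""
    payload = {"prompt": prompt, "max_length": max_length}
    return urllib.request.Request(
        base_url.rstrip("/") + "/api/v1/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, **kwargs):
    """Send the request and return the generated text.

    Assumes the response JSON looks like {"results": [{"text": "..."}]}.
    """
    request = build_generate_request(prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        body = json.load(response)
    return body["results"][0]["text"]


if __name__ == "__main__":
    # Requires a running KoboldCpp server with a model loaded.
    print(generate("Hello, my name is"))
```

Building the request separately from sending it keeps the URL and payload easy to inspect when debugging a misconfigured address or port.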