ezLocalai
ezLocalai will run a suite of local models for you and automatically handle the pipelines for using them. It is an API that turns any model multimodal. Additional functionality is built in for voice cloning text to speech (drop in a ~10 second voice clip of the person you want to clone the voice of) and a voice to text for easy voice communication as well as image generation entirely offline after the initial setup. It exposes OpenAI style endpoints for ease of use as a drop in OpenAI API replacement with more capabilities. Wrapping AGiXT around it turns your computer into an automation machine.
ezLocalai automatically scales to whatever your desired max tokens is. You just have to set them in your ezlocalai .env
file before running it.
Hardware requirements to run ezLocalai may be steep! Running 32k context with Meta-Llama-3-8B-Instruct
in ezLocalai requires 23GB VRAM and only running on CPU or NVIDIA GPU is supported by ezLocalai at this time. I often run ezLocalai on my laptop with 16GB VRAM running phi-2-dpo
with 16k max context and it works very well. You can reduce your VRAM usage by reducing your max tokens. Adjust your GPU_LAYERS
in your ezLocalai .env
file to reduce VRAM usage and offload to CPU.
Quick Start Guide
Follow the instructions for setting up ezLocalai at https://github.com/DevXT-LLC/ezlocalai. Once you have it installed and running with your desired models, you can use it with AGiXT by following the instructions below.
Update your agent settings
- Set
AI_PROVIDER
toezlocalai
. - Set
EZLOCALAI_API_KEY
to your API key that you set up with ezLocalai. - Set
EZLOCALAI_API_URL
to the URL that you set up with ezLocalai. The default ishttp://YOUR LOCAL IP:8091
. - Set
AI_MODEL
to whichever model you are running withezlocalai
. - Set
MAX_TOKENS
to the maximum number of input tokens. - Set
AI_TEMPERATURE
to the temperature you want to use for generation. This is a float value between 0 and 1. The default is1.33
. - Set
AI_TOP_P
to the top_p value you want to use for generation. This is a float value between 0 and 1. The default is0.95
. - Set
VOICE
to the voice you want to use for the generated audio. The default isDukeNukem
. You can add cloning TTS voices toezlocalai
by putting any ~10 second wav file in thevoices
directory of theezlocalai
repository and then setting theVOICE
variable to the name of the file without the.wav
extension.