ezLocalai
ezLocalai will run a suite of local models for you and automatically handle the pipelines for using them. It is an API that turns any model multimodal. Additional functionality is built in for voice cloning text to speech (drop in a ~10 second voice clip of the person you want to clone the voice of) and a voice to text for easy voice communication as well as image generation entirely offline after the initial setup. It exposes OpenAI style endpoints for ease of use as a drop in OpenAI API replacement with more capabilities. Wrapping AGiXT around it turns your computer into an automation machine.
ezLocalai automatically scales to whatever your desired max tokens is. You just have to set them in your ezlocalai .env file before running it.
Hardware requirements to run ezLocalai may be steep! Running 32k context with Meta-Llama-3-8B-Instruct in ezLocalai requires 23GB VRAM and only running on CPU or NVIDIA GPU is supported by ezLocalai at this time. I often run ezLocalai on my laptop with 16GB VRAM running phi-2-dpo with 16k max context and it works very well. You can reduce your VRAM usage by reducing your max tokens. Adjust your GPU_LAYERS in your ezLocalai .env file to reduce VRAM usage and offload to CPU.
Quick Start Guide
Follow the instructions for setting up ezLocalai at https://github.com/DevXT-LLC/ezlocalai. Once you have it installed and running with your desired models, you can use it with AGiXT by following the instructions below.
Update your agent settings
- Set
AI_PROVIDERtoezlocalai. - Set
EZLOCALAI_API_KEYto your API key that you set up with ezLocalai. - Set
EZLOCALAI_API_URLto the URL that you set up with ezLocalai. The default ishttp://YOUR LOCAL IP:8091. - Set
EZLOCALAI_MODELto whichever model you are running withezlocalai. - Set
EZLOCALAI_MAX_TOKENSto the maximum number of input tokens. - Set
EZLOCALAI_TEMPERATUREto the temperature you want to use for generation. This is a float value between 0 and 1. The default is1.33. - Set
EZLOCALAI_TOP_Pto the top_p value you want to use for generation. This is a float value between 0 and 1. The default is0.95. - Set
EZLOCALAI_VOICEto the voice you want to use for the generated audio. The default isDukeNukem. You can add cloning TTS voices toezlocalaiby putting any ~10 second wav file in thevoicesdirectory of theezlocalairepository and then setting theVOICEvariable to the name of the file without the.wavextension.