A tool for running large language models locally, supporting models such as Llama, Mistral, and Code Llama through a simple API.