Nitro: A fast, lightweight 3MB inference server with OpenAI-Compatible API January 6, 2024 by Comments