The server at ai.binarygnome.com returns 405 on POST /api/generate,
which means it speaks the OpenAI-compatible API rather than the native
Ollama format. Switch to /v1/chat/completions with messages[] payload
and data.choices[0].message.content response parsing.
Also add optional OLLAMA_API_KEY env var for servers that require a
Bearer token.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>