Conversation

jongirard

Needed Ollama for my own usage/implementation, so I added support for embedding and querying with local Ollama models. It currently uses nomic-embed-text (https://ollama.com/library/nomic-embed-text) for embeddings and llama3.2 for querying. Model selection could be adapted so the models are passed as arguments, but for my purposes I only needed these two models specifically.

Tested with Claude Desktop and working.
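
For reference, here's a minimal sketch of what the two calls look like against Ollama's native HTTP API (illustrative only; the helper names are hypothetical and this isn't the actual code in this PR):

```python
import requests

OLLAMA_URL = "http://localhost:11434"  # Ollama's default local port

def embed(text: str) -> list[float]:
    # nomic-embed-text returns a single embedding vector for the prompt
    resp = requests.post(
        f"{OLLAMA_URL}/api/embeddings",
        json={"model": "nomic-embed-text", "prompt": text},
    )
    resp.raise_for_status()
    return resp.json()["embedding"]

def query(prompt: str) -> str:
    # llama3.2 generates the answer; stream=False returns one JSON object
    resp = requests.post(
        f"{OLLAMA_URL}/api/generate",
        json={"model": "llama3.2", "prompt": prompt, "stream": False},
    )
    resp.raise_for_status()
    return resp.json()["response"]
```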

@Tyler-Hardin

Ollama has an OpenAI-compatible endpoint. If you set OPENAI_API_BASE="http://localhost:11434/v1" and set LLM_MODEL and EMBEDDING_MODEL accordingly, you should be able to get it to work with no changes.
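
For example, with the standard OpenAI Python client (the env-var fallbacks here are just illustrative defaults, not this project's actual config handling):

```python
import os
from openai import OpenAI

# Ollama ignores the API key, but the client requires one to be set.
client = OpenAI(
    base_url=os.environ.get("OPENAI_API_BASE", "http://localhost:11434/v1"),
    api_key="ollama",
)

# Embedding via the OpenAI-compatible endpoint
vector = client.embeddings.create(
    model=os.environ.get("EMBEDDING_MODEL", "nomic-embed-text"),
    input="text to embed",
).data[0].embedding

# Querying via chat completions
answer = client.chat.completions.create(
    model=os.environ.get("LLM_MODEL", "llama3.2"),
    messages=[{"role": "user", "content": "your question here"}],
).choices[0].message.content
```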
