Prevent checking if model exists on every autocompletion #46

dhuertas · 2024-03-02T13:56:50Z

Hi!

Thanks for coding this little wonder of an extension. Kudos! I've been using it for a bit, and I have noticed that every autocompletion generates an extra request to the /api/tags endpoint in Ollama.

I suspect it comes from the call to ollamaCheckModel() in provideInlineCompletionItems():

llama-coder/src/prompts/provider.ts, line 89 in 996ac71:

    let modelExists = await ollamaCheckModel(inferenceConfig.endpoint, inferenceConfig.modelName);

In my view it should not be necessary to send a request to the /api/tags endpoint every time. I am aware the latency it introduces is orders of magnitude lower than that of the /api/generate call, but still, it's extra work that (in my view) the extension does not need to do.

I'd suggest a different strategy 🤔 Perhaps do the check once and save the list of available models, so that later checks can be done locally. Then check again whenever the configuration changes, or every now and then.
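
To make the suggestion concrete, here is a rough sketch of what such a cache could look like. It is not taken from the extension's code: `ModelExistsCache`, `ModelCheck`, the 60-second default and the injected clock are all hypothetical names and values chosen for illustration. The sketch assumes the existing check can be passed in as a function taking the endpoint and model name and resolving to a boolean, which is how `ollamaCheckModel()` is called above. Only positive results are cached, so a model that was missing and has just been downloaded is picked up on the next check.

```typescript
// Hypothetical signature, matching how ollamaCheckModel() is called in provider.ts.
type ModelCheck = (endpoint: string, modelName: string) => Promise<boolean>;

// Hypothetical helper: remembers for a while that a model was found,
// so /api/tags is not queried on every completion.
class ModelExistsCache {
    private readonly check: ModelCheck;
    private readonly ttlMs: number;
    private readonly now: () => number;
    // Key: endpoint + model name. Value: time of the last successful check.
    private readonly seen = new Map<string, number>();

    constructor(check: ModelCheck, ttlMs: number = 60_000, now: () => number = () => Date.now()) {
        this.check = check;
        this.ttlMs = ttlMs;
        this.now = now;
    }

    async exists(endpoint: string, modelName: string): Promise<boolean> {
        const key = `${endpoint}\n${modelName}`;
        const checkedAt = this.seen.get(key);
        // Fresh positive result: answer locally, no request.
        if (checkedAt !== undefined && this.now() - checkedAt < this.ttlMs) {
            return true;
        }
        // Unknown, expired, or previously missing: ask the server again.
        const found = await this.check(endpoint, modelName);
        if (found) {
            this.seen.set(key, this.now());
        } else {
            this.seen.delete(key);
        }
        return found;
    }

    // Call this when the configuration changes.
    invalidate(): void {
        this.seen.clear();
    }
}
```

In the extension, `exists()` would presumably replace the direct call in provideInlineCompletionItems(), and `invalidate()` could be wired to a configuration-change listener such as `vscode.workspace.onDidChangeConfiguration`. The time-to-live covers the "every now and then" part.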

Thanks!
