[Feature Request] Add configuration to never unload a model

Would it be possible to add an option to never unload a model?

Here's my use case:
* I use multiple different LLMs, for different purposes or even just for experimentation.
* But the embeddings and reranking models are always the same.
* Also, my coding assistant tool calls the embeddings model quite often.

So I'd like to be able to:
* Have the embeddings and reranking models always loaded and ready to go, and
* Never unload the "main LLM" I'm using when I need to use the embeddings and reranking models.
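
The behavior described above could be sketched roughly as follows, in Python. This is only an illustration under assumptions: every name here (`ModelSwapper`, `pinned`, `request`) is hypothetical and does not reflect the project's actual configuration or API. The idea is that pinned models are never unloaded, and loading a pinned model never evicts anything else.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ModelSwapper:
    """Toy model manager (hypothetical, for illustration only)."""

    # Models that must never be unloaded (e.g. embeddings, reranker).
    pinned: set[str] = field(default_factory=set)
    # Models currently held in memory.
    loaded: set[str] = field(default_factory=set)

    def request(self, model: str) -> list[str]:
        """Make `model` available and return the models unloaded to do so."""
        if model in self.loaded:
            return []
        unloaded: list[str] = []
        if model not in self.pinned:
            # A swappable model replaces the other swappable models,
            # but pinned models are left alone.
            unloaded = sorted(self.loaded - self.pinned)
            self.loaded -= set(unloaded)
        # Requesting a pinned model unloads nothing.
        self.loaded.add(model)
        return unloaded


swapper = ModelSwapper(pinned={"embeddings", "reranker"})
swapper.request("llm-a")       # load the main LLM
swapper.request("embeddings")  # main LLM stays loaded
swapper.request("llm-b")       # swaps out llm-a, keeps embeddings
```

With something like this, only the always-on models need to be listed once. Any newly added LLM is treated as swappable by default, so no extra configuration is needed each time one is added.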

I guess I could achieve this with profiles, but that would require creating a new profile every time I add a new LLM (and I do that a lot).


[Feature Request] Add configuration to never unload a model #99
