Allow parallel inference #724

@petrm

Description

Describe the feature you'd like

I would like the inference queue to allow parallel execution of jobs.

Describe the benefits this would bring to existing Hoarder users

My use case is having multiple load-balanced Ollama backends available; running inference jobs against them in parallel would speed up processing. A rough sketch of the idea is below.
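A minimal sketch of what this could look like, assuming a hypothetical comma-separated `OLLAMA_BASE_URLS` setting and a plain in-memory job list rather than Hoarder's actual queue: N concurrent workers drain the queue, and each request is dispatched round-robin across the configured Ollama backends.

```ts
// Hypothetical sketch, not Hoarder's real queue API. Assumes Node 18+
// (global fetch) and a comma-separated OLLAMA_BASE_URLS env var.
type InferenceJob = { prompt: string };

const backends = (process.env.OLLAMA_BASE_URLS ?? "http://localhost:11434")
  .split(",")
  .map((url) => url.trim());

let next = 0;
// Round-robin: each job goes to the next backend in the list.
function pickBackend(): string {
  const url = backends[next];
  next = (next + 1) % backends.length;
  return url;
}

async function runInference(job: InferenceJob): Promise<string> {
  // Ollama's /api/generate endpoint; "llama3" is a placeholder model.
  const res = await fetch(`${pickBackend()}/api/generate`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model: "llama3", prompt: job.prompt, stream: false }),
  });
  const body = (await res.json()) as { response: string };
  return body.response;
}

// N long-lived workers drain a shared job list concurrently, so overall
// throughput scales with the number of backends instead of being
// serialized through a single consumer.
async function processQueue(jobs: InferenceJob[], concurrency: number) {
  const workers = Array.from({ length: concurrency }, async () => {
    for (let job = jobs.shift(); job; job = jobs.shift()) {
      await runInference(job);
    }
  });
  await Promise.all(workers);
}
```

In practice the concurrency limit could be a setting that defaults to 1, so single-backend users keep today's behavior.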

Can the goal of this request already be achieved via other means?

Not easily.

  • a faster GPU with more RAM
  • multiple GPUs on the same host
  • something other than Ollama that can run jobs distributed across multiple machines (like https://github.com/exo-explore/exo, maybe?)

Have you searched for an existing open/closed issue?

  • I have searched for existing issues and none cover my fundamental request

Additional context

No response
