
Support multi-adapter concurrent inference #973

@whybeyoung

Description


Feature request

Hey all,

Very glad to see PR #263, but when I tried it I found that it uses the set_adapter method to switch between LoRA adapters, which means requests are serialized: only one adapter can be active at a time, so effective concurrency is 1. Since a LoRA layer can run inference without being merged into the base model, could we support a way for users to dynamically combine the base model with LoRA adapters, so that concurrent calls can be made against the base model and multiple adapters at once?

Thanks.
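To illustrate the limitation, here is a minimal toy sketch (plain Python threading, not the real PEFT API): when the active adapter is mutable shared state on the model, every caller must hold a lock around set_adapter plus the forward pass, so requests for different adapters cannot overlap. The class and adapter names below are hypothetical stand-ins for illustration only.

```python
import threading

class ToyPeftModel:
    """Toy stand-in for a model whose adapter is selected via mutable
    shared state, as set_adapter-style switching does (NOT real PEFT)."""
    def __init__(self):
        self.active_adapter = None
        self.lock = threading.Lock()  # forced serialization point

    def set_adapter(self, name):
        # Mutates global state: affects every in-flight request.
        self.active_adapter = name

    def forward(self, x):
        # Output depends on whichever adapter is active *right now*,
        # not on which adapter the caller asked for.
        return (x, self.active_adapter)

model = ToyPeftModel()
results = []

def infer(adapter_name, x):
    # Without the lock, another thread could call set_adapter between
    # our set_adapter and forward, silently using the wrong adapter.
    # With the lock, results are correct but concurrency collapses to 1.
    with model.lock:
        model.set_adapter(adapter_name)
        results.append(model.forward(x))

threads = [
    threading.Thread(target=infer, args=(f"lora_{i}", i)) for i in range(4)
]
for t in threads:
    t.start()
for t in threads:
    t.join()

# Every request ran under the adapter it selected, but only one at a time.
assert all(adapter == f"lora_{x}" for x, adapter in results)
```

A per-request adapter argument (rather than model-level state) would remove the need for this lock and allow true concurrent calls.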

Motivation

This would save a lot of money: a single deployed base model could serve many LoRA adapters concurrently, instead of one deployment per adapter.

Your contribution

NA
