Description
Support for vicuna-13b-delta-v1.1
NOTE: It's not listed in the transformers supported-models list, but it does work with transformers.
Reason for request
With the upcoming WebGPU support in ONNX Runtime, I believe it'll be really helpful to have LLM support for browser-based applications, and this repo is the best solution we have so far.
Additional context
I've been working on an AI assistant built with Electron and Cordova for desktop and mobile platforms respectively. I'm already using Transformers.js with Whisper for speech-to-text, and I intend to switch to WebGPU with JSEP as soon as it's available so I can leverage GPU compute to run larger models. I'm trying to build the project with as many open-source resources as possible, and having LLM support would be really nice instead of relying on the OpenAI APIs. This keeps the project cost-free for users, and user data privacy is another benefit. I'm really looking forward to seeing if this is going to be possible, and I'm willing to contribute as much as I can, being a complete novice in the ML community.
Thanks in advance