Customizable tokenizer for RULER#1731
Conversation
|
https://github.com/open-compass/opencompass/blob/main/configs/eval_ruler.py On the other hand, the configuration (**_gen.py) is standard configurations for general evaluations. You are of course welcome to try your own configurations. |
|
Thanks for the review. The general workflow we use opencompass is via the CLI: |
|
* Customizable tokenizer for RULER * Relax requirements
* Customizable tokenizer for RULER * Relax requirements
* Customizable tokenizer for RULER * Relax requirements
Adding an optional environment variable
TOKENIZER_MODELwhich controls the tokenizer model to use for RULER data generation. With this option, the dataset length will be more precise when we evaluate models that do not usegpt-4tokenizer.