We have evaluated the performance of FastV on LLaVA-1.5-7B, and we find that the true results are significantly higher than those reported in your paper.
Specifically, we set this line in the official FastV repo to K=2, ratio=0.25 (i.e., keep 576 × 0.25 = 144 tokens), and we observed the results below:
| Method | Token number | VQAv2 | SQA |
|---|---|---|---|
| True FastV (using official code) | 144 | 74.06 | 69.06 |
| FastV in your paper | 128 | 61.8 | 67.3 |
| SparseVLM | 128 | 73.8 | 69.1 |
It seems that the FastV results in your paper are significantly understated?
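
For reference, the token budget and the score gaps in the table above follow from simple arithmetic. A minimal sketch is given here; the function and variable names are illustrative assumptions and are not taken from the FastV code:

```python
# Illustrative names only; nothing here is copied from the FastV repository.
IMAGE_TOKENS = 576  # visual tokens per image for LLaVA-1.5-7B, as used above


def kept_tokens(ratio: float, total: int = IMAGE_TOKENS) -> int:
    """Number of visual tokens retained when keeping `ratio` of `total`."""
    return int(total * ratio)


def gap(measured: float, reported: float) -> float:
    """Measured score minus the score reported in the paper, in points."""
    return round(measured - reported, 2)


budget = kept_tokens(0.25)      # ratio = 0.25, the setting used in our run
vqav2_gap = gap(74.06, 61.8)    # VQAv2: official code vs. paper
sqa_gap = gap(69.06, 67.3)      # SQA: official code vs. paper
print(budget, vqav2_gap, sqa_gap)
```

Note that the two token budgets differ slightly (144 in our run vs. 128 in the paper), so the gaps are indicative rather than an exact like-for-like comparison.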