Skip to content

Google Drive scripts: incorrect iteration #17

@SM20sam

Description

@SM20sam

Observe the structure of an entry in the dataset for a json file: https://huggingface.co/datasets/drive-bench/arena
[
{
"scene_token": "da41ecbc644b4915b84bb732e35ebf8c",
"frame_token": "7e4c3282bc2a4402b5d1d6705f9eb844",
"question_type": "robust_qas",
"question": "What is the current type of corruption?: A. Fog. B. Bit error. C. JPEG compression. D. Sensor failure",
"answer": "D. Sensor failure",
"tag": [
0
],
"image_path": {
"CAM_FRONT": "./CameraCrash/CAM_FRONT/n008-2018-08-30-15-52-26-0400__CAM_FRONT__1535658934012637.jpg",
"CAM_FRONT_LEFT": "./CameraCrash/CAM_FRONT_LEFT/n008-2018-08-30-15-52-26-0400__CAM_FRONT_LEFT__1535658934004799.jpg",
"CAM_FRONT_RIGHT": "./CameraCrash/CAM_FRONT_RIGHT/n008-2018-08-30-15-52-26-0400__CAM_FRONT_RIGHT__1535658934020482.jpg",
"CAM_BACK": "./CameraCrash/CAM_BACK/n008-2018-08-30-15-52-26-0400__CAM_BACK__1535658934037558.jpg",
"CAM_BACK_LEFT": "./CameraCrash/CAM_BACK_LEFT/n008-2018-08-30-15-52-26-0400__CAM_BACK_LEFT__1535658934047405.jpg",
"CAM_BACK_RIGHT": "./CameraCrash/CAM_BACK_RIGHT/n008-2018-08-30-15-52-26-0400__CAM_BACK_RIGHT__1535658934028113.jpg"
}
},
{

In google drive script, https://drive.google.com/drive/folders/18p9JRMNEVA-wBaKMJK8S5g5yykPp6kqf, llava1.6_dist.py incorrectly iterate over keys in image_path like "CAM_BACK" instead of actual paths to files

    filenames = batch['images'] # should be image_path
    batch_size = len(filenames)

    assert batch_size == 1, "Currently only support batch size 1"

    # Load images and build image placeholders and multi_modal_data
    image_placeholders = [''] * batch_size
    multi_modal_datas = [dict(image=[]) for _ in range(batch_size)]
    system_prompts = [self.system_prompt] * batch_size

    for idx, sample_filenames in enumerate(filenames):
        # Handle corruption if needed
        image_index = 1
        # Replace system prompt
        system_prompts[idx] = replace_system_prompt(system_prompts[idx], sample_filenames)
        for filename in sample_filenames:
             img_path = filename

In this situation, img_path would be equal to something like "CAM_BACK" and not a path to an image

The actual image path is not extracted like
./CameraCrash/CAM_BACK/n008-2018-08-30-15-52-26-0400__CAM_BACK__1535658934037558.jpg

so this error occurs; [Errno 2] No such file or directory: 'CAM_BACK'

The inference py file for llava1.5 is correct in the github.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions