The download of the dataset failed when evaluating the VLM #868

@moluzhui

Problem Description

The evaluation fails because the BLINK dataset cannot be downloaded: the request for BLINK.tsv is rejected with an SSL certificate verification error (CERTIFICATE_VERIFY_FAILED).

EvalScope Version (Required)

v1.0.2

Tools Used

  • Native / Native framework
  • Opencompass backend
  • VLMEvalKit backend
  • RAGEval backend
  • Perf / Model inference stress testing tool
  • Arena / Arena mode

Executed Code or Instructions

The code I ran, following the reference web page:

from evalscope import TaskConfig, run_task

task_cfg_dict = TaskConfig(
    work_dir='outputs',
    eval_backend='VLMEvalKit',
    eval_config={
        'data': ['BLINK'],
        'mode': 'all',
        'model': [
            {
                'api_base': 'http://xx.xx.xx.xx:xxx/v1/chat/completions',
                'key': 'HikLLM!@#',
                # Must be a name from the supported list (see Additional Information);
                # the model actually served is Qwen2.5-VL-7B-Instruct.
                'name': 'Qwen2-VL-7B-Instruct',
                'temperature': 0.6,
                'type': 'qwen2.5-vl-7b-instruct',
                'img_size': -1,
                'video_llm': False,
                'max_tokens': 2048,
            },
        ],
        'reuse': False,
        'nproc': 16,
        'judge': 'exact_matching',
    },
)

run_task(task_cfg=task_cfg_dict)

Error Log

[2025-10-13 19:16:58,728] WARNING - RUN - run.py: run_task - 168: --reuse is not set, will not reuse previous (before one day) temporary files
2025-10-13 19:16:58,728 - RUN - WARNING - --reuse is not set, will not reuse previous (before one day) temporary files
fatal: Not a git repository (or any parent up to mount point /data)
Stopping at filesystem boundary (GIT_DISCOVERY_ACROSS_FILESYSTEM not set).
/usr/python/lib/python3.11/site-packages/vlmeval/dataset/image_base.py:93: UserWarning: The dataset tsv is not downloaded
  warnings.warn('The dataset tsv is not downloaded')
BLINK.tsv: 0.00B [00:00, ?B/s]
2025-10-13 19:16:59,138 - root - WARNING - <class 'urllib.error.URLError'>: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1006)>
[2025-10-13 19:16:59,138] ERROR - RUN - run.py: run_task - 420: Model Qwen2-VL-7B-Instruct x Dataset BLINK combination failed: Failed to download https://opencompass.openxlab.space/utils/VLMEval/BLINK.tsv, skipping this combination.
Traceback (most recent call last):
  File "/usr/python/lib/python3.11/urllib/request.py", line 1348, in do_open
    h.request(req.get_method(), req.selector, req.data, headers,
  File "/usr/python/lib/python3.11/http/client.py", line 1303, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/python/lib/python3.11/http/client.py", line 1349, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/python/lib/python3.11/http/client.py", line 1298, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/python/lib/python3.11/http/client.py", line 1058, in _send_output
    self.send(msg)
  File "/usr/python/lib/python3.11/http/client.py", line 996, in send
    self.connect()
  File "/usr/python/lib/python3.11/http/client.py", line 1475, in connect
    self.sock = self._context.wrap_socket(self.sock,
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/ssl.py", line 517, in wrap_socket
    return self.sslsocket_class._create(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/ssl.py", line 1104, in _create
    self.do_handshake()
  File "/usr/python/lib/python3.11/ssl.py", line 1382, in do_handshake
    self._sslobj.do_handshake()
ssl.SSLCertVerificationError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1006)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/python/lib/python3.11/site-packages/vlmeval/smp/file.py", line 209, in download_file
    urllib.request.urlretrieve(url, filename=filename, reporthook=t.update_to)
  File "/usr/python/lib/python3.11/urllib/request.py", line 241, in urlretrieve
    with contextlib.closing(urlopen(url, data)) as fp:
                            ^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/urllib/request.py", line 216, in urlopen
    return opener.open(url, data, timeout)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/urllib/request.py", line 519, in open
    response = self._open(req, data)
               ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/urllib/request.py", line 536, in _open
    result = self._call_chain(self.handle_open, protocol, protocol +
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/urllib/request.py", line 496, in _call_chain
    result = func(*args)
             ^^^^^^^^^^^
  File "/usr/python/lib/python3.11/urllib/request.py", line 1391, in https_open
    return self.do_open(http.client.HTTPSConnection, req,
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/python/lib/python3.11/urllib/request.py", line 1351, in do_open
    raise URLError(err)
urllib.error.URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1006)>
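
The "unable to get local issuer certificate" message usually means that Python's OpenSSL could not find a CA certificate that chains to the server (a missing or outdated system CA bundle, or a TLS-intercepting proxy). That is a guess about this machine, not something the log confirms. The following is a minimal diagnostic sketch using only the standard library. The helper names `describe_ca_paths` and `use_ca_bundle` are hypothetical and are not part of EvalScope or VLMEvalKit.

```python
import os
import ssl


def describe_ca_paths():
    """Report where Python's default SSL context looks for CA certificates."""
    paths = ssl.get_default_verify_paths()
    return {
        # cafile / capath are None when the compiled-in location does not exist
        'cafile': paths.cafile,
        'capath': paths.capath,
        'openssl_cafile': paths.openssl_cafile,
        'openssl_capath': paths.openssl_capath,
        # Environment overrides that OpenSSL honors when loading default certs
        'SSL_CERT_FILE': os.environ.get('SSL_CERT_FILE'),
        'SSL_CERT_DIR': os.environ.get('SSL_CERT_DIR'),
    }


def use_ca_bundle(bundle_path):
    """Point default SSL contexts created afterwards at a specific CA bundle.

    Assumption: the bundle contains the CA that signed the dataset server's
    certificate. This must run before the download is attempted.
    """
    if not os.path.isfile(bundle_path):
        raise FileNotFoundError(f'CA bundle not found: {bundle_path}')
    os.environ['SSL_CERT_FILE'] = bundle_path
    return bundle_path


if __name__ == '__main__':
    for key, value in describe_ca_paths().items():
        print(f'{key}: {value}')
    # Possible usage, assuming the third-party certifi package is installed:
    #   import certifi
    #   use_ca_bundle(certifi.where())
```

If `cafile` and `capath` both come back as `None`, the interpreter has no default CA store, which would be consistent with the error above. Calling `use_ca_bundle` before `run_task` might then let the download verify, but this has not been tested against this environment.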

Running Environment

  • Operating System: CentOS 7.9
  • Python Version: Python 3.11

Additional Information

I was actually testing Qwen2.5-VL-7B-Instruct, but the model name assertion rejects that name, so I had to pick a name from the supported list instead (Qwen2-VL-7B-Instruct). This could cause the results to be recorded under the wrong model name.

AssertionError: Invalid models: ['Qwen2.5-VL-7B-Instruct'], refer to the following list to get proper model name: ['TransCore_M', 'PandaGPT_13B', 'flamingov2', 'VisualGLM_6b', 'mPLUG-Owl2', 'mPLUG-Owl3', 'OmniLMM_12B', 'MGM_7B', 'Bunny-llama3-8B', 'VXVERSE', 'paligemma-3b-mix-448', '360VL-70B', 'Llama-3-MixSenseV1_1', 'Parrot', 'OmChat', 'RBDash_72b', 'Pixtral-12B', 'Falcon2-VLM-11B', 'GPT4V', 'GPT4V_HIGH', 'GPT4V_20240409', 'GPT4V_20240409_HIGH', 'GPT4o', 'GPT4o_HIGH', 'GPT4o_20240806', 'GPT4o_20241120', 'GPT4o_MINI', 'GeminiPro1-0', 'GeminiPro1-5', 'GeminiFlash1-5', 'GeminiFlash2-0', 'GeminiPro1-5-002', 'GeminiFlash1-5-002', 'QwenVLPlus', 'QwenVLMax', 'RekaEdge', 'RekaFlash', 'RekaCore', 'Step1V', 'Step1.5V-mini', 'Yi-Vision', 'Claude3V_Opus', 'Claude3V_Sonnet', 'Claude3V_Haiku', 'Claude3-5V_Sonnet', 'Claude3-5V_Sonnet_20241022', 'GLM4V', 'GLM4V_PLUS', 'GLM4V_PLUS_20250111', 'abab6.5s', 'abab7-preview', 'CloudWalk', 'SenseChat-Vision', 'HunYuan-Vision', 'CustomAPIModel', 'HunYuan-Standard-Vision', 'BailingMM-Lite-1203', 'BailingMM-Pro-0120', 'BlueLM_V', 'JTVL', 'Taiyi', 'TeleMM', 'lmdeploy', 'lmdeploy_internvl_78B_MPO', 'lmdeploy_qvq_72B_preview', 'Taichu-VL-2B', 'DoubaoVL', 'llava-internlm2-7b', 'llava-internlm2-20b', 'llava-internlm-7b', 'llava-v1.5-7b-xtuner', 'llava-v1.5-13b-xtuner', 'llava-llama-3-8b', 'qwen_base', 'qwen_chat', 'qwen-vl-chat', 'monkey', 'monkey-chat', 'minimonkey', 'llava_v1.5_7b', 'llava_v1.5_13b', 'llava_v1_7b', 'sharegpt4v_7b', 'sharegpt4v_13b', 'llava_next_vicuna_7b', 'llava_next_vicuna_13b', 'llava_next_mistral_7b', 'llava_next_yi_34b', 'llava_next_llama3', 'llava_next_72b', 'llava_next_110b', 'llava_next_qwen_32b', 'llava_next_interleave_7b', 'llava_next_interleave_7b_dpo', 'llava-onevision-qwen2-0.5b-ov-hf', 'llava-onevision-qwen2-0.5b-si-hf', 'llava-onevision-qwen2-7b-ov-hf', 'llava-onevision-qwen2-7b-si-hf', 'llava_onevision_qwen2_0.5b_si', 'llava_onevision_qwen2_7b_si', 'llava_onevision_qwen2_72b_si', 
'llava_onevision_qwen2_0.5b_ov', 'llava_onevision_qwen2_7b_ov', 'llava_onevision_qwen2_72b_ov', 'Aquila-VL-2B', 'llava_video_qwen2_7b', 'llava_video_qwen2_72b', 'varco-vision-hf', 'InternVL-Chat-V1-1', 'InternVL-Chat-V1-2', 'InternVL-Chat-V1-2-Plus', 'InternVL-Chat-V1-5', 'Mini-InternVL-Chat-2B-V1-5', 'Mini-InternVL-Chat-4B-V1-5', 'InternVL2-1B', 'InternVL2-2B', 'InternVL2-4B', 'InternVL2-8B', 'InternVL2-26B', 'InternVL2-40B', 'InternVL2-76B', 'InternVL2-8B-MPO', 'InternVL2-8B-MPO-CoT', 'InternVL2_5-1B', 'InternVL2_5-2B', 'InternVL2_5-4B', 'InternVL2_5-8B', 'InternVL2_5-26B', 'InternVL2_5-38B', 'InternVL2_5-78B', 'InternVL2_5-1B-MPO', 'InternVL2_5-2B-MPO', 'InternVL2_5-4B-MPO', 'InternVL2_5-8B-MPO', 'InternVL2_5-26B-MPO', 'InternVL2_5-38B-MPO', 'InternVL2_5-78B-MPO', 'Yi_VL_6B', 'Yi_VL_34B', 'XComposer', 'sharecaptioner', 'XComposer2', 'XComposer2_1.8b', 'XComposer2_4KHD', 'XComposer2d5', 'MiniGPT-4-v2', 'MiniGPT-4-v1-7B', 'MiniGPT-4-v1-13B', 'idefics_9b_instruct', 'idefics_80b_instruct', 'idefics2_8b', 'Idefics3-8B-Llama3', 'instructblip_7b', 'instructblip_13b', 'deepseek_vl_7b', 'deepseek_vl_1.3b', 'deepseek_vl2_tiny', 'deepseek_vl2_small', 'deepseek_vl2', 'Janus-1.3B', 'MiniCPM-V', 'MiniCPM-V-2', 'MiniCPM-Llama3-V-2_5', 'MiniCPM-V-2_6', 'MiniCPM-o-2_6', 'cogvlm-grounding-generalist', 'cogvlm-chat', 'cogvlm2-llama3-chat-19B', 'glm-4v-9b', 'WeMM', 'cambrian_8b', 'cambrian_13b', 'cambrian_34b', 'chameleon_7b', 'chameleon_30b', 'Video-LLaVA-7B', 'Video-LLaVA-7B-HF', 'VideoChat2-HD', 'Chat-UniVi-7B', 'Chat-UniVi-7B-v1.5', 'LLaMA-VID-7B', 'Video-ChatGPT', 'PLLaVA-7B', 'PLLaVA-13B', 'PLLaVA-34B', 'Ovis1.5-Llama3-8B', 'Ovis1.5-Gemma2-9B', 'Ovis1.6-Gemma2-9B', 'Ovis1.6-Llama3.2-3B', 'Ovis1.6-Gemma2-27B', 'VILA1.5-3b', 'Llama-3-VILA1.5-8b', 'VILA1.5-13b', 'VILA1.5-40b', 'Mantis-8B-siglip-llama3', 'Mantis-8B-clip-llama3', 'Mantis-8B-Idefics2', 'Mantis-8B-Fuyu', 'MMAlaya', 'MMAlaya2', 'Phi-3-Vision', 'Phi-3.5-Vision', 'xgen-mm-phi3-interleave-r-v1.5', 
'xgen-mm-phi3-dpo-r-v1.5', 'Qwen-VL-Max-0809', 'Qwen-VL-Plus-0809', 'QVQ-72B-Preview', 'Qwen2-VL-72B-Instruct', 'Qwen2-VL-7B-Instruct', 'Qwen2-VL-7B-Instruct-AWQ', 'Qwen2-VL-7B-Instruct-GPTQ-Int4', 'Qwen2-VL-7B-Instruct-GPTQ-Int8', 'Qwen2-VL-2B-Instruct', 'Qwen2-VL-2B-Instruct-AWQ', 'Qwen2-VL-2B-Instruct-GPTQ-Int4', 'Qwen2-VL-2B-Instruct-GPTQ-Int8', 'XinYuan-VL-2B-Instruct', 'Slime-7B', 'Slime-8B', 'Slime-13B', 'Eagle-X4-8B-Plus', 'Eagle-X4-13B-Plus', 'Eagle-X5-7B', 'Eagle-X5-13B', 'Eagle-X5-13B-Chat', 'Eagle-X5-34B-Chat', 'Eagle-X5-34B-Plus', 'Moondream1', 'Moondream2', 'Llama-3.2-11B-Vision-Instruct', 'LLaVA-CoT', 'Llama-3.2-90B-Vision-Instruct', 'molmoE-1B-0924', 'molmo-7B-D-0924', 'molmo-7B-O-0924', 'molmo-72B-0924', 'Kosmos2', 'POINTS-Yi-1.5-9B-Chat', 'POINTS-Qwen-2.5-7B-Chat', 'POINTSV15-Qwen-2.5-7B-Chat', 'NVLM', 'Vintern-3B-beta', 'Vintern-1B-v2', 'h2ovl-mississippi-2b', 'h2ovl-mississippi-1b', 'Aria', 'SmolVLM', 'SmolVLM-DPO', 'SmolVLM-Synthetic', 'SAIL-VL-2B', 'valley_eagle', 'vita', 'vita_qwen2', 'ross-qwen2-7b', 'emu2_chat', 'emu3_chat', 'emu3_gen']
