reproduce BRIGHT result #209

@abdoelsayed2016

Hi, thanks for releasing Reason-ModernColBERT.

I’m trying to reproduce the Biology BRIGHT result reported in the model card (NDCG@10 = 33.25, i.e. 0.3325 as a fraction) using lightonai/Reason-ModernColBERT.

I evaluated Biology in two ways:

  1. ANN candidate retrieval + ColBERT reranking
  2. exact full ColBERT scoring against all Biology documents (no ANN pruning), with pytrec_eval-style metrics
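
For reference, the exact scoring in option 2 can be sketched as below. This is a minimal pure-Python illustration of the standard ColBERT late-interaction (MaxSim) score, assuming token embeddings are already computed and L2-normalized; it is not the script used for the runs above, and the names `maxsim_score` and `rank_exhaustively` are hypothetical helpers. A real run would use batched tensor operations instead of Python loops.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equally sized vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_tokens: Sequence[Vector], doc_tokens: Sequence[Vector]) -> float:
    """Late-interaction score: for each query token take the best-matching
    document token, then sum those maxima over all query tokens."""
    return sum(max(dot(q, d) for d in doc_tokens) for q in query_tokens)


def rank_exhaustively(
    query_tokens: Sequence[Vector],
    corpus: Dict[str, Sequence[Vector]],
    k: int = 10,
) -> List[Tuple[str, float]]:
    """Score every document in the corpus (no ANN pruning) and return the top k."""
    scored = [(doc_id, maxsim_score(query_tokens, toks)) for doc_id, toks in corpus.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy data (made up for illustration): 2-dimensional token embeddings.
query = [[1.0, 0.0], [0.0, 1.0]]
corpus = {
    "d1": [[1.0, 0.0], [0.0, 1.0]],
    "d2": [[1.0, 0.0]],
    "d3": [[0.6, 0.8]],
}
ranking = rank_exhaustively(query, corpus, k=3)
print(ranking)
```

Because every document is scored, any gap to the published number in this setting cannot come from candidate pruning; it would have to come from the inputs, the encoding, or the metric computation.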

In both cases, I get about NDCG@10 ~= 0.295 instead of 0.3325.

Exact run:

  • task: biology
  • split: examples
  • NDCG@10: 0.29534
  • MRR@10: 0.38032
  • MAP@100: 0.23691
  • Recall@100: 0.72108
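
To make the metric definition explicit, here is a minimal sketch of NDCG@k with linear gain and a log2 rank discount, which is how the trec_eval-style `ndcg_cut` measure is commonly described. It is an assumption that the published numbers use the same definition; `ndcg_at_k` and the toy qrels are hypothetical and only illustrate the computation.

```python
import math
from typing import Dict, List, Sequence


def dcg(gains: Sequence[float]) -> float:
    """Discounted cumulative gain with a log2(rank + 1) discount (ranks start at 1)."""
    return sum(g / math.log2(position + 2) for position, g in enumerate(gains))


def ndcg_at_k(ranked_doc_ids: List[str], qrels: Dict[str, int], k: int = 10) -> float:
    """NDCG@k for one query; documents missing from qrels count as non-relevant."""
    gains = [qrels.get(doc_id, 0) for doc_id in ranked_doc_ids[:k]]
    # The ideal ranking places all judged-relevant documents first, best grade first.
    ideal_gains = sorted((g for g in qrels.values() if g > 0), reverse=True)[:k]
    ideal = dcg(ideal_gains)
    return dcg(gains) / ideal if ideal > 0 else 0.0


# Toy example (made up): two relevant documents, one retrieved at rank 1, one at rank 3.
toy_qrels = {"a": 1, "c": 1}
toy_score = ndcg_at_k(["a", "b", "c"], toy_qrels, k=10)
print(round(toy_score, 5))
```

The per-query values are averaged over all queries of the task. Scores reported as percentages, such as 33.25, correspond to this fraction multiplied by 100.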

My environment currently shows:

  • checkpoint created with sentence-transformers 4.0.2
  • runtime has sentence-transformers 3.4.1
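
A small sketch for making this mismatch explicit before running an evaluation. It only compares version strings; `parse_version`, `describe_gap`, and `installed_version` are hypothetical helpers, and whether this version gap actually affects the scores is an open question, not something the snippet can show.

```python
import re
from importlib import metadata
from typing import Optional, Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Keep the first three numeric components, e.g. '4.0.2' -> (4, 0, 2)."""
    return tuple(int(n) for n in re.findall(r"\d+", version)[:3])


def describe_gap(saved_with: str, installed: str) -> str:
    """Say whether the runtime library is older or newer than the one that saved the checkpoint."""
    saved, current = parse_version(saved_with), parse_version(installed)
    if current < saved:
        return f"runtime {installed} is older than checkpoint {saved_with}"
    if current > saved:
        return f"runtime {installed} is newer than checkpoint {saved_with}"
    return "versions match"


def installed_version(package: str = "sentence-transformers") -> Optional[str]:
    """Return the installed version of a package, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Values taken from the environment described above.
print(describe_gap("4.0.2", "3.4.1"))
```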

So I wanted to ask:

  1. What exact evaluation script was used for the published BRIGHT numbers?
