Skip to content

Feature Request: Port the significance test? #40

@felix-schneider

Description

@felix-schneider

In moses, there is this script for determining the "true" BLEU score within a confidence interval. Unfortunately, it does not have the configurability that sacreBLEU has.

In order to compare systems with regard to statistical significance, it would be nice to have a similar script, but supporting sacreBLEU.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions