Feature Request: Port the significance test?

In moses, there is [this script](https://github.com/moses-smt/mosesdecoder/blob/master/scripts/analysis/bootstrap-hypothesis-difference-significance.pl) for determining the "true" BLEU score within a confidence interval. Unfortunately, it does not have the configurability that sacreBLEU has.

In order to compare systems with regard to statistical significance, it would be nice to have a similar script, but supporting sacreBLEU.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Port the significance test? #40

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Feature Request: Port the significance test? #40

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions