Skip to content

Big gap in WER between online and offline CTC decoding #1194

@chiendb97

Description

@chiendb97

I tried offline decoding using hlg_decode.cu and online decoding using online_decode.cu. And here is the result:

  • For model librispeech conformer ctc: offline decoding: 3.49% WER, online decoding: 19.08% WER
  • For our model: offline decoding: ~3% WER, online decoding: ~18% WER
    (WER online decoding is much larger than offline decoding (both use the same am output), online decoding uses chunk size 16)

Could you please tell me the difference between offline decoding and online decoding? In addition, could you tell us the result of 2 kinds of decoding.
Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions