I tried offline decoding with hlg_decode.cu and online decoding with online_decode.cu. Here are the results:
- For the LibriSpeech conformer CTC model: offline decoding: 3.49% WER, online decoding: 19.08% WER
- For our model: offline decoding: ~3% WER, online decoding: ~18% WER
(The WER for online decoding is much higher than for offline decoding, even though both use the same AM output. Online decoding uses a chunk size of 16.)
Could you please explain the difference between offline decoding and online decoding? In addition, could you share the results you get with the two kinds of decoding?
Thanks!