(1) According to #1, it is encouraged to use unlabeled data. My question is a bit tricky. Is that allowed to use the unlabeled data (i.e. passages only) of the out-of-domain datasets to fine-tune the pre-trained language model?
(2) Is there any whitelist of labeled non-QA datasets?