Skip to content

加载预训练模型出错 #30

@DoubleSsh

Description

@DoubleSsh

运行命令为python train.py --embedding --model_dir model_save_dir --dataset okvqa --validate
报错:Some weights of the model checkpoint at unc-nlp/lxmert-base-uncased were not used when initializing LxmertModel: ['obj_predict_head.decoder_dict.attr.weight', 'cls.predictions.decoder.weight', 'cls.predictions.bias', 'obj_predict_head.decoder_dict.obj.weight', 'obj_predict_head.decoder_dict.attr.bias', 'answer_head.logit_fc.2.weight', 'answer_head.logit_fc.3.weight', 'obj_predict_head.decoder_dict.obj.bias', 'answer_head.logit_fc.0.weight', 'obj_predict_head.transform.dense.weight', 'cls.predictions.transform.dense.weight', 'obj_predict_head.transform.LayerNorm.bias', 'cls.predictions.transform.LayerNorm.weight', 'answer_head.logit_fc.3.bias', 'obj_predict_head.transform.dense.bias', 'cls.seq_relationship.weight', 'cls.seq_relationship.bias', 'answer_head.logit_fc.0.bias', 'obj_predict_head.decoder_dict.feat.bias', 'answer_head.logit_fc.2.bias', 'cls.predictions.transform.dense.bias', 'obj_predict_head.decoder_dict.feat.weight', 'obj_predict_head.transform.LayerNorm.weight', 'cls.predictions.transform.LayerNorm.bias']

  • This IS expected if you are initializing LxmertModel from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
  • This IS NOT expected if you are initializing LxmertModel from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
    9009
    已杀死
    请问这是什么原因呢?预训练模型(unc-nlp/lxmert-base-uncased)我是有下载的。在linux上运行的。

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions