
Something is wrong in the revMHA Encoder, which projects the node coordinates to weird embeddings like [nan,nan,nan,nan,...] #1

@bxx-seu

Dear author, I ran the Pointerformer code locally and tested its performance on TSP tasks. However, for some instances the model fails to solve the task and returns inexplicable final solutions such as [0,0,0,0,0,0].

While debugging, I found that the problem is in the revMHA Encoder: the node embeddings it projects from the model input come out as NaN, e.g. [nan,nan,nan,nan,...]. Obviously, any solution decoded from these embeddings must be wrong.
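To narrow down where inside an encoder the values first go bad, one option is to run the computation stage by stage and stop at the first output that contains NaN or infinity. The following is a minimal, framework-free sketch of that idea, not Pointerformer's code: `contains_nonfinite`, `first_nonfinite_stage`, and the toy stages in `demo_stages` are all hypothetical names made up for illustration. It checks for infinity as well as NaN because an overflow to `inf` often comes one step before the NaN (for example, `inf - inf` is NaN).

```python
import math


def contains_nonfinite(values):
    """Return True if a float, or any float in a nested list/tuple, is NaN or +/-inf."""
    if isinstance(values, (list, tuple)):
        return any(contains_nonfinite(v) for v in values)
    return isinstance(values, float) and not math.isfinite(values)


def first_nonfinite_stage(stages, x):
    """Apply (name, function) stages in order to x.

    Returns the name of the first stage whose output contains NaN or inf,
    or None if every stage output is finite.
    """
    for name, fn in stages:
        x = fn(x)
        if contains_nonfinite(x):
            return name
    return None


# Hypothetical toy pipeline (an assumption, unrelated to the real encoder):
# "square" overflows to inf, and "center" would then turn inf - inf into NaN.
demo_stages = [
    ("scale", lambda xs: [v * 1e200 for v in xs]),
    ("square", lambda xs: [v * v for v in xs]),
    ("center", lambda xs: [v - max(xs) for v in xs]),
]

print(first_nonfinite_stage(demo_stages, [0.5, 1.0]))  # prints "square"
```

With PyTorch tensors, the equivalent per-stage check would typically be `torch.isfinite(t).all()` applied to the output of each encoder sub-layer.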

In config.yaml, the only change I made was setting "precision" from 16 to 64, because the original setting raised "RuntimeError: expected scalar type Double but found Half".

I attach a test set for verification. It contains four instances: the first is solved normally, while the remaining three cannot be solved.
testset.txt
