Skip to content

Request to add DINO object detector #36205

@tcourat

Description

@tcourat

Model description

DINO (do not confuse it with the DINO image encoder from META) is a SOTA DETR-like object detector, improving the denoising training, query initialization, and box prediction. It is based on a combination of the enhancement brought by DN-DETR , DAB-DETR , and Deformable DETR.
As it is used as backbone for many other DETR architecture (e.g Co-DETR which is SOTA on COCO test-dev : https://paperswithcode.com/sota/object-detection-on-coco), it would be nice to have it in transformers.

Additionnaly, a slighly improved version of DINO, called Stable-DINO, also exist, and should be easily added on top of DINO (only a few lines of code).

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

Paper : https://arxiv.org/abs/2203.03605
Code : https://github.com/IDEA-Research/DINO

Code for Stable-DINO : https://github.com/IDEA-Research/Stable-DINO

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions