-
Notifications
You must be signed in to change notification settings - Fork 32.6k
Description
Model description
DINO (do not confuse it with the DINO image encoder from META) is a SOTA DETR-like object detector, improving the denoising training, query initialization, and box prediction. It is based on a combination of the enhancement brought by DN-DETR , DAB-DETR , and Deformable DETR.
As it is used as backbone for many other DETR architecture (e.g Co-DETR which is SOTA on COCO test-dev : https://paperswithcode.com/sota/object-detection-on-coco), it would be nice to have it in transformers.
Additionnaly, a slighly improved version of DINO, called Stable-DINO, also exist, and should be easily added on top of DINO (only a few lines of code).
Open source status
- The model implementation is available
- The model weights are available
Provide useful links for the implementation
Paper : https://arxiv.org/abs/2203.03605
Code : https://github.com/IDEA-Research/DINO
Code for Stable-DINO : https://github.com/IDEA-Research/Stable-DINO