Description
Search before asking
- I have searched the YOLOv5 issues and found no similar bug report.
YOLOv5 Component
Training
Bug
Hi, I'm new to YOLO and I am getting this error message when training YOLOv5x6:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 1936.00 GiB (GPU 1; 11.17 GiB total capacity; 2.15 GiB already allocated; 7.62 GiB free; 3.02 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
For context, I am trying to train YOLOv5x6 on the PubLayNet dataset (article here, github here) to compare the results with the DocLayNet dataset, which has already been tested with YOLOv5x6 (article here, github here).
I am doing this with a base image size of 640 and a batch size of 8, running in Distributed Data Parallel (DDP) mode on 2 K80 GPUs, roughly as shown below.
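For reference, the launch command looks roughly like this (the dataset yaml name publaynet.yaml and the master port are placeholders for my local setup):

# DDP launch across 2 GPUs with YOLOv5's train.py
python -m torch.distributed.run --nproc_per_node 2 --master_port 1234 train.py \
    --weights yolov5x6.pt \
    --data publaynet.yaml \
    --img 640 \
    --batch 8 \
    --device 0,1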
During training, GPU memory usage looks normal.
But about 80% of the way through the first epoch I get the above error message. Any clues as to why the model would try to allocate such an enormous amount of memory, and how to fix it?
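If it matters, the only workaround I can see is the one hinted at in the error message itself, i.e. setting max_split_size_mb via PYTORCH_CUDA_ALLOC_CONF before launching. A sketch of what that would look like (128 is an arbitrary starting value; I have not confirmed whether this actually helps):

# Ask the PyTorch CUDA caching allocator to split large blocks, then relaunch training
export PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:128
python -m torch.distributed.run --nproc_per_node 2 train.py --weights yolov5x6.pt --img 640 --batch 8 --device 0,1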
Environment
- YOLO: YOLOv5x6
- OS: Ubuntu 20.04
- Python: 3.9.12
Minimal Reproducible Example
No response
Additional
No response
Are you willing to submit a PR?
- Yes I'd like to help by submitting a PR!

