If not in use, the GPU memory usage is too high.
Could you check the guide here? https://huggingface.co/docs/transformers/main/en/perf_train_gpu_one#gradient-checkpointing
· Sign up or log in to comment