FP16 uses lesser GPU memory, but training speed unchanged #6711
athenawisdoms started this conversation in General.
I started exploring FP16 training on my Ubuntu machine with two RTX 2070S GPUs.
Without FP16, GPU 1 uses 3.3 GB of memory and the progress bar reports 15 it/s.
With FP16, GPU 1 uses 2.6 GB (a 21% decrease), but the progress bar reports only 16 it/s (a 7% increase). GPU utilization is 94%.
I am using PyTorch 1.7.0, PyTorch Lightning 1.2.5, Python 3.8, CUDA 10.2.89, and cuDNN 7.6.5.
Is this speedup much smaller than it should be? I was expecting close to (though slightly less than) a 50% reduction in GPU memory and perhaps a 1.5x increase in iterations per second.
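For context, here is roughly how the two runs are set up, a minimal sketch assuming the standard Lightning `Trainer(precision=16)` flag is what toggles FP16 (the model, data, and hyperparameters below are placeholders, not my actual training code):

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
import pytorch_lightning as pl

class LitModel(pl.LightningModule):
    def __init__(self):
        super().__init__()
        # Placeholder network; the real model is larger.
        self.net = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 1))

    def training_step(self, batch, batch_idx):
        x, y = batch
        return nn.functional.mse_loss(self.net(x), y)

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-3)

# Dummy dataset just to make the sketch runnable.
data = TensorDataset(torch.randn(1024, 32), torch.randn(1024, 1))
loader = DataLoader(data, batch_size=64)

# FP32 baseline: pl.Trainer(gpus=2, accelerator="ddp")
# FP16 run: precision=16, which uses native AMP (torch.cuda.amp) in Lightning 1.2.x.
trainer = pl.Trainer(gpus=2, accelerator="ddp", precision=16, max_epochs=1)
trainer.fit(LitModel(), loader)
```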