RuntimeError: cuda runtime error (711) : peer mapping resources exhausted at /pytorch/aten/src/THC/THCGeneral.cpp:139
#apperently there is a limit of 8 for number of GPU that can be used
#so if you have more than 8 you will have to specify only 8
model = nn.DataParallel(model, device_ids=[0, 1, 2, 3, 4, 5 ,6 ,7])