Fig. 4From: Instance segmentation on distributed deep learning big data clusterIn Model parallelism, different parts of the large model are assigned to different nodes or machines. The intermediate results or activations need to be exchanged between machines or devices during forward propagation, and the gradients need to be passed backward between machines or devices during backward propagationBack to article page