Using DistributedDataParallel to train a base model from scratch in the cloud
January 8, 2026 by kamal