Data movement bottlenecks to large-scale model training: Scaling past 1e28 FLOP November 3, 2024 by Comments