Horovod CANDLE NT3 Benchmark 1.0; learning rate=0.001, batch size=40, epochs=4
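
The settings in this label can be recovered programmatically, which may help when comparing runs. The following is a minimal sketch, assuming the label always has the form "title; name=value, name=value, ..."; the function name `parse_hyperparameters` is hypothetical and not part of Horovod or CANDLE. Whole-number values such as a batch size written as 40.0 are converted to integers.

```python
# Hypothetical helper: not part of Horovod or the CANDLE benchmarks.
# Assumes the label format "title; name=value, name=value, ...".
LABEL = ("Horovod CANDLE NT3 Benchmark 1.0; "
         "learning rate=0.001, batch size=40.0, epochs=4.0")


def parse_hyperparameters(label: str) -> dict:
    """Return the name=value pairs that follow the first semicolon."""
    _, _, settings = label.partition(";")
    params = {}
    for item in settings.split(","):
        name, _, value = item.partition("=")
        number = float(value)  # float() tolerates surrounding whitespace
        key = name.strip().replace(" ", "_")
        # Batch size and epoch counts are integers; keep true fractions as floats.
        params[key] = int(number) if number.is_integer() else number
    return params


params = parse_hyperparameters(LABEL)
print(params)
```

The resulting dictionary uses underscore-separated keys (`learning_rate`, `batch_size`, `epochs`), which is a naming choice made here for illustration only.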