AICurious Logo

What is: Crossbow?

SourceCROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

Crossbow is a single-server multi-GPU system for training deep learning models that enables users to freely choose their preferred batch size—however small—while scaling to multiple GPUs. Crossbow uses many parallel model replicas and avoids reduced statistical efficiency through a new synchronous training method. SMA, a synchronous variant of model averaging, is used in which replicas independently explore the solution space with gradient descent, but adjust their search synchronously based on the trajectory of a globally-consistent average model.