Going deeper with convolutions, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. ,
Deep residual learning for image recognition, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. ,
Densely connected convolutional networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017. ,
Accurate, large minibatch sgd: Training imagenet in 1 hour, 2017. ,
Imagenet training in minutes, 2017. ,
In-place activated batchnorm for memory-optimized training of dnns, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.5639-5647, 2018. ,
Memory-efficient implementation of densenets, 2017. ,
vdnn: Virtualized deep neural networks for scalable, memory-efficient neural network design, The 49th Annual IEEE/ACM International Symposium on Microarchitecture, p.18, 2016. ,
Dynamic memory management for gpu-based training of deep neural networks, IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2019. ,
Parallelized stochastic gradient descent, Advances in neural information processing systems, pp.2595-2603, 2010. ,
Gpu asynchronous stochastic gradient descent to speed up neural network training, 2013. ,
Large scale distributed deep networks, Advances in neural information processing systems, pp.1223-1231, 2012. ,
Mathematical Programming: recent developments and applications, vol.6, pp.83-107, 1989. ,
Mitgcm user manual, 2008. ,
Engineering Design Optimization using Calculus Level Methods, 2016. ,
Algorithm 799: Revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation, ACM Transactions on Mathematical Software (TOMS), vol.26, issue.1, pp.19-45, 2000. ,
Memory-efficient backpropagation through time, Advances in Neural Information Processing Systems, pp.4125-4133, 2016. ,
Training deep nets with sublinear memory cost, 2016. ,
Backpropagation for long sequences: beyond memory constraints with constant overheads, 2018. ,
Automatic differentiation in pytorch, 2017. ,
, Periodic checkpointing in pytorch, 2018.