Going deeper with convolutions, CVPR, 2015.
Deep residual learning for image recognition, CVPR, 2016.
Densely connected convolutional networks, IEEE CVPR, 2017.
Accurate, large minibatch SGD: Training ImageNet in 1 hour, 2017.
ImageNet training in minutes, 2017.
In-place activated BatchNorm for memory-optimized training of DNNs, Proceedings of CVPR, pp.5639-5647, 2018.
Memory-efficient implementation of DenseNets, 2017.
Going deeper in the automated identification of herbarium specimens, BMC Evolutionary Biology, vol.17, issue.1, p.181, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01580070
Multi-level 3D CNN for learning multi-scale spatial features, IEEE CVPR Workshops, 2019.
GVCNN: Group-view convolutional neural networks for 3D shape recognition, IEEE CVPR, pp.264-272, 2018.
Multi-view convolutional neural networks for 3D shape recognition, IEEE ICCV, pp.945-953, 2015.
Can spatiotemporal 3D CNNs retrace the history of 2D CNNs and ImageNet?, 2018.
CDC: Convolutional-de-convolutional networks for precise temporal action localization in untrimmed videos, IEEE CVPR, pp.5734-5743, 2017.
Batch normalization: Accelerating deep network training by reducing internal covariate shift, 2015.
How does batch normalization help optimization?, Advances in Neural Information Processing Systems, pp.2483-2493, 2018.
A graph theoretic framework of recomputation algorithms for memory-efficient backpropagation, 2019.
vDNN: Virtualized deep neural networks for scalable, memory-efficient neural network design, The 49th Annual IEEE/ACM International Symposium on Microarchitecture, p.18, 2016.
Dynamic memory management for GPU-based training of deep neural networks, IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2019.
Parallelized stochastic gradient descent, Advances in Neural Information Processing Systems, pp.2595-2603, 2010.
GPU asynchronous stochastic gradient descent to speed up neural network training, 2013.
Large scale distributed deep networks, Advances in Neural Information Processing Systems, pp.1223-1231, 2012.
Mathematical Programming: recent developments and applications, vol.6, pp.83-107, 1989.
MITgcm user manual, 2008.
Engineering Design Optimization using Calculus Level Methods, 2016.
Algorithm 799: Revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation, ACM Transactions on Mathematical Software (TOMS), vol.26, issue.1, pp.19-45, 2000.
Memory-efficient backpropagation through time, Advances in Neural Information Processing Systems, pp.4125-4133, 2016.
Training deep nets with sublinear memory cost, 2016.
Backpropagation for long sequences: beyond memory constraints with constant overheads, 2018.
Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory, Inria Bordeaux Sud-Ouest, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02352969
Efficient rematerialization for deep networks, Advances in Neural Information Processing Systems, pp.15-146, 2019.
Optimal gradient checkpoint search for arbitrary computation graphs, 2018.
Checkmate: Breaking the memory wall with optimal tensor rematerialization, 2019.
Automatic differentiation in PyTorch, 2017.
Periodic checkpointing in PyTorch, 2018.
Compressing DMA engine: Leveraging activation sparsity for training deep neural networks, 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA), pp.78-91, 2018.
Beyond the memory wall: A case for memory-centric HPC system for deep learning, 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), IEEE, pp.148-161, 2018.
Training deeper models by GPU memory optimization on TensorFlow, Proc. of ML Systems Workshop in NIPS, 2017.
TFLMS: Large model support in TensorFlow by graph rewriting, 2018.
Efficient memory management for GPU-based deep learning systems, 2019.
SuperNeurons: Dynamic GPU memory management for training deep neural networks, SIGPLAN Notices, vol.53, issue.1, pp.41-53, 2018.