This repo contains the code to reproduce the plots for the paper "Mean Spectral Normalization of Deep Neural Networks for Embedded Automation" [ArXiv Version] (Accepted for Oral presentation at IEEE CASE 2019) by Anand Krishnamoorthy Subramanian and Nak-Young Chong, 2019.
Spectral Normalization essentially induces a high level gradient sparsity in the network which is advatageous for deep neural nets, but dimishes the performance of medium and small neural nets. Our Mean Spectral Normalization (MSN) improves upon Spectral Normalization maintaining the gradient sparsity as improving the performance as close to Batch Norm.
The main cause for the poor performance of spectral normalization was identified to be the uncontrolled mean-drift effect (a remnant of the internal covariate shift), which MSN resolves; like Batch Norm - but with fewer parameters and computationally more efficient.
MSN works well for small, medium and alrge networks, comparable to that of Batch Norm, but with lesser number of trainable parameters.
Unsupervised Image Generation using MSNGAN on CIFAR10
The following are required to successfully run the code -
- Python 3.6.8
- Pytorch 1.0.0
- CUDA 9.0.176
- Numpy 1.15.4
- Matplotlib 3.0.2
- Seaborn (optional) 0.9.0
- A minimum of 1 CUDA_capable GPU
Note: The above experiments were run on NVIDIA 1080 ti GPU.