Abstract
Background: The image generation model based on generative adversarial network
(GAN) has achieved remarkable achievements. However, traditional GAN has the disadvantage of
unstable training, which affects the quality of the generated image.
Objective: This method is to solve the GAN image generation problems of poor image quality,
single image category, and slow model convergence.
Methods: An improved image generation method is proposed based on GAN. Firstly, the attention
mechanism is introduced into the convolution layer of the generator and discriminator and a batch
normalization layer is added after each convolution layer. Secondly, the ReLU and leaky ReLU are
used as the active layer of the generator and discriminator, respectively. Thirdly, the transposed
convolution is used in the generator while the small step convolution is used in the discriminator,
respectively. Fourthly, a new discarding method is applied in the dropout layer.
Results: The experiments were carried out on Caltech 101 dataset. The experimental results showed
that the image quality generated by the proposed method is better than that generated by GAN with
attention mechanism (AM-GAN) and GAN with stable training strategy (STS-GAN) and the stability
was improved.
Conclusion: The proposed method is effectiveness for image generation with high quality.
Graphical Abstract
[2]
K. Gregor, I. Danihelka, and A. Graves, "DRAW: a recurrent neural network for image generation", Proceedings of the 32nd International Conference on Machine Learning, Lille, France, 2015, pp. 1462-1471.
[3]
J. Chen, "A neuron-MOS-based VLSI implementation of pulse-coupled neural networks for image feature generation", IEEE Trans. Circ. Syst., vol. 57, no. 6, pp. 1143-1153, 2010.
[6]
A. Frühstück, K.K. Singh, and E. Shechtman, "InsetGAN for full-body image generation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 2022, pp. 7723-7732.
[7]
K. Katsumata, D.M. Vo, and H. Nakayama, "OSSGAN: Open-set semi-supervised image generation", Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 2022, pp. 11185-11193.
[9]
C. Han, H. Hayashi, and L. Rundo, "GAN-based synthetic brain MR image generation", 15th International Symposium on Biomedical Imaging, Washington, DC, USA, 2018, pp. 734-738.
[16]
T. Salimans, I. Goodfellow, and W. Zaremba, "Improved techniques for training GANs", Proceedings of the 30th International Conference on Neural Information Processing Systems, Barcelona, Spain, 2016, pp. 2234-2242.
[17]
M. Heusel, H. Ramsauer, and T. Unterthiner, "GANs trained by a two time-scale update rule converge to a local nash equilibrium", Adv. Neural Inf. Process. Syst., vol. 30, pp. 1-5, 2017.