AI Image Generation
• Artists and designers are using these tools to explore novel visual styles, create digital
artwork, and generate new concepts.
• In entertainment, AI-generated images are used for special effects, game design, and
virtual world creation. Moreover, researchers are harnessing this technology for tasks
like data augmentation, image synthesis, and medical image generation.
• This project sits at the intersection of technology, creativity, and innovation, with
applications across a range of industries.
Objectives
• To create realistic, high-quality images. AI image generators can
produce images that are difficult to distinguish from real
photographs. This is useful for applications such as marketing
materials, realistic product renders, and digital art.
Literature Review
Name | Authors | Year | Presented at
Image Generation: a review. Neural Processing Letters | Elasri, M. O., Elharrouss, O., Al-Maadeed, S., & Tairi, H. | 2022 | Research Gate
Image Generation Based on Text Using BERT And GAN Model | Rohith, M., Pallavi, L., Shirisha, K., Sanjay, M., & Priya, V. | 2023 | IEEE
AI Illustrator: Art Illustration Generation Based on Generative Adversarial Network | Zi-Han, C., Chen, L., Zhao, Z., & Wang, Y. | 2020 | IEEE
Stacking VAE and GAN for Context-aware Text-to-Image Generation | Zhang, C., & Peng, Y. | 2018 | IEEE
AraBERT and DF-GAN fusion for Arabic text-to-image generation | Bahani, M., Ouaazizi, A. E., & Maalmi, K. | 2022 | Science Direct
MRP-GAN: Multi-resolution parallel generative adversarial networks for text-to-image synthesis. Pattern Recognition Letters | Qi, Z., Fan, C., Xu, L., Li, X., & Shu, Z. | 2021 | Science Direct
Image generation from text with entity information fusion. Knowledge Based Systems | Zhou, D., Sun, K., Hu, M., & He, Y. | 2021 | Science Direct
Literature Review
AI Illustrator: Art Illustration Generation Based on Generative
Adversarial Network
Zihan Chen, Lianghong Chen, Zhiyuan Zhao, Yue Wang
As the public's cultural needs grow, readers expect more from books,
and illustrating the relevant passages of a book has become a pressing
need. The traditional approach, in which illustrators produce each
illustration by hand, can no longer keep up with the growing book
market.
Stacking VAE and GAN for Context-aware Text-to-Image Generation
Hierarchical Text-Conditional Image Generation with CLIP Latents
Problem Definition
References
[1] Elasri, M. O., Elharrouss, O., Al-Maadeed, S., & Tairi, H. (2022). Image Generation: a review.
Neural Processing Letters, 54(5), 4609–4646. https://doi.org/10.1007/s11063-022-10777-x
[2] Rohith, M., Pallavi, L., Shirisha, K., Sanjay, M., & Priya, V. (2023). Image Generation Based on
Text Using BERT And GAN Model. IEEE. https://doi.org/10.1109/cictn57981.2023.10141495
[3] Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., & Chen, M. (2022). Hierarchical Text-Conditional
Image Generation with CLIP Latents. arXiv (Cornell University).
https://doi.org/10.48550/arxiv.2204.06125
[4] Zi-Han, C., Chen, L., Zhao, Z., & Wang, Y. (2020). AI Illustrator: Art Illustration Generation Based on
Generative Adversarial Network. IEEE. https://doi.org/10.1109/icivc50857.2020.9177494
[5] Zhang, C., & Peng, Y. (2018). Stacking VAE and GAN for Context-aware Text-to-Image Generation.
IEEE. https://doi.org/10.1109/bigmm.2018.8499439
[6] Bahani, M., Ouaazizi, A. E., & Maalmi, K. (2022). AraBERT and DF-GAN fusion for Arabic
text-to-image generation. Array, 16, 100260. https://doi.org/10.1016/j.array.2022.100260
[7] Qi, Z., Fan, C., Xu, L., Li, X., & Shu, Z. (2021). MRP-GAN: Multi-resolution parallel generative
adversarial networks for text-to-image synthesis. Pattern Recognition Letters, 147, 1–7.
https://doi.org/10.1016/j.patrec.2021.02.020
[8] Zhou, D., Sun, K., Hu, M., & He, Y. (2021). Image generation from text with entity information fusion.
Knowledge Based Systems, 227, 107200. https://doi.org/10.1016/j.knosys.2021.107200
[9] Zhang, Z., & Schomaker, L. (2022). DiverGAN: an efficient and effective Single-Stage framework for
diverse Text-to-Image generation. Neurocomputing, 473, 182–198.
https://doi.org/10.1016/j.neucom.2021.12.005
Copyright @Information Technology Dept, XIE