0% found this document useful (0 votes)

25 views

AI Image Generation

Uploaded by

Sarvesh Chavan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views

AI Image Generation

Uploaded by

Sarvesh Chavan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

AI Image Generation

Harsh Dalvi (202003004)

Bhanu Sunka (202003057)
Osama Shaikh (202003035)
Sarvesh Chavan (202003040)

Copyright @Information Technology Dept , XIE

Introduction
• AI image generation has found applications in various domains, such as art, design,
entertainment, and even scientific research.

• Artists and designers are using these tools to explore novel visual styles, create digital
artwork, and generate new concepts.

• In entertainment, AI-generated images are used for special effects, game design, and
virtual world creation. Moreover, researchers are harnessing this technology for tasks
like data augmentation, image synthesis, and medical image generation.

• This project intersects technology, creativity, and innovation, promising a diverse scope
of applications across various industries.

AI Image Generation 2
Objectives
• To create realistic and high-quality images. AI image generators can
be used to create images that are indistinguishable from real
photographs. This is useful for a variety of applications, such as
creating marketing materials, generating realistic product renders,
or creating digital art.

• To automate the image creation process. AI image generators can

automate the process of creating images, which can save time and
money.

• AI image generators can be used to create images that are beyond

the capabilities of human artists.

AI Image Generation 3
Literature Review
Name Authors Year Presented at

Image Generation: a review Neural Elasri, M. O., Elharrouss, O., Research Gate 2022
Processing Letters Al-Maadeed, S., & Tairi, H.

V. Image Generation Based on Rohith, M., Pallavi, L., Shirisha, K., IEEE 2023
Text Using BERT And GAN Model. Sanjay, M., & Priya

Hierarchical Text-Condition Ramesh, A., Dhariwal, P., Nichol, IEEE 2022

Image Generation with CLIP A., Chu, C., & Chen, M.
Latents.

AI Illustrator: Art Illustration Zi-Han, C., Chen, L., Zhao, Z., & IEEE 2020
Generation Based on Generative Wang, Y.
Adversarial Network

Stacking VAE and GAN for Zhang, C., & Peng, Y. IEEE 2018
Context-aware Text-to-Image
Generation
Name Authors Year Presented at

AraBERT and DF-GAN fusion for Bahani, M., Ouaazizi, A. E., & 2022 Science
Arabic text-to-image generation. Maalmi, K. Direct

MRP-GAN: Multi-resolution Qi, Z., Fan, C., Xu, L., Li, X., & Shu, Z 2021 Science
parallel generative adversarial Direct
networks for text-to-image
synthesis. Pattern Recognition
Letters

Image generation from text with Zhou, D., Sun, K., Hu, M., & He, Y. 2021 Science
entity information fusion. Direct
Knowledge Based Systems

DiverGAN: an efficient and Zhang, Z., & Schomaker, L. 2022 Science

effective Single-Stage framework Direct
for diverse Text-to-Image
generation.

AI Image Generation 5
Literature Review
AI Illustrator: Art Illustration Generation Based on Generative
Adversarial Network

Zihan Chen, Lianghong Chen, Zhiyuan Zhao, Yue Wang With the
improvement of the spiritual need of the public, people have higher
requirements for books, among which the illustration of the relevant
words in the books is an urgent solution. The traditional method of
completing the related illustrations by illustrators has been unable to
meet the need of the growing book market.

AI Image Generation
6
Stacking VAE and GAN for Context-aware Text-to-Image Generation

Chenrui Zhang and Yuxin Peng As an attractive research topic, text-to-image

generation has been receiving extensive attention in computer vision and natural
language processing communities. This task aims to generate realistic images
conditioned on text description, which has widespread applications in various
fields, including photo editing and data augmentation, etc. Meanwhile, learning to
generate is a promising paradigm for semi-supervised and unsupervised learning,
which can provide meaningful hints for deep models’ interpretability.

AI Image Generation 7
Hierarchical Text-Conditional Image Generation with CLIP Latents

Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu Recent

progress in computer vision has been driven by scaling models on large
datasets of captioned images collected from the internet. Within this
framework, CLIP has emerged as a successful representation learner for
images. CLIP embeddings have a number of desirable properties: they
are robust to image distribution shift, have impressive zero-shot
capabilities, and have been fine-tuned to achieve state-of-the-art results
on a wide variety of vision and language tasks .

AI Image Generation
8
Problem Definition

• Generating images that are realistic and coherent with the

provided input can be challenging.

• Handling of the diverse inputs and generate appropriate images.

• Balancing the trade-off between quality and speed is essential.

• Generating novel and original images that go beyond mere

replication of the training data is a critical challenge.

AI Image Generation
6
References

[1]Elasri, M. O., Elharrouss, O., Al-Maadeed, S., & Tairi, H. (2022). Image Generation: a review.
Neural Processing Letters, 54(5), 4609–4646. https://doi.org/10.1007/s11063-022-10777-x

[2]Rohith, M., Pallavi, L., Shirisha, K., Sanjay, M., & Priya, V. (2023). Image Generation Based on
Text Using BERT And GAN Model. IEEE. https://doi.org/10.1109/cictn57981.2023.10141495

[3]Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., & Chen, M. (2022). Hierarchical Text-Conditional
Image Generation with CLIP Latents. arXiv (Cornell University).
https://doi.org/10.48550/arxiv.2204.06125

[4]Zi-Han, C., Chen, L., Zhao, Z., & Wang, Y. (2020). AI Illustrator: Art Illustration Generation Based on
Generative Adversarial Network. IEEE. https://doi.org/10.1109/icivc50857.2020.9177494

[5]Zhang, C., & Peng, Y. (2018). Stacking VAE and GAN for Context-aware Text-to-Image Generation.
IEEE. https://doi.org/10.1109/bigmm.2018.8499439

AI Image Generation
7
References
[6]Bahani, M., Ouaazizi, A. E., & Maalmi, K. (2022). AraBERT and DF-GAN fusion for Arabic
text-to-image generation. Array, 16, 100260. https://doi.org/10.1016/j.array.2022.100260

[7]Qi, Z., Fan, C., Xu, L., Li, X., & Shu, Z. (2021). MRP-GAN: Multi-resolution parallel generative
adversarial networks for text-to-image synthesis. Pattern Recognition Letters, 147, 1–7.
https://doi.org/10.1016/j.patrec.2021.02.020

[8]Zhou, D., Sun, K., Hu, M., & He, Y. (2021). Image generation from text with entity information fusion.
Knowledge Based Systems, 227, 107200. https://doi.org/10.1016/j.knosys.2021.107200

[9]Zhang, Z., & Schomaker, L. (2022). DiverGAN: an efficient and effective Single-Stage framework for
diverse Text-to-Image generation. Neurocomputing, 473, 182–198.
https://doi.org/10.1016/j.neucom.2021.12.005

AI Image Generation
8
Copyright @Information Technology Dept , XIE 9

DLL COT1 English 7
100% (5)
DLL COT1 English 7
2 pages
70 20 10 Framework
100% (1)
70 20 10 Framework
20 pages
Challenge - Stage 7 - Global Brands - tcm143-467513
100% (1)
Challenge - Stage 7 - Global Brands - tcm143-467513
4 pages
Image Generation A Review
No ratings yet
Image Generation A Review
39 pages
ppt1
No ratings yet
ppt1
20 pages
Generating AI Text to Image A Comprehensive Guide
No ratings yet
Generating AI Text to Image A Comprehensive Guide
3 pages
Dynamic Image Generation From Text Prompt Research Paper-JOT-5135
100% (1)
Dynamic Image Generation From Text Prompt Research Paper-JOT-5135
7 pages
BTP_6 sem_part1
No ratings yet
BTP_6 sem_part1
40 pages
ai-image-generator
No ratings yet
ai-image-generator
37 pages
Base Paper Batch 9 Final Updated 3
No ratings yet
Base Paper Batch 9 Final Updated 3
10 pages
An Adaptive Approach To Text To Image
No ratings yet
An Adaptive Approach To Text To Image
5 pages
Text-to-Image Synthesis With Generative Models Met
No ratings yet
Text-to-Image Synthesis With Generative Models Met
16 pages
Text To Image Synthesis Using Generative Adversarial Networks
No ratings yet
Text To Image Synthesis Using Generative Adversarial Networks
10 pages
Image Synthesis From An Ethical Perspective: Oliver Bendel
No ratings yet
Image Synthesis From An Ethical Perspective: Oliver Bendel
10 pages
Intro to Image Generation With AI
No ratings yet
Intro to Image Generation With AI
2 pages
Image-Dev An Advance Text To Image AI Model
No ratings yet
Image-Dev An Advance Text To Image AI Model
6 pages
Documents 5
No ratings yet
Documents 5
5 pages
Indian Institute OF Information Technology Allahabad: Text To Image Synthesis
No ratings yet
Indian Institute OF Information Technology Allahabad: Text To Image Synthesis
8 pages
Image Synthesis From an Ethical Perspective
No ratings yet
Image Synthesis From an Ethical Perspective
11 pages
Parag
No ratings yet
Parag
20 pages
Text-to-Image Generation Using Deep Learning
No ratings yet
Text-to-Image Generation Using Deep Learning
6 pages
Meta
No ratings yet
Meta
17 pages
AI Art in Architecture
No ratings yet
AI Art in Architecture
11 pages
Best AI Image Generator
100% (1)
Best AI Image Generator
12 pages
NLP Based Image Generation Usiing Ai
No ratings yet
NLP Based Image Generation Usiing Ai
59 pages
Presentation1
No ratings yet
Presentation1
64 pages
(Paper ID -321) Exploring the various Machine Learning Models for Image Generation - A Comprehensive Survey Unlocking the Future of Digital Creativity
No ratings yet
(Paper ID -321) Exploring the various Machine Learning Models for Image Generation - A Comprehensive Survey Unlocking the Future of Digital Creativity
15 pages
s40745-024-00544-1
No ratings yet
s40745-024-00544-1
30 pages
Engproc 20 00016 With Cover
No ratings yet
Engproc 20 00016 With Cover
7 pages
New Microsoft Word Document (2)
No ratings yet
New Microsoft Word Document (2)
8 pages
98152bdf-d3c6-4d64-8f54-5cfe41c88dda_Background_and_Literature_Review
No ratings yet
98152bdf-d3c6-4d64-8f54-5cfe41c88dda_Background_and_Literature_Review
17 pages
b383fba0-f67c-4a5a-aad0-fd288516352c_Background_and_Literature_Review
No ratings yet
b383fba0-f67c-4a5a-aad0-fd288516352c_Background_and_Literature_Review
7 pages
Rishab Paper Final
No ratings yet
Rishab Paper Final
7 pages
Image generator
No ratings yet
Image generator
11 pages
Ai Image Generator
No ratings yet
Ai Image Generator
20 pages
Photographic Text-to-Image Synthesis With A Hierarchically-Nested Adversarial Network
No ratings yet
Photographic Text-to-Image Synthesis With A Hierarchically-Nested Adversarial Network
10 pages
Final All Correct
No ratings yet
Final All Correct
49 pages
Dehouce
No ratings yet
Dehouce
12 pages
MPAI05_FINAL DOCUMENT
No ratings yet
MPAI05_FINAL DOCUMENT
40 pages
Unit - 4
No ratings yet
Unit - 4
46 pages
Text-to-Image_Synthesis_With_Generative_Models_Methods_Datasets_Performance_Metrics_Challenges_and_Future_Direction_Basiv
No ratings yet
Text-to-Image_Synthesis_With_Generative_Models_Methods_Datasets_Performance_Metrics_Challenges_and_Future_Direction_Basiv
16 pages
SMS Spam Detection Using Machine Learning
No ratings yet
SMS Spam Detection Using Machine Learning
68 pages
DeepPov GAI
100% (1)
DeepPov GAI
47 pages
Unleashing the Power of Image Generators
No ratings yet
Unleashing the Power of Image Generators
11 pages
ImageGenerationwithGans basedTechniquesASurvey
No ratings yet
ImageGenerationwithGans basedTechniquesASurvey
19 pages
A Pathway Towards Responsible AI Generated Content
No ratings yet
A Pathway Towards Responsible AI Generated Content
12 pages
Sample Report PDF
No ratings yet
Sample Report PDF
25 pages
project 4 report(Rohit&Gayatri)
No ratings yet
project 4 report(Rohit&Gayatri)
36 pages
Building A System That Can Generate High
No ratings yet
Building A System That Can Generate High
2 pages
What's in A Text-To-Image Prompt The Potential of Stable Diffusion in Visual Arts Education
No ratings yet
What's in A Text-To-Image Prompt The Potential of Stable Diffusion in Visual Arts Education
12 pages
Utilizing Generative AI for Text-To-Image Generation
No ratings yet
Utilizing Generative AI for Text-To-Image Generation
6 pages
Generativeai Cheatsheet
No ratings yet
Generativeai Cheatsheet
8 pages
Algorithms 17 00136
No ratings yet
Algorithms 17 00136
20 pages
AI Research 1
No ratings yet
AI Research 1
37 pages
Tao DF-GAN A Simple and Effective Baseline For Text-to-Image Synthesis CVPR 2022 Paper
No ratings yet
Tao DF-GAN A Simple and Effective Baseline For Text-to-Image Synthesis CVPR 2022 Paper
11 pages
Text-to-image generation using Generative AI
No ratings yet
Text-to-image generation using Generative AI
5 pages
Introduction To Recurrent Neural Network
No ratings yet
Introduction To Recurrent Neural Network
10 pages
Design Guidelines For Prompt Engineering
No ratings yet
Design Guidelines For Prompt Engineering
23 pages
DDDDD
No ratings yet
DDDDD
20 pages
Deep Learning Based Text To Image Genera
No ratings yet
Deep Learning Based Text To Image Genera
6 pages
thesis-11-51
No ratings yet
thesis-11-51
41 pages
1 RV
No ratings yet
1 RV
11 pages
Foundational Models and Architectures S1: Generative AI, #1
From Everand
Foundational Models and Architectures S1: Generative AI, #1
Leaster Startx
No ratings yet
Role Description Childcare Assessor
No ratings yet
Role Description Childcare Assessor
4 pages
Business Technology Applications Syllabus
No ratings yet
Business Technology Applications Syllabus
1 page
CSS Lesson Plan
95% (44)
CSS Lesson Plan
2 pages
(Ebook) Aptitude, Learning, and Instruction: Cognitive Process Analyses of Aptitude by Richard E. Snow & Pat-Anthony Federico & William E. Montague ISBN 9781003162865, 100316286X all chapter instant download
100% (6)
(Ebook) Aptitude, Learning, and Instruction: Cognitive Process Analyses of Aptitude by Richard E. Snow & Pat-Anthony Federico & William E. Montague ISBN 9781003162865, 100316286X all chapter instant download
71 pages
Co3 Dlp Cookery 9 - Components of Sandwich
No ratings yet
Co3 Dlp Cookery 9 - Components of Sandwich
6 pages
2016 NCAE Orientation School Level
No ratings yet
2016 NCAE Orientation School Level
28 pages
Proposal Yuspikaa
No ratings yet
Proposal Yuspikaa
28 pages
Educ630 Lesson Plan 3
No ratings yet
Educ630 Lesson Plan 3
3 pages
P Comman
No ratings yet
P Comman
25 pages
ESL STUDENT HANDBOOK
No ratings yet
ESL STUDENT HANDBOOK
5 pages
Sergia Soriano Esteban Integrated School Sped Center Junior High School Department
No ratings yet
Sergia Soriano Esteban Integrated School Sped Center Junior High School Department
2 pages
Emma Watson Speech Subtle Techniques
No ratings yet
Emma Watson Speech Subtle Techniques
13 pages
DLL Science 10 - July 1
100% (1)
DLL Science 10 - July 1
2 pages
Proposed DSG Resolution On Duke's Grading Policy
No ratings yet
Proposed DSG Resolution On Duke's Grading Policy
2 pages
Sentiment Analysis Using Bert On Yelp Restaurant Reviews
No ratings yet
Sentiment Analysis Using Bert On Yelp Restaurant Reviews
63 pages
Proposed Manual For Private Schools
100% (1)
Proposed Manual For Private Schools
177 pages
A Case Study On Gurukul System of Education: A Contemporary Approach by Gotirth Vidyapeeth
No ratings yet
A Case Study On Gurukul System of Education: A Contemporary Approach by Gotirth Vidyapeeth
24 pages
Intertropical Converging Zone
No ratings yet
Intertropical Converging Zone
5 pages
Personality Development Review N Term Quiz Answer
100% (7)
Personality Development Review N Term Quiz Answer
6 pages
EDUC - 221 Study Guide - May 21
No ratings yet
EDUC - 221 Study Guide - May 21
2 pages
Sample Lesson Plan
No ratings yet
Sample Lesson Plan
1 page
Multigrade Teaching Introduction 1212743864627450 8
No ratings yet
Multigrade Teaching Introduction 1212743864627450 8
10 pages
Life Skill SLM - 0
No ratings yet
Life Skill SLM - 0
105 pages
TRANSFORMATIVE EDUCATION - Teaching Prof
No ratings yet
TRANSFORMATIVE EDUCATION - Teaching Prof
19 pages
Assignment / Tugasan HDPS4103 Perancangan Dan Pengajaran Afektif May 2024 Semester
No ratings yet
Assignment / Tugasan HDPS4103 Perancangan Dan Pengajaran Afektif May 2024 Semester
10 pages
Education Officer Exams
No ratings yet
Education Officer Exams
489 pages
Savignon S.J. - Communicative Language Teaching_ Strategies and Goals (Before 2005)
No ratings yet
Savignon S.J. - Communicative Language Teaching_ Strategies and Goals (Before 2005)
18 pages