Generative Ai & Creative Applications
Generative Ai & Creative Applications
MANAGED BY
SEMINAR REPORT
ON
Submitted By
PRANAV.T
2201150559
MANAGED BY
CERTIFICATE
This is to certify that this seminar report is the bonafide record of the Seminar presented by
PRANAV.T (2201150559) on for the partial fulfillment of the requirement for the award of
Diploma in COMPUTER HARDWARE ENGINEERING of Technical Education, KeralaState
(2024-2025).
It is with great enthusiasm and the learning spirit that I bring out this seminar report. I also feel
that it is the right opportunity to acknowledge the support and guidance that came in from various
quarters during the course of the completion of my seminar.
I am extremely grateful to Mr. Sajith.P, Principal, for providing the necessary facilities,
excellent infrastructure together with awesome environment to complete the seminar.
I would like to express my sincere gratitude to Mrs.Sajna.N.P , Head of Department for her
encouragement and guidance.
I express my sincere gratitude to Mrs.Divya.P.M, Seminar guide for her encouragement and
guidance.
I also thank all the staff members of the department for extending their helping hands to make
this project a success.
I would also like to thank all my friends and my parents who have prayed and helped me during
my work
ABSTRACT
In recent years, advances in artificial intelligence (AI) and machine learning have given rise
to powerful new tools and methods for creative practitioners. 2022–2023 in particular saw an
explosion in generative AI tools, models and use cases. Noting the long history of critical arts
engaging with AI, this chapter considers both the application of generative AI in the creative
industries, and ways in which artists co-shape the development of these emerging technologies. After
reviewing the landscape of generative AI in visual arts, music and games, we propose four areas of
critical interest for the future co-shaping of generative AI and creative practice in the areas of
communities and open source, deeper engagement with AI, beyond the human and cultural feedbacks.
CONTENTS
1 INTRODUCTION 1
3 GENERATIVE AI TIMELINE 5
4 KEY MILESTONES 6
5 TYPES OF GENERATIVE AI 7
6 APPLICATIONS 9
6.3 Manufacturing 10
7 BENEFITS 13
8 LIMITATIONS 15
9 FUTURE OF GENERATIVE AI 17
10 CONCLUSION 21
11 REFERENCES 22
LIST OF FIGURES
4.2 AI in Gameplay 6
CHAPTER 1
INTRODUCTION
The last half a century has seen advances across a range of technological domains, including
artificial intelligence (AI), as well as in new imaging and immersive techniques. The last decade in
particular has seen major break-throughs in machine learning, and recent developments in diffusion
models and large language models have given rise to powerful and widely accessible generative
AI tools. In 2022–2023 AI-powered image generators and chat-bot assistants have exploded into
the mainstream and the public conscious-ness, with some declaring a “golden age for AI art”
(Faber, 2022). This has led to unprecedented opportunities for artistic creation but also profound
concerns about the implications for professional artists and society at large.
These AI capabilities can underpin new forms of creative practice and fuel transformative
experiences for audiences across the creative industries, including performing arts, visual arts,
music, museums and heritage, games, film/TV, digital media, advertising and creative design. A
comprehensive review of some of the key creative AI technologies and their uses can be found in
Anantrasirichai and Bull (2022). Extending to “(i) content creation, (ii) information analysis, (iii)
content enhancement and post production workflows, (iv) information extraction and enhancement,
and (v) data com-pression” (Anantrasirichai and Bull, 2022, p.589), the wide-reaching scope
of these technologies is challenging to engage with, both for creators as well as their audiences.
While we are wary of hype cycles, this is a moment in which many creators are exploring
the implications of AI for their own practice (Cremer et al., 2023) and voicing their perspectives
on the profound upheavals that these developments bring (Hemment et al., 2023a). We see
important changes in human–computer creativity. Authorship and audience experiences are
becoming ever more digital, networked, algorithmic and complex. Conversational agents, virtual
characters, interactive robots and other autonomous technologies are increasingly becoming part
of creative content. This transition goes beyond the simple adoption of new formats or
technologies: we are entering into a whole new context for making, sharing, learning, connecting
and consuming.
With new capabilities come new challenges. The complex algorithms of AI are often black-
boxed, with their operations and assumptions not accessible to human understanding. The outputs
of the new generation of platform-based tools, such as recently released text-to-image generators,
appear like ‘magic’, with little scope for human intervention or creative control. Often, the only
creative input is through a text prompt, and the generative models that underpin the current tools
are largely trained on massive datasets scraped from the internet without permission or fair pay for
the original content creators (Wu, 2020). Other urgent concerns include excessive energy use,
harmful bias and misinformation. With the release of ever more powerful generative AI tools,
issues that were previously considered niche concerns within data science have become
increasingly mainstream ethical minefields.
At times of social and economic upheaval, artists are often at the fore-front, helping to
illuminate the ways emerging technologies impact on life at a profound level (Ibid.). A specific
interest in this chapter is critical arts, or critical AI arts, where creative practitioners directly engage
in the politics, ethics and philosophy of AI, and in its capacity to challenge and inform as well as
to delight audiences (Hemment et al., 2022). Artists expose and explore the sublime, the
indefinable, what we can’t put into words (Ingram, 2023), and the outputs of the statistical lens of
AI are often uncanny and preternatural, beyond what is normal or natural (Hemment et al., 2019).
Artists are currently pushing at the boundaries of human–machine creativity to generate works that
combine machine learning methods with human intuition and embodied experience.
In this chapter we provide a brief overview of creative practice concerning generative AI,
with a focus on practical examples in visual art, music and games to highlight some priorities for
emergent areas of study. This chapter argues that it is necessary to equip cultural producers and
artists to negotiate political, legal, security, ethical and environmental controversies and challenges
in emerging technologies and formats and to develop best practices.
The case study associated with this chapter offers more examples of how this can be done
in practice. In it we describe the work of the research group and creative community The New
Real,1 a joint initiative of the University of Edinburgh and the Alan Turing Institute, in which some
of the authors of this chapter are involved and which is intimately concerned with the previous
challenges and themes. The New Real has the twin ambitions of supporting the creation of
significant new art and inspiring new concepts and paradigms for fair and inclusive AI, which it
advances through its novel research theme, experiential AI (Hemment et al., 2019).
CHAPTER 2
WHAT IS GENERATIVE AI?
Generative AI (Gen AI) is a type of artificial intelligence technology that can produce various
types of content, including text, imagery, audio and synthetic data. The recent buzz around generative
AI has been driven by the simplicity of new user interfaces for creating high-quality text, graphics
and videos in a matter of seconds.
The technology, it should be noted, is not brand-new. Generative AI was introduced in the
1960s in chatbots. But it was not until 2014, with the introduction of generative adversarial networks,
or GANs -- a type of machine learning algorithm -- that generative AI could create convincingly
authentic images, videos and audio of real people.
On the one hand, this newfound capability has opened up opportunities that include better
movie dubbing and rich educational content. It also unlocked concerns about deepfakes -- digitally
forged images or videos -- and harmful cybersecurity attacks on businesses, including nefarious
requests that realistically mimic an employee's boss.
Two additional recent advances that will be discussed in more detail below have played a
critical part in generative AI going mainstream: transformers and the breakthrough language models
they enabled. Transformers are a type of machine learning that made it possible for researchers to
train ever-larger models without having to label all of the data in advance. New models could thus be
trained on billions of pages of text, resulting in answers with more depth. In addition, transformers
unlocked a new notion called attention that enabled models to track the connections between words
across pages, chapters and books rather than just in individual sentences. And not just words:
Transformers could also use their ability to track connections to analyze code, proteins, chemicals and
DNA.
The rapid advances in so-called large language models (LLMs) -- i.e., models with billions or
even trillions of parameters -- have opened a new era in which generative AI models can write
engaging text, paint photorealistic images and even create somewhat entertaining sitcoms on the fly.
Moreover, innovations in multimodal AI enable teams to generate content across multiple types of
media, including text, graphics and video. This is the basis for tools like Dall-E that automatically
create images from a text description or generate text captions from images.
These breakthroughs notwithstanding, we are still in the early days of using generative AI to
create readable text and photorealistic stylized graphics. Early implementations have had issues with
accuracy and bias, as well as being prone to hallucinations and spitting back weird answers. Still,
progress thus far indicates that the inherent capabilities of this generative AI could fundamentally
change enterprise technology how businesses operate. Going forward, this technology could help
write code, design new drugs, develop products, redesign business processes and transform supply
chains.
CHAPTER 3
GENERATIVE AI TIMELINE
CHAPTER 4
KEY MILESTONES
Fig 4.1
GAN
➢ In 2017: AlphaGo AI generates music and art, demonstrating creative capabilities beyond
gameplay.
Fig 4.2
AI in Gameplay
➢ In 2018: Generative AI used in film and video production, enabling new forms of content
creation and editing.
Fig 4.3
AI in film production
CHAPTER 5
TYPES OF GENERATIVE AI
Generative AI encompasses various models and techniques that aim to generate new data or
content that resembles human-created data. There are several types of generative AI models, each
with its own unique approach to generating content. Some of the most prominent types include:
GANs consist of two neural networks, the generator and the discriminator, that
compete against each other in a game-like setup. The generator generates synthetic data (e.g.,
images, text) from random noise, while the discriminator’s task is to distinguish between real
and fake data. The generator aims to create increasingly realistic data to deceive the
discriminator, while the discriminator improves its ability to differentiate real from generated
data. Through this competition, GANs are capable of generating highly realistic content, and
they have been successfully used in image synthesis, art creation, and video generation.
VAEs are generative models that learn to encode data into a latent space and then
decode it back to reconstruct the original data. They learn probabilistic representations of the
input data, allowing them to generate new samples from the learned distribution. VAEs are
commonly used in image generation tasks and have also been applied to text & audio
generation.
RNNs are a type of neural network that processes sequential data, such as natural
language sentences or time-series data. They can be used for generative tasks by predicting
the next element in the sequence given the previous elements. However, RNNs are limited in
generating long sequences due to the vanishing gradient problem. More advanced variants of
RNNs, such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), have
been developed to address this limitation.
Transformers, like the GPT series, have gained significant popularity in natural
language processing and generative tasks. They use attention mechanisms to model the
relationships between different elements in a sequence effectively. Transformers are
parallelizable and can handle long sequences, making them well-suited for generating coherent
and contextually relevant text.
These are just some of the types of generative AI models, and there is ongoing research and
development in this field, leading to the emergence of new and more advanced generative models
over time.
CHAPTER 6
APPLICATIONS
information. It can also predict the rest of the code a developer begins to type, much like how
auto-complete works while texting on a smartphone.
• Translate programming languages: Generative AI can be a tool for developers to interact
with software without needing a programming language. The generative AI would act as a
translator.
• Automate testing: Developers can improve their automated testing processes using
generative AI to highlight potential problems and execute testing sequences faster than other
AI methods. Generative AI can learn the logic of the software and how users will interact with
it and create test cases to demonstrate various user scenarios.
6.5 Financial services
According to McKinsey, generative AI could add $200 billion to $340 billion of value to the banking
industry annually. Some of the applications of generative AI in the financial services industry include
artificial intelligence investment strategies, drafting documentation and monitoring regulatory
changes, and using generative AI as an interpreter to facilitate communications between clients and
investors.
• Create investment strategies: Generative AI can recommend the best investments according
to your or your client’s goals. This technology can find and execute trades much faster than
human investors and can do so within the parameters you set for the kind of transaction you
want.
• Communicate and educate clients and investors: Financial services professionals
sometimes need to communicate complex information to clients and colleagues. Generational
AI can provide hyperpersonalized customer service without adding more customer service
professionals.
• Quickly draft documentation and monitor regulation: Generative AI can monitor
regulatory activity, keep you informed of any changes, and create drafts of documents such as
investment research or insurance policies.
6.6 Media and entertainment
Media and entertainment could embrace generative AI in several ways, considering the industry
primarily engages in the same task as the tech: generating unique content. Generative AI can help
create and edit visual content, create short highlight videos of sporting events, and make working with
content management systems easier.
• Create audio and visual content: Generative AI can create new video content from scratch.
This tech can also help you make visual content faster by creating visual effects, adding
graphics, or streamlining editing.
• Generate highlights for sports and events: When it comes to sporting and live events, gen
AI can create highlight reels instantly and allow fans to create their own custom highlights.
For example, fans could generate highlights of a particular play or a tournament series.
• Manage tags for better content management: Generative AI can tag and index extensive
media libraries, making locating the files you need at any time easier. Similar to our
manufacturing example above, generative AI allows using conversational language to find the
information or media you’re looking for in a complex media library.
CHAPTER 7
BENEFITS
One of the most significant advantages of generative AI is its ability to create creative and original
content. Whether its generating art, writing stories, or composing music, generative AI systems
like GPT-3 have demonstrated remarkable creative capabilities. This can be a game-changer for
content creators, as it can assist in generating ideas, saving time, and even producing high-quality
content autonomously.
Generative AI has revolutionized natural language processing. Chatbots, virtual assistants, and
automated content generation are just a few examples of how NLP powered by generative AI can
streamline communication and improve efficiency. Businesses can leverage these technologies to
provide better customer service, automate routine tasks, and gain deeper insights from text data.
In the healthcare sector, generative AI has made significant strides in medical diagnosis and drug
discovery. Deep learning models can analyze medical images, such as X-rays and MRI scans, to
detect diseases and abnormalities more accurately and quickly than human experts. Additionally,
generative AI is helping researchers discover new drugs and explore potential treatments by
simulating molecular interactions.
6. Content Translation
Language barriers are a significant obstacle in our increasingly globalized world. Generative AI,
especially in the form of neural machine translation models, has made significant advancements in
breaking down these barriers. Services like Google Translate use generative AI to provide accurate
and context-aware translations, facilitating communication and fostering cross-cultural
understanding.
Generative AI has entered the world of art and design, producing captivating works of digital art
and even influencing architectural design. Artists and architects are using generative algorithms to
explore new creative horizons, generating intricate patterns, sculptures, and structures that were
previously unimaginable.
CHAPTER 8
LIMITATIONS
Generative AI, while impressive, has its limitations. It can produce content that appears
realistic but may lack true comprehension or context understanding. This can result in the
generation of incorrect or biased information. Moreover, ethical concerns regarding the misuse of
generative AI for generating fake news, deep fakes, or other deceptive content are significant
challenges. The quality and coherence of generated output can also vary, and controlling these
aspects remains an ongoing challenge in the field. Additionally, the substantial computational
resources required for training and running generative AI models can be a limitation for some
applications. The limitations of generative artificial intelligence are as follows.
1.Ethical Concerns
Generative AI raises ethical concerns related to the creation and dissemination of fake content.
Deep fake videos, fake news articles, and forged documents created by AI models can deceive and
manipulate individuals, leading to misinformation, identity theft, and privacy breaches. Addressing
these concerns requires robust regulations and ethical guidelines.
AI models, including generative ones, can inherit biases present in their training data. This can
result in discriminatory outcomes, perpetuating biases against certain groups. Developers must be
vigilant in mitigating bias and ensuring that AI systems are fair and equitable.
3.Lack of Creativity
While generative AI can produce creative content, it lacks genuine creativity and understanding.
AI-generated art, for example, may lack the depth of human expression and emotion. This
limitation can be especially noticeable in creative fields where a deep understanding of human
experiences is crucial.
4.Data Dependency
Generative AI models require vast amounts of data to function effectively. They rely on patterns
and information from their training data to generate content. This means that they may struggle in
situations where data is scarce or unrepresentative, limiting their usefulness in niche domains.
5.Energy Consumption
Training large generative AI models consumes a significant amount of energy and computing
resources. The environmental impact of AI, especially in data centers, has raised concerns about
sustainability. Researchers are actively working on more energy-efficient AI models, but this
remains a significant limitation.
6.Cost of Development
Building and training generative AI models can be prohibitively expensive. Access to high-quality
data, powerful hardware, and expertise in machine learning is not readily available to all
organizations and individuals. This creates a digital divide and limits the democratization of AI
technology.
7.Security Risks
Generative AI can be used maliciously to automate cyberattacks, such as phishing and social
engineering. Attackers can leverage AI to craft convincing messages and infiltrate systems, making
it challenging for traditional security measures to detect and defend against these threats.
8.Uncanny Valley
In some cases, generative AI may produce content that falls into the “uncanny valley,” where it
appears almost human but not quite right. This can be unsettling and impact user trust and
acceptance, especially in applications like customer service chatbots or virtual assistants.
CHAPTER 9
FUTURE OF GENERATIVE AI
These anticipated advancements in LLMs suggest a future where generative AI not only
comprehends language intricacies at an advanced level but also responds with a heightened
contextual awareness, contributing to more accurate, contextually relevant, and nuanced content
creation. The ongoing development of LLMs is instrumental in pushing the boundaries
of generative AI, promising a future where language models can interact with users in a manner
that mirrors a deeper comprehension of context, subtleties, and the intricacies of human
communication.
2. Multimodal AI Integration
convergence of textual, visual, and auditory elements. The integration of multiple modalities not
only enhances the versatility of generative systems but also opens new avenues for creating rich,
immersive content experiences that transcend traditional boundaries and engage users across
various sensory channels.
The overarching goal is to provide users with a more personalized and tailored experience,
enabling them to exert greater influence over the nature and characteristics of the content generated
by generative AI systems. This trend reflects a commitment to accommodating individual
preferences, promoting user agency, and enhancing the adaptability of generative technologies to
meet diverse user needs and expectations.
In generative AI, a critical imperative is Ethical and Bias Mitigation. Looking forward,
there is a pressing need to confront ethical concerns and proactively address biases in generative
outputs. Developers are expected to prioritize the implementation of safeguards to prevent the
unintentional amplification of biases inherent in training data.
The result is expected to be more precise, tailored, and domain-specific content generation.
By aligning closely with the intricacies of particular industries, these specialized generative models
aim to deliver outputs that are not only contextually accurate but also attuned to the unique
requirements and intricacies of the targeted domain, marking a significant step towards enhanced
relevance and applicability in diverse professional sectors.
6. Real-Time Applications
Future tools are expected to bridge the gap between human ingenuity and AI capabilities,
offering a harmonious collaboration that amplifies creative processes and yields novel, imaginative
outcomes. This evolution represents a transformative approach to creative workflows,
where generative AI becomes an integral and synergistic part of the collaborative creative journey.
Thus for Generative AI, the future holds immense promise for reshaping technological and
societal paradigms. This future is not just about technological advances but also about integrating
AI into our daily lives and industries in a more personalized, ethical, and comprehensive manner.
CHAPTER 10
CONCLUSION
Generative AI holds immense promise for the future across various industries, promising
exceptional levels of productivity, efficiency, and innovation. As this technology continues to
advance, it will reshape how businesses operate.
Adopting Generative AI represents not just a technological advancement, but a more liberating
creative expression towards art. While challenges remain, like ensuring responsible development and
addressing potential biases, the vast potential benefits outweigh them. In making sure that Generative
AI technology not replacing humans but enhancing human capabilities, we can use this cutting-edge
technology for the betterment of society.
Every technology has two aspects. How powerful AI will become and whether it will replace
human creativity is a matter of future discussion. However, it is certain now that using this technology,
business owners can streamline many operations. If you have an idea, we are happy to turn your idea
into digital reality with our expert AI/ML development services. As a top-rated AI development
company, we offer cutting-edge generative AI software development services."
CHAPTER 11
REFERENCES
• https://bigid.com/blog/unveiling-6-types-of-generative-ai/
• https://www.researchgate.net/publication/379679735_On_creative_practice_and_generative_AI
• https://www.ftc.gov/system/files/ftc_gov/pdf/12-15-2023AICEStaffReport.pdf
• https://www.coursera.org/articles/generative-ai-applications
• https://www.gartner.com/en/topics/generative-ai
• https://en.wikipedia.org/wiki/Generative_artificial_intelligence