Educ-AI-tion Week takeover: How To Generate Consistent Images Using AI

 By Dave Holloway

Generating images using AI tools such as Adobe Firefly is an increasingly popular method of creating visual materials, however consistency remains an issue as it is challenging to create sets of images that work together as a set.

To a certain degree, image prompting is a science and finding a consistent prompt which can generate consistent images requires some trial and error, but it is achievable.

This article was originally written as advice to a JISC project on AI image generation and a question around comic books art, but the theory remains no matter what your subject matter.


I really liked one of the images and I generated a full version of that, and then I uploaded it back into my image generation tool and asked it to describe the picture as a prompt. Not all tools have a describe feature, but you can quite easily find a tool which does by Googling 'generate prompt from image'.

The prompt that was created was

Prompt: ‘a humanoid rabbit with long ears, wearing tactical gear, stands in the ruins of an ancient city. This is an ink drawing in the style of Katsuhiro Otomo and Jean Giraud, presented in black and white’

By generating a prompt this way I am ensuring that it is the clearest description of the image that my generation tool recognises. I should now be able to create more images in a similar style. I refer to this as my base prompt.

I then generated more images using the base prompt and simply changed the focal point of the image. In the following examples, the text in bold is the text that I changed for each new prompt.

Prompt: ‘A humanoid rabbit with long ears, wearing tactical gear, fights henchmen in the centre of an abandoned shopping mall. This is an ink drawing in the style of Katsuhiro Otomo and Jean Giraud, presented in black and white.



Prompt: A humanoid rabbit with long ears, wearing tactical gear, embraces a lost child on the rooftop of a Japanese skyscraper. This is an ink drawing in the style of Katsuhiro Otomo and Jean Giraud, presented in black and white.


Prompt: A humanoid rabbit with long ears, wearing tactical gear, watches TV and eats pizza in a dressing gown. This is an ink drawing in the style of Katsuhiro Otomo and Jean Giraud, presented in black and white.


As you can see, by keeping the base text prompt and only changing the actions and locations, I managed to generate images which look like came from the same illustrator and belong together as part of the same project.

After that, by adding a few more variances and a little bit of text work on Photoshop, I even have a cover for my comic book.


Example prompt: ‘A humanoid rabbit with long ears, wearing tactical gear, holds an American flag aloft in a post-apocalyptic wasteland. This is an ink drawing in the style of Katsuhiro Otomo and Jean Giraud, presented in vibrant 1950s colour. Comic cover art"


The key with image generation is to spend lots of time experimenting with different prompt structures, influences, styles and ideas until you get something which looks as closely as possible as to what you imagined.

Once you have that initial first image, using it to generate a base prompt will help you ensure consistency across all future generations.  Keeping the base prompt and only changing key action or location text means you have a much greater chance of keeping your aesthetic consistent.

A version of this blog post originally appeared on the Good Practice Highlights website.

Dave Holloway is a Senior Digital Learning Advisor in the Digital Learning Team.