How to Make Images with ChatGPT for Beginners

TLDR;

This video provides a step-by-step guide on how to use ChatGPT to create stunning and realistic AI images, even without prior experience. It covers essential aspects such as crafting effective prompts, refining image styles, using photography terms, creating realistic images of people, and addressing ethical considerations.

Crafting effective prompts using a specific formula.
Refining image styles by specifying art styles or eras.
Using photography terms to control lighting and depth of field.
Creating realistic images of people by detailing their characteristics and emotions.
Addressing ethical considerations to avoid deception, harm, or copyright infringement.

Intro [0:00]

The video introduces a guide for writers, marketers, communications professionals, and content creators on using Chat GPT to generate realistic AI images. The creator, Aga, who runs an online education business, promises to share use cases, examples, and less obvious tips. She encourages viewers to subscribe for more content on AI, content creation, and productivity.

Prompting formula [0:47]

To generate an image in Chat GPT, you need to enter a prompt, which is a description of the desired image. A basic formula for an effective prompt includes the subject, scene or setting, style, and details. Using a detailed prompt results in a more specific and less random image compared to a basic prompt. For example, a basic prompt like "create an image of coffee" yields a random image, while a detailed prompt like "a steaming cup of coffee on a rustic wooden table, morning sunlight coming through a window in soft cinematic lighting" provides a more specific and visually appealing result.

Image formats [2:11]

When creating an image for a specific piece of content, you can specify the dimensions in your prompt. For example, you can request a square image for Instagram or a wide image for a blog header. By including format details in the prompt, you can avoid additional editing after the image is generated. For instance, prompting "a square image of a vibrant colorful smoothie bowl with fruit and seeds, tropical vibes styled like a trendy Instagram post" will generate an image with the specified format and style. If you're not satisfied with the result, you can ask Chat GPT to create several alternatives to save time.

Refining the style [3:07]

You can refine the style of an image by specifying the art style or era in your prompt. Using the same prompt for a smoothie bowl, you can generate different images by specifying styles such as a 1970s magazine ad or a children's book illustration. This allows you to create images that align with your brand and desired aesthetic. Specifying the style and era helps Chat GPT produce diverse images, providing you with more options to choose from.

Using photography terms [3:56]

Using basic photography terms in your prompts can help create a specific feel for your images. Terms like "shallow depth of field" can create a blurry background, while "backlit" or "golden hour" can specify the type of lighting. For a more polished look, you can use "soft studio lighting." For example, a prompt like "a woman reading on a couch in natural window light, shallow depth of field, candid photo style" will generate an image with natural lighting and a blurry background, resembling a candid photo. Familiarizing yourself with photography terminology ensures you get the desired look and feel in your AI-generated images.

Images of people [4:41]

When creating images of people, it's better to create a character with specific details rather than just asking for a picture of a person. Adding parameters to your prompt can significantly change the outcome. For example, instead of just saying "a woman standing outside," you can specify "a 30s something South Asian woman with curly hair wearing casual jeans and a white linen blouse standing outside a small bookshop smiling naturally." This provides more detail about the character's appearance, clothing, and environment, resulting in a less random and more specific image. Grounding the character in a real environment, such as a local market or a messy living room, can also make the image more realistic.

Making images more realistic [7:31]

AI-generated images tend to be perfect, lacking imperfections that make them look fake. To make images more realistic, introduce imperfections in your prompts. For example, ask for "dirty old trainers" or "a cafe with slightly messy tables in the background." Real life is messy, and your audience is more likely to relate to images that reflect this. Similarly, AI often generates images of conventionally good-looking people. Modify your prompt to request someone who is not conventionally good-looking or specify features like thinning hair or uneven skin. Anchoring images in culture or time, such as "a cozy autumn street in Brooklyn with brownstone buildings" or "a millennial woman working on a laptop in a cafe 2020 style," can also enhance realism and familiarity.

Ethical considerations [9:08]

Before creating images with Chat GPT, consider the ethical implications. Avoid using AI to deceive your audience or create harmful, misleading, or copyright-infringing images. Be transparent about using AI-generated images, especially in a commercial context. Ethical considerations are crucial to ensure responsible and honest use of AI in image creation.