A picture is worth a thousand words for an LLM

· Source: How I AI · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Intermediate, quick

Summary

Generating high-quality images starts with establishing a visual direction, often through a mood board built in a platform like Pinterest or Cosmos. The mood board captures the desired aesthetic, such as a "pink and cute, not super girly, very internet kind of coded" vibe. Users can then bring the mood board into Midjourney either by copying and pasting the images directly or by adding them as style references (srefs, set with Midjourney's --sref parameter). Style references tell Midjourney to adopt the overall style, coloring, camera treatment, and general atmosphere of the reference images. The initial phase is about gathering information: observe how the AI interprets both the mood board and specific text prompts such as "beautiful female model" and "astronaut", then use what you see to refine the output.

Key takeaway

Creative technologists aiming for a specific visual aesthetic in AI-generated images should start by curating a detailed mood board. This gives the AI a clear visual blueprint and makes the output more likely to match the intended look. Experiment with both direct image pasting and style references in tools like Midjourney to see which method best translates your desired "vibe" into the final image.

Key insights

Mood boards and style references are crucial for guiding AI image generation towards a specific aesthetic.

Method

Create a mood board in Pinterest or Cosmos, then bring it into Midjourney either by pasting the images directly or by adding them as style references (srefs, via the --sref parameter) to guide style, color, and camera treatment. Follow up with iterative text prompting.
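
As a rough illustration of the method, the sketch below only assembles the prompt text you would paste into Midjourney; it does not call Midjourney itself. It assumes the mood board images are reachable at URLs (the example.com addresses are placeholders), and the helper name build_prompt is hypothetical. The --sref and --sw (style weight) parameters are Midjourney's own; everything else is an assumption made for the example.

```python
from typing import Optional, Sequence


def build_prompt(
    subject: str,
    sref_urls: Sequence[str],
    style_weight: Optional[int] = None,
) -> str:
    """Compose prompt text: the subject, then --sref followed by image URLs.

    Hypothetical helper for illustration; it only builds a string.
    """
    if not sref_urls:
        raise ValueError("at least one style reference URL is needed")
    parts = [subject.strip(), "--sref", *sref_urls]
    if style_weight is not None:
        # --sw controls how strongly the style references are applied.
        parts += ["--sw", str(style_weight)]
    return " ".join(parts)


# Placeholder mood board image URLs (not real assets).
MOOD_BOARD = [
    "https://example.com/moodboard/pink-01.png",
    "https://example.com/moodboard/pink-02.png",
]

# Keep the style references fixed and vary only the subject, so any
# difference between results comes from the text prompt.
prompts = [
    build_prompt(subject, MOOD_BOARD)
    for subject in ("beautiful female model", "astronaut")
]

for prompt in prompts:
    print(prompt)
```

Holding the references constant while changing one subject at a time matches the information-gathering phase described in the summary: it makes it easier to tell what the mood board contributes and what the text prompt contributes.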

Best for: Prompt Engineer, Product Designer, Creative Technologist

Editorial summary, takeaway, and curation by AIssential. Original article published by How I AI.