Text-to-Image Prompting Cheat Sheet


Text-to-image prompting is the practice of crafting natural language instructions to guide AI image generation models like Stable Diffusion, Midjourney, FLUX.2, GPT-4o, and Google Imagen 4 in creating visual content. It sits at the intersection of linguistic precision and creative direction, where word choice, syntax, and parameter tuning directly shape the output. Effective prompting transforms vague ideas into detailed, controllable visuals by leveraging techniques like weighting, negative prompts, style modifiers, JSON structuring, and compositional keywords. The core insight: prompts aren't just descriptions, they're structured instructions that map semantic meaning to visual features. Understanding how each model interprets, tokenizes, and weights prompt components lets you move from random experimentation to reproducible, high-quality results.

What This Cheat Sheet Covers

This topic spans 13 focused tables and 152 indexed concepts. Below is a complete table-by-table outline, from foundational concepts through advanced details.

Table 1: Core Prompt Elements
Table 2: Prompt Syntax and Structure
Table 3: Negative Prompting Techniques
Table 4: Style and Aesthetic Keywords
Table 5: Composition and Framing
Table 6: Camera and Photography Terms
Table 7: Lighting Keywords
Table 8: Color and Mood Keywords
Table 9: Material and Texture Keywords
Table 10: Technical Generation Parameters
Table 11: Sampling Methods
Table 12: Model-Specific Techniques (Stable Diffusion)
Table 13: Model-Specific Techniques (Midjourney)

Table 1: Core Prompt Elements

The fundamental vocabulary of image prompting: every strong prompt is assembled from these elements, and placing the most important element first tends to improve output quality across the major models.

Element	Example	Description
Subject	`a golden retriever puppy`	The primary focus of the image — most important element, placed first for maximum model attention
Style Modifier	`digital art, concept art style`	Specifies artistic approach or movement; drastically changes rendering aesthetic and feel
Lighting	`soft golden hour lighting`	Describes illumination and shadows; critically affects mood, depth, and three-dimensionality
Composition	`rule of thirds, low angle shot`	Describes framing and camera perspective; controls visual balance and focal point placement
Environment/Setting	`in a misty forest at dawn`	The context or location where the subject exists; establishes spatial relationships and background elements
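
As a rough illustration of how the elements in Table 1 combine, the sketch below joins them into a single comma-separated prompt with the subject first. The `PromptElements` class and `build_prompt` function are hypothetical names introduced here, not part of any model's API, and the ordering of the elements after the subject is an assumption rather than a rule any model enforces.

```python
from dataclasses import dataclass


@dataclass
class PromptElements:
    """Hypothetical container for the five core elements in Table 1."""
    subject: str
    style: str = ""
    lighting: str = ""
    composition: str = ""
    environment: str = ""


def build_prompt(elements: PromptElements) -> str:
    """Join the non-empty elements into one prompt string, subject first."""
    # The order after the subject is an illustrative assumption.
    ordered = [
        elements.subject,
        elements.environment,
        elements.style,
        elements.lighting,
        elements.composition,
    ]
    # Skip elements that were left empty so no stray commas appear.
    return ", ".join(part.strip() for part in ordered if part.strip())


prompt = build_prompt(PromptElements(
    subject="a golden retriever puppy",
    style="digital art, concept art style",
    lighting="soft golden hour lighting",
    composition="rule of thirds, low angle shot",
    environment="in a misty forest at dawn",
))
print(prompt)
```

Keeping each element in its own field makes it easy to vary one element at a time (for example, swapping only the lighting) while holding the rest of the prompt constant.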
