Text-to-Image Prompting Cheat Sheet

Updated 2026-03-17

Next Topic: Text-to-Speech (TTS) Synthesis Cheat Sheet

Text-to-image prompting is the practice of crafting natural language instructions to guide AI image generation models like Stable Diffusion, Midjourney, DALL-E, and Flux in creating visual content. It sits at the intersection of linguistic precision and creative direction, where word choice, syntax structure, and parameter tuning directly shape the output. Effective prompting transforms vague ideas into detailed, controllable visuals by leveraging techniques like weighting, negative prompts, style modifiers, and compositional keywords. The core insight: prompts aren't just descriptions—they're structured instructions that map semantic meaning to visual features. Understanding how models tokenize, process, and weight prompt components lets you move from random experimentation to reproducible, high-quality generation.

What This Cheat Sheet Covers

This topic spans 20 focused tables and 211 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Core Prompt ElementsTable 2: Prompt Syntax and StructureTable 3: Negative Prompting TechniquesTable 4: Style and Aesthetic KeywordsTable 5: Composition and FramingTable 6: Camera and Photography TermsTable 7: Lighting KeywordsTable 8: Color and Mood KeywordsTable 9: Material and Texture KeywordsTable 10: Technical Generation ParametersTable 11: Sampling MethodsTable 12: Model-Specific Techniques (Stable Diffusion)Table 13: Model-Specific Techniques (Midjourney)Table 14: Model-Specific Techniques (DALL-E 3)Table 15: Advanced Control TechniquesTable 16: Image-to-Image TechniquesTable 17: Iterative Refinement StrategiesTable 18: Batch Generation and ConsistencyTable 19: Platform-Specific TipsTable 20: Common Mistakes to Avoid

Table 1: Core Prompt Elements

Element	Example	Description
Subject	`a golden retriever puppy`	• The primary focus of the image—what the AI should generate • most important element and typically placed first for maximum attention
Action/Pose	`running through a field`	• Describes what the subject is doing • adds dynamic movement and narrative context to static subjects
Environment/Setting	`in a misty forest at dawn`	• The context or location where the subject exists • establishes spatial relationships and background elements
Style Modifier	`digital art, concept art style`	• Specifies the artistic approach or movement • drastically changes the rendering style and aesthetic feel
Lighting	`soft golden hour lighting`	• Describes illumination and shadows • critically affects mood, depth, and three-dimensionality

Table 1: Core Prompt Elements

Element	Example	Description
Subject	`a golden retriever puppy`	• The primary focus of the image—what the AI should generate • most important element and typically placed first for maximum attention
Action/Pose	`running through a field`	• Describes what the subject is doing • adds dynamic movement and narrative context to static subjects
Environment/Setting	`in a misty forest at dawn`	• The context or location where the subject exists • establishes spatial relationships and background elements
Style Modifier	`digital art, concept art style`	• Specifies the artistic approach or movement • drastically changes the rendering style and aesthetic feel
Lighting	`soft golden hour lighting`	• Describes illumination and shadows • critically affects mood, depth, and three-dimensionality