Skip to main content

Menu

LEVEL 0
0/5 XP
HomeAboutTopicsPricingMy VaultStats

Categories

πŸ€– Artificial Intelligence
☁️ Cloud and Infrastructure
πŸ’Ύ Data and Databases
πŸ’Ό Professional Skills
🎯 Programming and Development
πŸ”’ Security and Networking
πŸ“š Specialized Topics
HomeAboutTopicsPricingMy VaultStats
LEVEL 0
0/5 XP
GitHub
Β© 2026 CheatGridβ„’. All rights reserved.
Privacy PolicyTerms of UseAboutContact

Stable Diffusion Cheat Sheet

Stable Diffusion Cheat Sheet

Back to Generative AI
Updated 2026-04-05
Next Topic: Structured Output Generation with LLMs Cheat Sheet

Stable Diffusion is an open-source latent diffusion model family that generates images from text descriptions. The original SD 1.x/2.x series uses a U-Net backbone operating in a VAE-compressed latent space guided by CLIP text encoders; SD 3 and later shifted to a Multimodal Diffusion Transformer (MMDiT) architecture with triple text-encoder conditioning (CLIP-L, OpenCLIP-G, T5-XXL). Competing models like Flux.1 (Black Forest Labs) apply a similar transformer-based flow-matching design. Understanding generation parametersβ€”from CFG scale to sampling schedulersβ€”gives precise control over output, while extensions like ControlNet, LoRA, and IP-Adapter enable advanced customization without retraining the full model.

What This Cheat Sheet Covers

This topic spans 20 focused tables and 149 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Core Model VersionsTable 2: Generation ModesTable 3: Key Generation ParametersTable 4: Sampling Methods (Schedulers)Table 5: Prompt Engineering TechniquesTable 6: ControlNet ModelsTable 7: Fine-Tuning TechniquesTable 8: Advanced ExtensionsTable 9: VAE (Variational Autoencoder)Table 10: UI PlatformsTable 11: Hardware RequirementsTable 12: Model File FormatsTable 13: Upscaling MethodsTable 14: Common Artifacts & FixesTable 15: Aspect Ratios & ResolutionsTable 16: Popular Community ModelsTable 17: Prompt Modifiers by CategoryTable 18: Negative Prompt EssentialsTable 19: CLIP Text EncoderTable 20: Latent Diffusion Process

Table 1: Core Model Versions

ModelExampleDescription
SD 1.5
runwayml/stable-diffusion-v1-5
β€’ 512Γ—512 base resolution
β€’ 983M parameters
β€’ largest LoRA/extension/embedding ecosystem
β€’ fastest on low VRAM
SDXL 1.0
stabilityai/stable-diffusion-xl-base-1.0
β€’ 1024Γ—1024 native resolution
β€’ 3.5B parameters
β€’ dual text encoders (CLIP ViT-L + OpenCLIP ViT-bigG)
β€’ optional refiner model
Flux.1 [dev]
black-forest-labs/FLUX.1-dev
β€’ 12B parameter flow-matching transformer (Black Forest Labs)
β€’ guidance-distilled; excels at text rendering and photorealism
β€’ 20–50 steps
β€’ non-commercial license
Flux.1 [schnell]
black-forest-labs/FLUX.1-schnell
β€’ Step-distilled Flux variant; 1–4 steps for rapid generation
β€’ Apache 2.0 license
β€’ slight quality tradeoff vs dev
Flux.1 Kontext [dev]
Flux Kontext dev (12B)
β€’ In-context image editing model (Black Forest Labs, May 2025)
β€’ edits existing images via text instructions
β€’ maintains character/style consistency across edits
SD 3.5 Large
stabilityai/stable-diffusion-3-5-large
β€’ 8B parameters; MMDiT with CLIP-L + OpenCLIP-G + T5-XXL
β€’ requires 18GB+ VRAM (GGUF ~12GB)
β€’ best prompt adherence in Stability lineup
SD 3.5 Medium
stabilityai/stable-diffusion-3-5-medium
β€’ 2.5B parameters
β€’ optimized for 8–12GB VRAM
β€’ MMDiT architecture with improved attention gates

More in Generative AI

  • Speech-to-Text (ASR) Models Cheat Sheet
  • Structured Output Generation with LLMs Cheat Sheet
  • Advanced RAG Patterns and Optimization Cheat Sheet
  • Chain-of-Thought Reasoning Cheat Sheet
  • Knowledge Distillation Cheat Sheet
  • MCP Servers Implementation Cheat Sheet
View all 77 topics in Generative AI