JimLiu-baoyu-skills/skills/baoyu-cover-image/references/workflow/reference-images.md

3.3 KiB

Reference Image Handling

Guide for processing user-provided reference images in cover generation.

Input Detection

Input Type Action
Image file path provided Copy to refs/ → can use --ref
Image in conversation (no path) ASK user for file path with AskUserQuestion
User can't provide path Extract style/palette verbally → append to prompt (NO frontmatter references)

CRITICAL: Only add references to prompt frontmatter if files are ACTUALLY SAVED to refs/ directory.

File Saving

If user provides file path:

  1. Copy to refs/ref-NN-{slug}.{ext} (NN = 01, 02, ...)
  2. Create description: refs/ref-NN-{slug}.md
  3. Verify files exist before proceeding

Description File Format:

---
ref_id: NN
filename: ref-NN-{slug}.{ext}
usage: direct | style | palette
---
[User's description or auto-generated description]
Usage When to Use
direct Reference matches desired output closely
style Extract visual style characteristics only
palette Extract color scheme only

Verbal Extraction (No File)

When user can't provide file path:

  1. Analyze image visually, extract: colors, style, composition
  2. Create refs/extracted-style.md with extracted info
  3. DO NOT add references to prompt frontmatter
  4. Append extracted style/colors directly to prompt text

Deep Analysis ⚠️ CRITICAL

References are high-priority inputs. Extract specific, concrete, reproducible elements:

Analysis Description Example (good vs bad)
Brand elements Logos, wordmarks, specific typography Good: "Logo uses vertical parallel lines for 'm'" / Bad: "Has a logo"
Signature patterns Unique decorative motifs, textures Good: "Woven intersecting curves forming diamond grid" / Bad: "Has patterns"
Color palette Exact hex values for key colors Good: "#2D4A3E dark teal, #F5F0E0 cream" / Bad: "Dark and light colors"
Layout structure Specific spatial arrangement Good: "Bottom 30% dark banner with branding" / Bad: "Has a banner"
Typography Font style, weight, spacing, case Good: "Uppercase, wide letter-spacing" / Bad: "Has text"
Content/subject What the reference depicts Factual description
Usage recommendation direct / style / palette Based on analysis

Output format: List each element as bullet that can be copy-pasted into prompt as mandatory instruction.

Verification Output

For saved files:

Reference Images Saved:
- ref-01-{slug}.png ✓ (can use --ref)
- ref-02-{slug}.png ✓ (can use --ref)

For extracted style:

Reference Style Extracted (no file):
- Colors: #E8756D coral, #7ECFC0 mint...
- Style: minimal flat vector, clean lines...
→ Will append to prompt text (not --ref)

Priority Rules

When user provides references, they are HIGH PRIORITY:

  • References override defaults: If reference conflicts with preferred palette/rendering, reference takes precedence
  • Concrete > abstract: Extract specific elements — not vague "clean style"
  • Mandatory language: Use "MUST", "REQUIRED" in prompt for reference elements
  • Visible in output: Verify elements are present after generation; strengthen prompt if not