255 lines
11 KiB
Markdown
255 lines
11 KiB
Markdown
# Step 3: Prompt Template
|
|
|
|
Save to `prompts/cover.md`:
|
|
|
|
```markdown
|
|
---
|
|
type: cover
|
|
palette: [confirmed palette]
|
|
rendering: [confirmed rendering]
|
|
references:
|
|
- ref_id: 01
|
|
filename: refs/ref-01-{slug}.{ext}
|
|
usage: direct | style | palette
|
|
- ref_id: 02
|
|
filename: refs/ref-02-{slug}.{ext}
|
|
usage: direct | style | palette
|
|
---
|
|
|
|
# Content Context
|
|
Article title: [full original title from source]
|
|
Content summary: [2-3 sentence summary of key points and themes]
|
|
Keywords: [5-8 key terms extracted from content]
|
|
|
|
# Visual Design
|
|
Cover theme: [2-3 words visual interpretation]
|
|
Type: [confirmed type]
|
|
Palette: [confirmed palette]
|
|
Rendering: [confirmed rendering]
|
|
Font: [confirmed font]
|
|
Text level: [confirmed text level]
|
|
Mood: [confirmed mood]
|
|
Aspect ratio: [confirmed ratio]
|
|
Language: [confirmed language]
|
|
|
|
# Text Elements
|
|
[Based on text level:]
|
|
- none: "No text elements"
|
|
- title-only: "Title: [exact title from source or user]"
|
|
- title-subtitle: "Title: [title] / Subtitle: [context]"
|
|
- text-rich: "Title: [title] / Subtitle: [context] / Tags: [2-4 keywords]"
|
|
|
|
# Mood Application
|
|
[Based on mood level:]
|
|
- subtle: "Use low contrast, muted colors, light visual weight, calm aesthetic"
|
|
- balanced: "Use medium contrast, normal saturation, balanced visual weight"
|
|
- bold: "Use high contrast, vivid saturated colors, heavy visual weight, dynamic energy"
|
|
|
|
# Font Application
|
|
[Based on font style:]
|
|
- clean: "Use clean geometric sans-serif typography. Modern, minimal letterforms."
|
|
- handwritten: "Use warm hand-lettered typography with organic brush strokes. Friendly, personal feel."
|
|
- serif: "Use elegant serif typography with refined letterforms. Classic, editorial character."
|
|
- display: "Use bold decorative display typography. Heavy, expressive headlines."
|
|
|
|
# Composition
|
|
Type composition:
|
|
- [Type-specific layout and structure]
|
|
|
|
Visual composition:
|
|
- Main visual: [metaphor derived from content meaning]
|
|
- Layout: [positioning based on type and aspect ratio]
|
|
- Decorative: [palette-specific elements that reinforce content theme]
|
|
|
|
Color scheme: [primary, background, accent from palette definition, adjusted by mood]
|
|
Color constraint: Color values (#hex) and color names are rendering guidance only — do NOT display color names, hex codes, or palette labels as visible text in the image.
|
|
Rendering notes: [key characteristics from rendering definition — lines, texture, depth, element style]
|
|
Type notes: [key characteristics from type definition]
|
|
Palette notes: [key characteristics from palette definition]
|
|
|
|
[Watermark section if enabled]
|
|
|
|
[Reference images section if provided — REQUIRED, see below]
|
|
```
|
|
|
|
## Reference-Driven Design ⚠️ HIGH PRIORITY
|
|
|
|
When reference images are provided, they are the **primary visual input** and MUST strongly influence the output. The cover should look like it belongs to the same visual family as the references.
|
|
|
|
**Passing `--ref` alone is NOT enough.** Image generation models often ignore reference images unless the prompt text explicitly describes what to reproduce. Always combine `--ref` with detailed textual instructions.
|
|
|
|
## Content-Driven Design
|
|
|
|
- Article title and summary inform the visual metaphor choice
|
|
- Keywords guide decorative elements and symbols
|
|
- The skill controls visual style; the content drives meaning
|
|
|
|
## Visual Element Selection
|
|
|
|
Match content themes to icon vocabulary:
|
|
|
|
| Content Theme | Suggested Elements |
|
|
|---------------|-------------------|
|
|
| Programming/Dev | Code window, terminal, API brackets, gear |
|
|
| AI/ML | Brain, neural network, robot, circuit |
|
|
| Growth/Business | Chart, rocket, plant, mountain, arrow |
|
|
| Security | Lock, shield, key, fingerprint |
|
|
| Communication | Speech bubble, megaphone, mail, handshake |
|
|
| Tools/Methods | Wrench, checklist, pencil, puzzle |
|
|
|
|
Full library: [../visual-elements.md](../visual-elements.md)
|
|
|
|
## Type-Specific Composition
|
|
|
|
| Type | Composition Guidelines |
|
|
|------|------------------------|
|
|
| `hero` | Large focal visual (60-70% area), title overlay on visual, dramatic composition |
|
|
| `conceptual` | Abstract shapes representing core concepts, information hierarchy, clean zones |
|
|
| `typography` | Title as primary element (40%+ area), minimal supporting visuals, strong hierarchy |
|
|
| `metaphor` | Concrete object/scene representing abstract idea, symbolic elements, emotional resonance |
|
|
| `scene` | Atmospheric environment, narrative elements, mood-setting lighting and colors |
|
|
| `minimal` | Single focal element, generous whitespace (60%+), essential shapes only |
|
|
|
|
## Title Guidelines
|
|
|
|
When text level includes title:
|
|
- **Source**: Use the exact title provided by user, or extract from source content
|
|
- **Do NOT invent titles**: Stay faithful to the original
|
|
- Match confirmed language
|
|
|
|
## Watermark Application
|
|
|
|
If enabled in preferences, add to prompt:
|
|
|
|
```
|
|
Include a subtle watermark "[content]" positioned at [position].
|
|
The watermark should be legible but not distracting from the main content.
|
|
```
|
|
|
|
Reference: `config/watermark-guide.md`
|
|
|
|
## Reference Image Handling
|
|
|
|
When user provides reference images (`--ref` or pasted images):
|
|
|
|
### ⚠️ CRITICAL - Frontmatter References
|
|
|
|
**MUST add `references` field in YAML frontmatter** when reference files are saved to `refs/`:
|
|
|
|
```yaml
|
|
---
|
|
type: cover
|
|
palette: warm
|
|
rendering: flat-vector
|
|
references:
|
|
- ref_id: 01
|
|
filename: refs/ref-01-podcast-thumbnail.jpg
|
|
usage: style
|
|
---
|
|
```
|
|
|
|
| Field | Description |
|
|
|-------|-------------|
|
|
| `ref_id` | Sequential number (01, 02, ...) |
|
|
| `filename` | Relative path from prompt file's parent directory |
|
|
| `usage` | `direct` / `style` / `palette` |
|
|
|
|
**Omit `references` field entirely** if no reference files saved (style extracted verbally only).
|
|
|
|
### When to Include References in Frontmatter
|
|
|
|
| Situation | Frontmatter Action | Generation Action |
|
|
|-----------|-------------------|-------------------|
|
|
| Reference file saved to `refs/` | Add to `references` list ✓ | Pass via `--ref` parameter |
|
|
| Style extracted verbally (no file) | Omit `references` field | Describe in prompt body only |
|
|
| File path in frontmatter but doesn't exist | ERROR - fix or remove | Generation will fail |
|
|
|
|
**Before writing prompt with references, verify**: `test -f refs/ref-NN-{slug}.{ext}`
|
|
|
|
### Reference Usage Types
|
|
|
|
| Usage | When to Use | Generation Action |
|
|
|-------|-------------|-------------------|
|
|
| `direct` | Reference matches desired output closely | Pass to `--ref` parameter |
|
|
| `style` | Extract visual style characteristics only | Describe style in prompt text |
|
|
| `palette` | Extract color palette only | Include colors in prompt |
|
|
|
|
### Step 1: Analyze References
|
|
|
|
For each reference image, extract:
|
|
- **Style**: Rendering technique, line quality, texture
|
|
- **Composition**: Layout, visual hierarchy, focal points
|
|
- **Color mood**: Palette characteristics (without specific colors)
|
|
- **Elements**: Key visual elements and symbols used
|
|
|
|
### Step 2: Embed in Prompt ⚠️ CRITICAL
|
|
|
|
**Passing `--ref` alone is NOT enough.** Image generation models frequently ignore reference images unless the prompt text explicitly and forcefully describes what to reproduce. You MUST always write detailed textual instructions regardless of whether `--ref` is used.
|
|
|
|
**If file saved (with or without `--ref` support)**:
|
|
- Pass ref images via `--ref` parameter if skill supports it
|
|
- **ALWAYS** add a detailed mandatory section in the prompt body:
|
|
|
|
```
|
|
# Reference Style — MUST INCORPORATE
|
|
|
|
CRITICAL: The generated cover MUST visually reference the provided images. The cover must feel like it belongs to the same visual family.
|
|
|
|
## From Ref 1 ([filename]) — REQUIRED elements:
|
|
- [Brand element]: [Specific description of logo/wordmark treatment, e.g., "The logo uses vertical parallel lines (|||) for the letter 'm'. Reproduce this exact treatment."]
|
|
- [Signature pattern]: [Specific description, e.g., "Woven intersecting curves forming a diamond/lozenge grid pattern. This MUST appear prominently as a banner, border, or background section."]
|
|
- [Colors]: [Exact hex values, e.g., "Dark teal #2D4A3E background, cream #F5F0E0 text"]
|
|
- [Typography]: [Specific treatment, e.g., "Uppercase text with wide letter-spacing"]
|
|
- [Layout element]: [Specific spatial element, e.g., "Bottom banner strip in dark color"]
|
|
|
|
## From Ref 1 ([filename]) — Characters (if people present):
|
|
- **Character 1**: [Appearance, e.g., "Woman, long wavy blonde hair"] → MUST stylize: [e.g., "flat-vector, simplified face, keep blonde hair, label: 'Nicole Forsgren'"]
|
|
- **Character 2**: [Appearance, e.g., "Man, short dark hair, stubble"] → MUST stylize: [e.g., "flat-vector, simplified face, keep dark hair, label: 'Gergely Orosz'"]
|
|
- **Placement**: [e.g., "Right third, side by side, facing left toward main visual"]
|
|
- **Style**: Match rendering style, NOT photorealistic
|
|
|
|
## From Ref 2 ([filename]) — REQUIRED elements:
|
|
[Same detailed breakdown]
|
|
|
|
## Integration approach:
|
|
[Specific layout instruction describing how reference elements combine with the cover content, e.g., "Use a SPLIT LAYOUT: main illustration area (warm cream background) occupies ~65% of the image, while a dark teal BANNER STRIP (with the woven line pattern from Ref 2) runs along the bottom ~35%, containing branding elements from Ref 1."]
|
|
```
|
|
|
|
**Key rules**:
|
|
- Each visual element gets its own bullet with "MUST" or "REQUIRED"
|
|
- Descriptions must be **specific enough to reproduce** — not vague ("clean style")
|
|
- The integration approach must describe **exact spatial arrangement**
|
|
- After generation, verify reference elements are visible; if not, strengthen and regenerate
|
|
|
|
**If style/palette extracted verbally (NO file saved)**:
|
|
- DO NOT add references metadata to prompt
|
|
- Append extracted info directly to prompt body using the same MUST INCORPORATE format above:
|
|
|
|
```
|
|
# Reference Style — MUST INCORPORATE (extracted from visual analysis)
|
|
|
|
CRITICAL: Apply these specific visual elements extracted from the reference images.
|
|
|
|
## REQUIRED elements:
|
|
- [Same detailed bullet format as above]
|
|
|
|
## Integration approach:
|
|
[Same spatial layout instruction]
|
|
```
|
|
|
|
### Reference Analysis Template
|
|
|
|
Use this format when analyzing reference images. Extract **specific, concrete, reproducible** details — not vague summaries.
|
|
|
|
| Aspect | Analysis Points | Good Example | Bad Example |
|
|
|--------|-----------------|--------------|-------------|
|
|
| **Brand elements** | Logos, wordmarks, distinctive typography | "Logo 'm' formed by 3 vertical lines" | "Has a logo" |
|
|
| **Signature patterns** | Unique motifs, textures, geometric patterns | "Woven curves forming diamond grid" | "Has patterns" |
|
|
| **Colors** | Exact hex values or close approximations | "#2D4A3E dark teal, #F5F0E0 cream" | "Dark and light" |
|
|
| **Layout** | Spatial zones, banner placement, proportions | "Bottom 30% is dark banner with branding" | "Has a banner" |
|
|
| **Typography** | Font style, weight, case, spacing, position | "Uppercase, wide letter-spacing, right-aligned" | "Has text" |
|
|
| **Rendering** | Line quality, texture, depth treatment | "Topographic contour lines as background texture" | "Clean style" |
|
|
| **Elements** | Icon vocabulary, decorative motifs | "Geometric intersecting line ornaments at corners" | "Has decorations" |
|
|
|
|
**Output**: Each extracted element should be written as a **copy-pasteable prompt instruction** prefixed with "MUST" or "REQUIRED".
|